Czech and English abstracts of ÚFAL papers

Please use the following text to cite this item:
Rosa, Rudolf, 2016, Czech and English abstracts of ÚFAL papers, LINDAT/CLARIAH-CZ digital library at the Institute of Formal and Applied Linguistics (ÚFAL), http://hdl.handle.net/11234/1-1731.
Date issued
2016-06-12
Size
1556 entries, 12000 sentences, 200000 words
Language(s)
Czech, English
Description
This is a document-aligned parallel corpus of English and Czech abstracts of scientific papers published by authors from the Institute of Formal and Applied Linguistics, Charles University in Prague, as recorded in the institute's Biblio system. For each publication, the authors are obliged to provide both the original abstract, in Czech or English, and its translation into the other language. No filtering was performed, except that entries missing the Czech or the English abstract were removed, and newline and tab characters were replaced with spaces.
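The cleaning described above can be sketched in a few lines of Python. This is only an illustration, not the script used to build the corpus: the dictionary keys `cs` and `en`, the helper names `clean_field` and `filter_entries`, and the choice to treat whitespace-only abstracts as missing are all assumptions made for the example.

```python
import re

def clean_field(text):
    """Replace each newline or tab character with a single space."""
    return re.sub(r"[\n\t]", " ", text)

def filter_entries(entries):
    """Keep only entries that have both abstracts, with newlines and tabs replaced.

    `entries` is a list of dicts with hypothetical keys "cs" and "en".
    Treating a whitespace-only abstract as missing is an assumption.
    """
    kept = []
    for entry in entries:
        cs = entry.get("cs") or ""
        en = entry.get("en") or ""
        if not cs.strip() or not en.strip():
            continue  # drop entries missing the Czech or the English abstract
        kept.append({"cs": clean_field(cs), "en": clean_field(en)})
    return kept

# Made-up sample data, not taken from the corpus.
sample = [
    {"cs": "První řádek.\nDruhý řádek.", "en": "First line.\tSecond line."},
    {"cs": "", "en": "No Czech abstract."},
    {"cs": "Bez anglického abstraktu.", "en": None},
]
result = filter_entries(sample)
```

Replacing newlines and tabs is what makes a one-entry-per-line, tab-separated layout safe, since neither character can then appear inside a field.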
Version History

Version  Date                 Summary
         2022-11-11 00:00:00
1*       2016-06-12 00:00:00
* Selected version
This item is Publicly Available and licensed under:
 Files in this item
Name
publications.tsv
Size
1.39 MB
Format
application/octet-stream
Description
Unknown
MD5
7b46974782ea692d80f2b7b4a78306fe
Name
xml2tsv.pl
Size
1.08 KB
Format
application/octet-stream
Description
Unknown
MD5
4dade72664c05943b583ba7df279b9f6
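The MD5 values listed for the files can be used to check a download. The following is a minimal sketch using Python's standard `hashlib` module; the helper names `md5_of_file` and `verify`, and the assumption that the files are saved locally under their listed names, are illustrative only.

```python
import hashlib

# Checksums as listed for the files in this item.
EXPECTED = {
    "publications.tsv": "7b46974782ea692d80f2b7b4a78306fe",
    "xml2tsv.pl": "4dade72664c05943b583ba7df279b9f6",
}

def md5_of_file(path, chunk_size=65536):
    """Return the hexadecimal MD5 digest of the file at `path`."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        # Read in chunks so large files need not fit in memory.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()

def verify(path, name):
    """Return True if the file at `path` matches the listed checksum for `name`."""
    return md5_of_file(path) == EXPECTED[name]
```

For example, `verify("publications.tsv", "publications.tsv")` should return `True` for an intact download saved under that local path.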