Czech-English Parallel Corpus 1.0 (CzEng 1.0)
- Title:
- Czech-English Parallel Corpus 1.0 (CzEng 1.0)
- Creator:
- Bojar, Ondřej, Žabokrtský, Zdeněk, Dušek, Ondřej, Galuščáková, Petra, Majliš, Martin, Mareček, David, Maršík, Jiří, Novák, Michal, Popel, Martin, and Tamchyna, Aleš
- Contributor:
- European Union: FP7-ICT-2007-3-231720, EuroMatrix Plus (EU funds; info:eu-repo/grantAgreement/EC/FP7/231720)
- Ministerstvo školství, mládeže a tělovýchovy České republiky: 7E09003, EuroMatrixPlus – Bringing Machine Translation for European Languages to the User (national funds)
- Ministerstvo školství, mládeže a tělovýchovy České republiky: 7E11051, EuroMatrixPlus - Enlarged European Union Bringing Machine Translation for European Languages to the User (national funds)
- European Union: FP7-ICT-2009-4-247762, Faust (EU funds; info:eu-repo/grantAgreement/EC/FP7/247762)
- Ministerstvo školství, mládeže a tělovýchovy České republiky: 7E11041, Feedback Analysis for User Adaptive Statistical Translation (national funds)
- Grantová agentura České republiky: GPP406/10/P259, Hybridní frázový a hloubkově-syntaktický strojový překlad (national funds)
- Grantová agentura Univerzity Karlovy v Praze: GAUK 116310/2010, Anglicko-český strojový překlad s využitím hloubkové syntaxe (national funds)
- Grantová agentura Univerzity Karlovy v Praze: GAUK 4226/2011, Využití koreference ve strojovém překladu (national funds)
- Publisher:
- Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL)
- Identifier:
- http://hdl.handle.net/11234/1-1458
- Subject:
- corpus, parallel corpus, treebank, and alignment
- Type:
- text and corpus
- Description:
- CzEng 1.0 is the fourth release of a sentence-parallel Czech-English corpus compiled at the Institute of Formal and Applied Linguistics (ÚFAL) and freely available for non-commercial research purposes. CzEng 1.0 contains 15 million parallel sentences (233 million English and 206 million Czech tokens) from seven different types of sources, automatically annotated at surface and deep (a- and t-) layers of syntactic representation. Supported by EuroMatrix Plus (FP7-ICT-2007-3-231720 of the EU and 7E09003+7E11051 of the Ministry of Education, Youth and Sports of the Czech Republic), Faust (FP7-ICT-2009-4-247762 of the EU and 7E11041 of the Ministry of Education, Youth and Sports of the Czech Republic), GAČR P406/10/P259, GAUK 116310, and GAUK 4226/2011.
- Language:
- Czech and English
- Rights:
- Attribution-NonCommercial-ShareAlike 3.0 Unported (CC BY-NC-SA 3.0)
- http://creativecommons.org/licenses/by-nc-sa/3.0/
- PUB
- Relation:
- http://lindat.mff.cuni.cz/services/kontext/run.cgi/first_form?corpname=czeng_10_cs_a
- info:eu-repo/grantAgreement/EC/FP7/231720
- info:eu-repo/grantAgreement/EC/FP7/247762
- http://hdl.handle.net/11858/00-097C-0000-0001-4916-9
- Source:
- http://ufal.mff.cuni.cz/czeng
- Harvested from:
- LINDAT/CLARIAH-CZ repository
- Metadata only:
- false
- Date:
- 2011-12