Czech Translation of SQuAD 2.0 and 1.1
Please use the following text to cite this item or export to a predefined format:
Macková, Kateřina and Straka, Milan, 2020,
Czech Translation of SQuAD 2.0 and 1.1, LINDAT/CLARIAH-CZ digital library at the Institute of Formal and Applied Linguistics (ÚFAL),
http://hdl.handle.net/11234/1-3249.
Authors
Item identifier
Referenced by
Date issued
2020-07-01
Size
117933 items
Language(s)
Description
The Czech translation of SQuAD 2.0 and SQuAD 1.1 datasets contains automatically translated texts, questions and answers from the training set and the development set of the respective datasets.
The test set is missing, because it is not publicly available.
The data is released under the CC BY-NC-SA 4.0 license.
If you use the dataset, please cite the following paper (the exact format was not available during the submission of the dataset): Kateřina Macková and Straka Milan: Reading Comprehension in Czech via Machine Translation and Cross-lingual Transfer, presented at TSD 2020, Brno, Czech Republic, September 8-11 2020.
Acknowledgement
Grantová agentura České republiky
Project code:GX20-16819X
Project name:LUSyD – Language Understanding: from Syntax to Discourse
Ministerstvo školství, mládeže a tělovýchovy České republiky
Project code:LM2018101
Project name:LINDAT/CLARIAH-CZ: Digitální výzkumná infrastruktura pro jazykové technologie, umění a humanitní vědy
Univerzita Karlova (mimo GAUK)
Project code:SVV 260 575
Project name:Specifický vysokoškolský výzkum
Subject(s)
Collections
This item isPublicly Available
and licensed under:
Files in this item
- Name
- czech-squad.zip
- Size
- 19.56 MB
- Format
- application/zip
- Description
- Czech Translation of SQuAD 2.0 and 1.1
- MD5
- 32d7b5ae6daf4856a6c3924ba0610e05

- squad-1.1-cs
- train-v1.1.json27 MB
- dev-v1.1.json4 MB
- squad-2.0-cs
- train-v2.0.json37 MB
- dev-v2.0.json4 MB
-
- LICENSE20 kB
- README.md1 kB

