dc.contributor.author | Šebesta, Karel |
dc.contributor.author | Bedřichová, Zuzanna |
dc.contributor.author | Šormová, Kateřina |
dc.contributor.author | Štindlová, Barbora |
dc.contributor.author | Hrdlička, Milan |
dc.contributor.author | Hrdličková, Tereza |
dc.contributor.author | Hana, Jiří |
dc.contributor.author | Petkevič, Vladimír |
dc.contributor.author | Jelínek, Tomáš |
dc.contributor.author | Škodová, Svatava |
dc.contributor.author | Janeš, Petr |
dc.contributor.author | Lundáková, Kateřina |
dc.contributor.author | Skoumalová, Hana |
dc.contributor.author | Sládek, Šimon |
dc.contributor.author | Pierscieniak, Piotr |
dc.contributor.author | Toufarová, Dagmar |
dc.contributor.author | Straka, Milan |
dc.contributor.author | Rosen, Alexandr |
dc.contributor.author | Náplava, Jakub |
dc.contributor.author | Poláčková, Marie |
dc.date.accessioned | 2019-10-02T09:22:18Z |
dc.date.available | 2019-10-02T09:22:18Z |
dc.date.issued | 2019-09-27 |
dc.identifier.uri | http://hdl.handle.net/11234/1-3057 |
dc.description | AKCES-GEC is a grammatical error correction corpus for Czech generated from a subset of AKCES. It contains train, dev and test files annotated in M2 format. Compared with the CZESL-GEC dataset, this dataset provides separate edits together with their type annotations in M2 format and has twice as many sentences. If you use this dataset, please use the following citation: @article{naplava2019wnut, title={Grammatical Error Correction in Low-Resource Scenarios}, author={N{\'a}plava, Jakub and Straka, Milan}, journal={arXiv preprint arXiv:1910.00353}, year={2019} } |
dc.language.iso | ces |
dc.publisher | Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL) |
dc.relation.isreferencedby | https://arxiv.org/abs/1910.00353 |
dc.relation.replaces | http://hdl.handle.net/11234/1-2143 |
dc.rights | Creative Commons - Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0) |
dc.rights.uri | http://creativecommons.org/licenses/by-nc-sa/4.0/ |
dc.subject | natural language correction |
dc.subject | grammatical error correction |
dc.subject | gec |
dc.title | AKCES-GEC Grammatical Error Correction Dataset for Czech |
dc.type | corpus |
metashare.ResourceInfo#ContentInfo.mediaType | text |
dc.rights.label | PUB |
has.files | yes |
branding | LINDAT / CLARIAH-CZ |
contact.person | Milan Straka straka@ufal.mff.cuni.cz Charles University, UFAL |
contact.person | Jakub Náplava naplava@ufal.mff.cuni.cz Charles University, UFAL |
sponsor | Ministerstvo školství, mládeže a tělovýchovy České republiky LM2015071 LINDAT/CLARIN: Institut pro analýzu, zpracování a distribuci lingvistických dat nationalFunds |
sponsor | Grantová agentura České republiky GAČR 16-10185S Čeština nerodilých mluvčích z pohledu teoretického a komputačního / Non-native Czech from the Theoretical and Computational Perspective nationalFunds |
size.info | 47371 sentences |
size.info | 11 files |
size.info | 505275 words |
files.size | 3534547 |
files.count | 1 |
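The description states that the train, dev and test files are annotated in M2 format. The following is a minimal sketch of reading such a file, assuming the usual M2 conventions: an `S` line holds the tokenized source sentence, and each following `A` line holds one edit with fields separated by `|||` (token span, error type, correction, two auxiliary fields, annotator id). The names `Edit`, `Sentence`, `parse_m2` and `SAMPLE` are hypothetical, and the sample sentences and error-type labels are illustrative only, not taken from the corpus.

```python
from dataclasses import dataclass, field
from typing import List


@dataclass
class Edit:
    start: int        # index of the first token covered by the edit
    end: int          # index one past the last covered token
    error_type: str   # type annotation of the edit
    correction: str   # replacement text for the covered span
    annotator: int    # annotator id (last field of the A line)


@dataclass
class Sentence:
    tokens: List[str]
    edits: List[Edit] = field(default_factory=list)


def parse_m2(text: str) -> List[Sentence]:
    """Parse M2-formatted text into sentences with their edits."""
    sentences: List[Sentence] = []
    for raw in text.splitlines():
        line = raw.strip()
        if line.startswith("S "):
            # A new source sentence starts a new record.
            sentences.append(Sentence(tokens=line[2:].split()))
        elif line.startswith("A ") and sentences:
            # An edit line belongs to the most recent sentence.
            fields = line[2:].split("|||")
            start, end = (int(x) for x in fields[0].split())
            sentences[-1].edits.append(
                Edit(start, end, fields[1], fields[2], int(fields[-1]))
            )
    return sentences


# Illustrative input only (assumption): not real corpus data.
SAMPLE = """S This are a sentence .
A 1 2|||R:VERB:SVA|||is|||REQUIRED|||-NONE-|||0

S All good here .
A -1 -1|||noop|||-NONE-|||REQUIRED|||-NONE-|||0
"""

parsed = parse_m2(SAMPLE)
for sent in parsed:
    print(" ".join(sent.tokens), [(e.start, e.end, e.error_type) for e in sent.edits])
```

In this convention a sentence without errors carries a single `noop` edit with the span `-1 -1`, so downstream code that applies corrections would typically skip edits whose type is `noop`.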
Files in this item
License category: Publicly Available
License: Creative Commons - Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0)
- Name: AKCES-GEC.zip
- Size: 3.37 MB
- Format: application/zip
- Description: corpus data and metadata, zipped
- MD5: 84eb88aa9e0ec2de7626c3336d2fe005
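The MD5 value listed for the archive can be used to check a download for corruption. The sketch below assumes the archive was saved locally as `AKCES-GEC.zip`; the helper names `md5_of_file` and `verify_download` are hypothetical. MD5 is suitable here only as an integrity check, not as protection against tampering.

```python
import hashlib

# Checksum as listed in this record.
EXPECTED_MD5 = "84eb88aa9e0ec2de7626c3336d2fe005"


def md5_of_file(path: str, chunk_size: int = 1 << 20) -> str:
    """Return the hex MD5 digest of a file, reading it in chunks."""
    digest = hashlib.md5()
    with open(path, "rb") as fh:
        for chunk in iter(lambda: fh.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def verify_download(path: str, expected: str = EXPECTED_MD5) -> bool:
    """Compare the file's digest with the expected checksum."""
    return md5_of_file(path) == expected.lower()
```

A typical call would be `verify_download("AKCES-GEC.zip")`, which returns `True` when the local file matches the listed checksum.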