dc.contributor.author | Šebesta, Karel |
dc.contributor.author | Bedřichová, Zuzanna |
dc.contributor.author | Šormová, Kateřina |
dc.contributor.author | Štindlová, Barbora |
dc.contributor.author | Hrdlička, Milan |
dc.contributor.author | Hrdličková, Tereza |
dc.contributor.author | Hana, Jiří |
dc.contributor.author | Petkevič, Vladimír |
dc.contributor.author | Jelínek, Tomáš |
dc.contributor.author | Škodová, Svatava |
dc.contributor.author | Poláčková, Marie |
dc.contributor.author | Janeš, Petr |
dc.contributor.author | Lundáková, Kateřina |
dc.contributor.author | Skoumalová, Hana |
dc.contributor.author | Sládek, Šimon |
dc.contributor.author | Pierscieniak, Piotr |
dc.contributor.author | Toufarová, Dagmar |
dc.contributor.author | Richter, Michal |
dc.contributor.author | Straka, Milan |
dc.contributor.author | Rosen, Alexandr |
dc.date.accessioned | 2014-07-27T17:06:49Z |
dc.date.available | 2014-07-27T17:06:49Z |
dc.date.issued | 2014-07-27 |
dc.identifier.uri | http://hdl.handle.net/11234/1-162 |
dc.description | Essays written by non-native learners of Czech, a part of AKCES/CLAC – Czech Language Acquisition Corpora. CzeSL-SGT stands for Czech as a Second Language with Spelling, Grammar and Tags. Extends the “foreign” (ciz) part of AKCES 3 (CzeSL-plain) by texts collected in 2013. Original forms and automatic corrections are tagged, lemmatized and assigned erros labels. Most texts have metadata attributes (30 items) about the author and the text. In addition to a few minor bugs, fixes a critical issue in Release 1: the native speakers of Ukrainian (s_L1:"uk") were wrongly labelled as speakers of "other European languages" (s_L1_group="IE"), instead of speakers of a Slavic language (s_L1_group="S"). The file is now a regular XML document, with all annotation represented as XML attributes. |
dc.language.iso | ces |
dc.publisher | Charles University |
dc.relation.replaces | http://hdl.handle.net/11858/00-097C-0000-0023-95B1-E |
dc.rights | Attribution-ShareAlike 3.0 Unported (CC BY-SA 3.0) |
dc.rights.uri | http://creativecommons.org/licenses/by-sa/3.0/ |
dc.source.uri | http://utkl.ff.cuni.cz/learncorp/ |
dc.subject | learner corpus |
dc.subject | Czech as a foreign language |
dc.subject | Czech language acquisition corpora |
dc.subject | AKCES |
dc.subject | non-native speakers |
dc.subject | second language acquistion |
dc.title | AKCES 5 (CzeSL-SGT) Release 2 |
dc.type | corpus |
metashare.ResourceInfo#ContactInfo#PersonInfo.surname | Rosen |
metashare.ResourceInfo#ContactInfo#PersonInfo.givenName | Alexandr |
metashare.ResourceInfo#ContactInfo#PersonInfo#OrganizationInfo.organizationName | Charles University in Prague, ÚTKL |
metashare.ResourceInfo#DistributionInfo.availability | notAvailable |
metashare.ResourceInfo#DistributionInfo#LicenseInfo.restrictionsOfUse | attribution |
metashare.ResourceInfo#DistributionInfo#LicenseInfo.restrictionsOfUse | shareAlike |
metashare.ResourceInfo#ContentInfo.mediaType | text |
metashare.ResourceInfo#TextInfo#SizeInfo.size | 958000 |
metashare.ResourceInfo#TextInfo#SizeInfo.sizeUnit | words |
metashare.ResourceInfo#ContactInfo#PersonInfo#OrganizationInfo#CommunicationInfo.email | alexandr.rosen@ff.cuni.cz |
dc.rights.label | PUB |
has.files | yes |
branding | LINDAT / CLARIAH-CZ |
size.info | 958000 words |
files.size | 15770234 |
files.count | 2 |
Files in this item
Download all files in item (15.04 MB)This item is
Attribution-ShareAlike 3.0 Unported (CC BY-SA 3.0)
Publicly Available
and licensed under:Attribution-ShareAlike 3.0 Unported (CC BY-SA 3.0)
- Name
- 2014-czesl-sgt-en-all-v2.zip
- Size
- 14.94 MB
- Format
- application/zip
- Description
- corpus data and metadata, zipped
- MD5
- e8a81fa41fb911af47ec5e29640546ab
- Name
- 2014-czesl-sgt-en-v2.pdf
- Size
- 104.48 KB
- Format
- Description
- description of the corpus
- MD5
- 2ec4d8f27b22b73f324814577fe097aa