dc.contributor.author Kučera, Karel
dc.contributor.author Řehořková, Anna
dc.contributor.author Stluka, Martin
dc.date.accessioned 2024-02-01T21:14:24Z
dc.date.available 2024-02-01T21:14:24Z
dc.date.issued 2015-12-18
dc.identifier.uri http://hdl.handle.net/11234/1-5413
dc.description Diakorp v6 is a diachronic corpus of Czech comprising 3.45 million words (i.e. 4.1 million tokens) in 116 texts from the 14th to the 20th century. The texts are transcribed, not transliterated. The corpus is provided in a CoNLL-U-like vertical format used as input to the Manatee query engine. The data thus correspond to the corpus available to registered CNC users via the KonText query interface at http://www.korpus.cz
dc.language.iso ces
dc.publisher Charles University, Faculty of Arts, Institute of the Czech National Corpus
dc.rights Creative Commons - Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0)
dc.rights.uri http://creativecommons.org/licenses/by-nc-sa/4.0/
dc.source.uri https://wiki.korpus.cz/doku.php/en:cnk:diakorp
dc.subject corpus
dc.subject diachronic
dc.subject Czech
dc.title Diakorp v6: diachronic corpus of Czech
dc.type corpus
metashare.ResourceInfo#ContentInfo.mediaType text
dc.rights.label PUB
has.files yes
branding LINDAT / CLARIAH-CZ
contact.person Michal Křen michal.kren@ff.cuni.cz Charles University, Faculty of Arts, Institute of the Czech National Corpus
sponsor Ministerstvo školství, mládeže a tělovýchovy LM2011023 Český národní korpus nationalFunds
size.info 3450000 words
files.size 9657405
files.count 1


 Files in this item

Name
diakorp_v6.gz
Size
9.21 MB
Format
application/x-gzip
Description
Unknown
MD5
cd67d27a33a55d56548505cf73857b00
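
The MD5 value above can be used to check a downloaded copy of the file. The following is a minimal sketch, assuming the archive has been saved locally; the function names `md5_of_file` and `matches_record` are hypothetical helpers introduced here for illustration, and only the checksum and byte size are taken from this record.

```python
import hashlib

# Values copied from this record (MD5 field and files.size).
EXPECTED_MD5 = "cd67d27a33a55d56548505cf73857b00"
EXPECTED_SIZE = 9657405  # bytes


def md5_of_file(path, chunk_size=1 << 20):
    """Return the hex MD5 digest of a file, reading it in chunks."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        # Read fixed-size chunks until read() returns an empty bytes object.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_record(path, expected_md5=EXPECTED_MD5):
    """Hypothetical helper: True if the file's MD5 equals the expected value."""
    return md5_of_file(path) == expected_md5.lower()
```

Calling `matches_record("diakorp_v6.gz")` on a local copy (the path is an assumption about where the file was saved) should return `True` for an intact download. A mismatch suggests the file is incomplete or corrupted.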