|

Corpora

Below is the list of corpora in the TEITOK/Kontext hybrid set-up, hosted at the UFAL institute. To get a larger list of TEITOK projects, see the TEITOK project page. A larger list of Kontext corpora at the UFAL institute can be found in the Kontext corpus list, or in the repository. For corpora that have previous in TEITOK, you can click on the version number to see all versions of the corpus.


AcronymLatestToken sizeCorpus TypeCorpus StatusCorpus Language(s)
infoPDT-C1.04MTreebankstableCzech
infoParCzech PS72.012MParliamentary corpusstableCzech
infoSkript 2015400kLearner CorpusliveCzech
infoUniversal Dependencies2.726MTreebankstableMany

5 results - showing 1-100 - click on a value to reduce selection - click on a column to sort - Search