What's New
lexicalConceptualResource

Description:
MorfFlex CZ 2.0 is the Czech morphological dictionary developed originally by Jan Hajič as a spelling checker and lemmatization dictionary. MorfFlex is a flat list of lemma-tag-wordform triples. For each wordform, full ...
This item contains 1 file (234.84
MB).
Publicly Available




corpus

Description:
A richly annotated and genre-diversified language resource, The Prague Dependency Treebank – Consolidated 1.0 (PDT-C 1.0, or PDT-C in short in the sequel) is a consolidated release of the existing PDT-corpora of Czech ...
This item contains 1 file (2.6
GB).
Publicly Available




lexicalConceptualResource

Description:
CzeDLex 0.7 is the third development version of the Lexicon of Czech discourse connectives. The lexicon contains connectives partially automatically extracted from the Prague Discourse Treebank 2.0 (PDiT 2.0) and, as a ...
This item contains 2 files (1.1
MB).
Publicly Available




Most Viewed Items
Top Last Week
corpus

Description:
Universal Dependencies is a project that seeks to develop cross-linguistically consistent treebank annotation for many languages, with the goal of facilitating multilingual parser development, cross-lingual learning, and ...
This item contains 3 files (479.93
MB).
Publicly Available


corpus

Description:
A richly annotated and genre-diversified language resource, The Prague Dependency Treebank – Consolidated 1.0 (PDT-C 1.0, or PDT-C in short in the sequel) is a consolidated release of the existing PDT-corpora of Czech ...
This item contains 1 file (2.6
GB).
Publicly Available




corpus

Description:
Training and development data for the WMT17 QE task. Test data will be published as a separate item.
This shared task will build on its previous five editions to further examine automatic methods for estimating the ...
This item contains 8 files (90.25
MB).
Publicly Available