What's New
lexicalConceptualResource

Description:
The valency lexicon PDT-Vallex 4.0 has been built in close connection with the annotation of the Prague Dependency Treebank project (PDT) and its successors (mainly the Prague Czech-English Dependency Treebank project, ...
This item contains 1 file (1.61
MB).
Publicly Available




corpus

Description:
The Sequoia corpus is a set of 3,099 linguistically-annotated French sentences, originating from four sources (Europarl, European Agency Reports, French regional journal L'Est Républicain, and French wikipedia).
Several ...
This item contains 1 file (4.37
MB).
Publicly Available


lexicalConceptualResource

Description:
MorfFlex CZ 2.0 is the Czech morphological dictionary developed originally by Jan Hajič as a spelling checker and lemmatization dictionary. MorfFlex is a flat list of lemma-tag-wordform triples. For each wordform, full ...
This item contains 1 file (234.84
MB).
Publicly Available




Most Viewed Items
Top Last Week
corpus

Description:
Universal Dependencies is a project that seeks to develop cross-linguistically consistent treebank annotation for many languages, with the goal of facilitating multilingual parser development, cross-lingual learning, and ...
This item contains 3 files (479.93
MB).
Publicly Available


languageDescription

Description:
Automatic segmentation, tokenization and morphological and syntactic annotations of raw texts in 45 languages, generated by UDPipe (http://ufal.mff.cuni.cz/udpipe), together with word embeddings of dimension 100 computed ...
This item contains 47 files (629.67
GB).
Publicly Available




lexicalConceptualResource

Description:
The valency lexicon PDT-Vallex 4.0 has been built in close connection with the annotation of the Prague Dependency Treebank project (PDT) and its successors (mainly the Prague Czech-English Dependency Treebank project, ...
This item contains 1 file (1.61
MB).
Publicly Available



