What's New
lexicalConceptualResource

Description:
Data collection has been done by the means of Sketch Engine program.
Data were extrapolated from the annotated English web corpus enTenTen20.
Data collection and analysis has been done during the period of two months: ...
This item contains 1 file (18.01
KB).
Publicly Available




corpus

Description:
AlbNER is a Named Entity Recognition corpus of Wikipedia sentences in Albanian, consisting of 900 records. The sentence tokens are manually labeled complying with the CoNLL-2003 shared task annotation scheme explained at ...
This item contains 2 files (50.39
KB).
Publicly Available


corpus

Description:
The goal of the Uniform Meaning Representation (UMR) project is to design a meaning representation that can be used to annotate the semantic content of a text. UMR is primarily based on Abstract Meaning Representation ...
This item contains 1 file (3.02
MB).
Publicly Available

Most Viewed Items
Top Last Week
corpus

Description:
Universal Dependencies is a project that seeks to develop cross-linguistically consistent treebank annotation for many languages, with the goal of facilitating multilingual parser development, cross-lingual learning, and ...
This item contains 3 files (598.92
MB).
Publicly Available


clip

Description:
Segment from Československý zvukový týdeník Aktualita (Czechoslovak Aktualita Sound Newsreel) 1941, issue no. 40, captures events linked to the accession of SS-Obergruppenführer Reinhard Heydrich to the office of Deputy ...
This item contains 2 files (3.05
GB).
Publicly Available




corpus

Description:
AlbNER is a Named Entity Recognition corpus of Wikipedia sentences in Albanian, consisting of 900 records. The sentence tokens are manually labeled complying with the CoNLL-2003 shared task annotation scheme explained at ...
This item contains 2 files (50.39
KB).
Publicly Available

