LINDAT/CLARIAH-CZ Repository Home

What's New

lexicalConceptualResource

Author(s):

Description:

Czech OOV Inflection Dataset is a Czech inflection dataset of nouns, focused on evaluation in out-of-vocabulary (OOV) conditions. It consists of two parts: a standard lemma-disjoint train-dev-test split of a subset of noun ...

This item contains 1 file (17.08 MB).

Publicly Available Distributed under Creative Commons

lexicalConceptualResource

LINDAT / CLARIAH-CZ

Mapping Czech Verbal Valency to PropBank Argument Labels: LREC2024 - verification data

Author(s):

Hajič, Jan ; Fučíková, Eva ; Lopatková, Markéta and Urešová, Zdeňka

Description:

Mapping table for the article Hajič et al., 2024: Mapping Czech Verbal Valency to PropBank Argument Labels, in LREC-COLING 2024, as preprocess by the algorithm described in the paper. This dataset i smeant for verification ...

This item contains 1 file (4.26 MB).

Publicly Available Distributed under Creative Commons

corpus

LINDAT / CLARIAH-CZ

Coreference in Universal Dependencies 1.2 (CorefUD 1.2)

Description:

CorefUD is a collection of previously existing datasets annotated with coreference, which we converted into a common annotation scheme. In total, CorefUD in its current version 1.2 consists of 25 datasets for 16 languages. ...

This item contains 1 file (83.66 MB).

Publicly Available Distributed under Creative Commons

Most Viewed Items

Top Last Week

toolService

LRT + Open Submissions

Estonian Text-to-Speech Synthesiser for the Blind

Author(s):

Unknown author

This item contains no files.

corpus

LRT + Open Submissions

BABEL Estonian Database

Author(s):

Meister, Einar

Description:

The database consists of three sets: - Many Talker Set: 30 males, 30 females; each to read 50 numbers, 1-2 connected passages, 1 block of "filler" sentences, and 1 block of syllables. - Few Talker Set: 4 males, 4 females; ...

This item contains no files.

toolService

LRT + Open Submissions

Parole frequency list

Author(s):

Unknown author

Description:

frequency list of the Parole corpus, 1 339 787 words

This item contains no files.

Linguistic Data and NLP Tools

Find

Citation Support (with Persistent IDs)

Deposit Free and Safe

License of your Choice (Open licenses encouraged)

Easy to Find

Easy to Cite

What's New

Most Viewed Items