Nově přidané

 lexicalConceptualResource 
lexicalConceptualResource
Popis:
Czech OOV Inflection Dataset is a Czech inflection dataset of nouns, focused on evaluation in out-of-vocabulary (OOV) conditions. It consists of two parts: a standard lemma-disjoint train-dev-test split of a subset of noun ...
 Tento záznam obsahuje 1 soubor (17.08 MB).
 
Publicly Available Distributed under Creative Commons Attribution Required Noncommercial Share Alike
 lexicalConceptualResource 
lexicalConceptualResource
Popis:
Mapping table for the article Hajič et al., 2024: Mapping Czech Verbal Valency to PropBank Argument Labels, in LREC-COLING 2024, as preprocess by the algorithm described in the paper. This dataset i smeant for verification ...
 Tento záznam obsahuje 1 soubor (4.26 MB).
 
Publicly Available Distributed under Creative Commons Attribution Required Noncommercial No Derivative Works
 corpus 
corpus
Popis:
CorefUD is a collection of previously existing datasets annotated with coreference, which we converted into a common annotation scheme. In total, CorefUD in its current version 1.2 consists of 25 datasets for 16 languages. ...
 Tento záznam obsahuje 1 soubor (83.66 MB).
 
Publicly Available Distributed under Creative Commons

Nejnavštěvovanější záznamy

Za poslední týden
 toolService 
toolService
Autoři
Neznámý autor
 Tento záznam neobsahuje soubory.
 corpus 
corpus
Autoři
Popis:
The database consists of three sets: - Many Talker Set: 30 males, 30 females; each to read 50 numbers, 1-2 connected passages, 1 block of "filler" sentences, and 1 block of syllables. - Few Talker Set: 4 males, 4 females; ...
 Tento záznam neobsahuje soubory.
 toolService 
toolService
Autoři
Neznámý autor
Popis:
frequency list of the Parole corpus, 1 339 787 words
 Tento záznam neobsahuje soubory.