Verbs annotated for morphemic structure in Czech, English, German, Spanish
Please use the following text to cite this item:
Hledíková, Hana, 2024, Verbs annotated for morphemic structure in Czech, English, German, Spanish, LINDAT/CLARIAH-CZ digital library at the Institute of Formal and Applied Linguistics (ÚFAL), http://hdl.handle.net/11234/1-5824.
Authors
Hledíková, Hana
Item identifier
http://hdl.handle.net/11234/1-5824
Date issued
2024
Size
69107 items
Description
A sample of verb lemmas in four languages: Czech (19,030 lemmas), English (9,965 lemmas), German (27,224 lemmas), and Spanish (11,888 lemmas). Each verb lemma is annotated for its morphemic structure (i.e., segmented into the prefix(es), root(s), suffix(es) and ending(s) that the lemma contains), for the root morpheme to which its root morph is assigned where needed (to facilitate grouping of verbs with the same root morpheme), and for its frequency in a 100 M corpus. Two versions are available for each language: one with a coarser-grained segmentation, which captures the morphemic structure that is synchronically available, and one with a finer-grained segmentation, which also captures the word's etymology.
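The column layout of the TSV files is not documented on this page, so a reasonable first step is to look at the first few rows before relying on any column position. The following is a minimal sketch, assuming the files are UTF-8 encoded and tab-separated without quoting; the helper name `peek_tsv` is hypothetical and not part of the dataset.

```python
import csv
from itertools import islice


def peek_tsv(path, limit=5, encoding="utf-8"):
    """Return the first `limit` rows of a tab-separated file as lists of strings.

    Assumptions (not confirmed by the item record): UTF-8 encoding,
    tab as the only delimiter, no quoting of fields.
    """
    with open(path, newline="", encoding=encoding) as handle:
        reader = csv.reader(handle, delimiter="\t", quoting=csv.QUOTE_NONE)
        return list(islice(reader, limit))


# Example use after downloading a file from this item:
#     for row in peek_tsv("Czech_final.tsv"):
#         print(row)
```

Inspecting the rows this way shows how many fields each line has and whether the first row is a header, which then determines how the segmentation, root morpheme, and frequency fields should be read.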
Acknowledgement
Charles University Grant Agency
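Project code: GAUK 246723
Project name: Morfematická komplexita slovesné slovní zásoby ve čtyřech jazycích: Kvantitativní výzkum založený na korpusových datech (Morphemic complexity of the verb vocabulary in four languages: Quantitative research based on corpus data)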
This item is Publicly Available
and licensed under:
Files in this item
- Name: Spanish_less_final.tsv
- Size: 561.08 KB
- Format: application/octet-stream
- Description: Unknown
- MD5: d0094496852d367d82dc6a51faf9b71a

- Name: Spanish_final.tsv
- Size: 607.19 KB
- Format: application/octet-stream
- Description: Unknown
- MD5: e1631822a73ea528ea1c0a29ee75926e

- Name: German_less_final.tsv
- Size: 1.25 MB
- Format: application/octet-stream
- Description: Unknown
- MD5: 50d1e4ec0917e897389185db8b2197f6

- Name: German_final.tsv
- Size: 1.27 MB
- Format: application/octet-stream
- Description: Unknown
- MD5: a0488af373b2ffee3d3b854d96d6bf77

- Name: English_less_final.tsv
- Size: 370.21 KB
- Format: application/octet-stream
- Description: Unknown
- MD5: bde4a3bea7dc75988f841e2e928ed226

- Name: English_final.tsv
- Size: 404.49 KB
- Format: application/octet-stream
- Description: Unknown
- MD5: 3d610a79ef8395102a7f3faac9f2fc3a

- Name: Czech_less_final.tsv
- Size: 1012.67 KB
- Format: application/octet-stream
- Description: Unknown
- MD5: 96767be1ad1b2926796e55be4949c7b2

- Name: Czech_final.tsv
- Size: 1011.73 KB
- Format: application/octet-stream
- Description: Unknown
- MD5: ff63a33d565cf134113fed9838fdf29e
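
The MD5 values listed for the files can be used to check that a download is intact. The following is a minimal sketch, assuming the eight files have been saved under their original names in one local directory; the names `EXPECTED_MD5`, `md5_of_file`, and `verify_downloads` are hypothetical helpers, not something provided by the repository.

```python
import hashlib
from pathlib import Path

# Checksums copied from the file listing of this item.
EXPECTED_MD5 = {
    "Spanish_less_final.tsv": "d0094496852d367d82dc6a51faf9b71a",
    "Spanish_final.tsv": "e1631822a73ea528ea1c0a29ee75926e",
    "German_less_final.tsv": "50d1e4ec0917e897389185db8b2197f6",
    "German_final.tsv": "a0488af373b2ffee3d3b854d96d6bf77",
    "English_less_final.tsv": "bde4a3bea7dc75988f841e2e928ed226",
    "English_final.tsv": "3d610a79ef8395102a7f3faac9f2fc3a",
    "Czech_less_final.tsv": "96767be1ad1b2926796e55be4949c7b2",
    "Czech_final.tsv": "ff63a33d565cf134113fed9838fdf29e",
}


def md5_of_file(path, chunk_size=1 << 20):
    """Compute the MD5 hex digest of a file, reading it in chunks."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def verify_downloads(directory="."):
    """Map each expected file name to 'ok', 'mismatch', or 'missing'."""
    results = {}
    for name, expected in EXPECTED_MD5.items():
        path = Path(directory) / name
        if not path.is_file():
            results[name] = "missing"
        elif md5_of_file(path) == expected:
            results[name] = "ok"
        else:
            results[name] = "mismatch"
    return results
```

Calling `verify_downloads("path/to/downloads")` returns one status per file; a "mismatch" usually means the download was truncated or the file was modified after download.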

The file preview has not been generated yet. Please try again later or contact the system administrator lindat-help@ufal.mff.cuni.cz

