Universal Segmentations 1.0 (UniSegments 1.0)
Please use the following text to cite this item:
Žabokrtský, Zdeněk; et al., 2022,
Universal Segmentations 1.0 (UniSegments 1.0), LINDAT/CLARIAH-CZ digital library at the Institute of Formal and Applied Linguistics (ÚFAL),
http://hdl.handle.net/11234/1-4629.
Authors
Žabokrtský, Zdeněk; et al.
Date issued
2022-01-17
Size
38 files
Description
Universal Segmentations (UniSegments) is a collection of lexical resources that capture morphological segmentations, harmonised into a cross-linguistically consistent annotation scheme for many languages. The annotation scheme consists of simple tab-separated columns that store a word and its morphological segmentations, together with information about the word and the segmented units, e.g., part-of-speech categories and the types of morphs/morphemes. The current public version of the collection contains 38 harmonised segmentation datasets covering 30 different languages.
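As a rough illustration of working with tab-separated data of this kind, the following Python sketch reads rows into named fields. The column order (word, part of speech, segmentation), the " + " separator between segmented units, and the sample line are all assumptions made for this example; the actual layout should be taken from the documentation shipped with the archive.

```python
import csv
import io
from typing import Iterator, NamedTuple


class Entry(NamedTuple):
    """One row of a segmentation file (hypothetical column layout)."""
    word: str
    pos: str
    segments: list


def read_entries(stream) -> Iterator[Entry]:
    """Yield entries from a tab-separated stream, skipping short rows."""
    # QUOTE_NONE keeps quotation marks inside words as literal characters.
    reader = csv.reader(stream, delimiter="\t", quoting=csv.QUOTE_NONE)
    for row in reader:
        if len(row) < 3:
            continue  # not enough columns for the assumed layout
        word, pos, segmentation = row[0], row[1], row[2]
        # Assumed convention: units are separated by " + ".
        yield Entry(word, pos, segmentation.split(" + "))


# Invented sample line, for illustration only (not taken from the data).
SAMPLE = "unhappiness\tNOUN\tun + happi + ness\n"
entries = list(read_entries(io.StringIO(SAMPLE)))
```

To read a real file, pass an open text-mode file handle (with `encoding="utf-8"` and `newline=""`) in place of the `io.StringIO` object.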
Acknowledgement
Grantová agentura České Republiky
Project code: 19-14534S
Project name: Popis slovotvorné struktury českých slov na základě jazykových dat
Charles University
Project code: START/HUM/010
Project name: A data-based approach to competition in word-formation: selected semantic categories across seven languages
Univerzita Karlova (mimo GAUK)
Project code: SVV 260 453
Project name: Specifický vysokoškolský výzkum
Ministerstvo školství, mládeže a tělovýchovy České republiky
Project code: LM2015071
Project name: LINDAT/CLARIN: Institut pro analýzu, zpracování a distribuci lingvistických dat
Ministerstvo školství, mládeže a tělovýchovy České republiky
Project code: LM2018101
Project name: LINDAT/CLARIAH-CZ: Digitální výzkumná infrastruktura pro jazykové technologie, umění a humanitní vědy
Files in this item
- Name: UniSegments-1.0-public.tar.gz
- Size: 130.55 MB
- Format: application/x-gzip
- Description: gzip Archive
- MD5: d8a436b31b51e0123231213290f455fd
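The MD5 value listed for the archive can be used to check that a download is complete and uncorrupted. A minimal Python sketch follows; the helper names `md5_of_file` and `verify` are hypothetical, and the local path passed to `verify` is an assumption about where the file was saved.

```python
import hashlib

# Checksum as listed for UniSegments-1.0-public.tar.gz on this page.
EXPECTED_MD5 = "d8a436b31b51e0123231213290f455fd"


def md5_of_file(path: str, chunk_size: int = 1 << 20) -> str:
    """Return the hexadecimal MD5 digest of a file, read in chunks."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        # Read fixed-size chunks so large archives need not fit in memory.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def verify(path: str, expected: str = EXPECTED_MD5) -> bool:
    """Return True if the file's MD5 digest matches the expected value."""
    return md5_of_file(path) == expected.lower()
```

For example, `verify("UniSegments-1.0-public.tar.gz")` would return `True` for an intact copy saved under that name in the current directory.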


