Czech Verbal MWEs
Please use the following text to cite this item or export to a predefined format:
Bejček, Eduard, 2017,
Czech Verbal MWEs, LINDAT/CLARIAH-CZ digital library at the Institute of Formal and Applied Linguistics (ÚFAL),
http://hdl.handle.net/11234/1-2603.
Authors
Item identifier
Referenced by
Date issued
2017
Size
4785 items
Language(s)
Description
Lexicon of Czech verbal multiword expressions (VMWEs) used in Parseme Shared Task 2017. https://typo.uni-konstanz.de/parseme/index.php/2-general/142-parseme-shared-task-on-automatic-detection-of-verbal-mwes
Lexicon consists of 4785 VMWEs, categorized into four categories according to Parseme Shared Task (PST) typology: IReflV (inherently reflexive verbs), LVC (light verb constructions), ID (idiomatic expressions) and OTH (other VMWEs with other than verbal syntactic head).
Verbal multiword expressions as well as deverbative variants of VMWEs were annotated during the preparation phase of PST. These data were published as http://hdl.handle.net/11372/LRT-2282. Czech part includes 14,536 VMWE occurences:
1611 ID
10000 IReflV
2923 LVC
2 OTH
This lexicon was created out of Czech data. Each lexicon entry is represented by one line in the form:
type lemmas frequency PoS [used form 1; used form 2; ... ]
(columns are separated by tabs) where:
type ... is the type of VMWE in PST typology
lemmas ... are space separated lemmatized forms of all words that constitutes the VMWE
frequency ... is the absolute frequency of this item in PST data
PoS ... is a space separated list of parts of speech of individual words (in the same order as in "lemmas")
final field contains a list of all (1 to 18) used forms found in the data (since Czech is a flective language).
Acknowledgement
Institute of Formal and Applied Linguistics
Project code:LD14117
Project name:Parseme CZ
Subject(s)
Collections
This item isPublicly Available
and licensed under:
Files in this item
- Name
- lexicon_czech_VMWEs.tsv
- Size
- 313.46 KB
- Format
- application/octet-stream
- Description
- Unknown
- MD5
- 65c5fa7b9391f8e33fbb81c8f42c1d15

The file preview has not been generated yet. Please try again later or contact the system administrator lindat-help@ufal.mff.cuni.cz

