This is not the latest version of this item. The latest version can be found here.
Annotated corpora and tools of the PARSEME Shared Task on Automatic Identification of Verbal Multiword Expressions (edition 1.1)
Please use the following text to cite this item or export to a predefined format:
Ramisch, Carlos; et al., 2018,
Annotated corpora and tools of the PARSEME Shared Task on Automatic Identification of Verbal Multiword Expressions (edition 1.1), LINDAT/CLARIAH-CZ digital library at the Institute of Formal and Applied Linguistics (ÚFAL),
http://hdl.handle.net/11372/LRT-2842.
Authors
Ramisch, Carlos ; et al.
Item identifier
Project URL
Referenced by
Date issued
2018-04-30
Size
277701 sentences,
5807087 tokens,
75107 multiWordUnits
Description
This multilingual resource contains corpora in which verbal MWEs have been manually annotated. VMWEs include idioms (let the cat out of the bag), light-verb constructions (make a decision), verb-particle constructions (give up), inherently reflexive verbs (help oneself), and multi-verb constructions (make do). VMWEs were annotated according to the universal guidelines in 19 languages. The corpora are provided in the cupt format, inspired by the CONLL-U format. The corpora were used in the 1.1 edition of the PARSEME Shared Task (2018).
For most languages, morphological and syntactic information – not necessarily using UD tagsets – including parts of speech, lemmas, morphological features and/or syntactic dependencies are also provided. Depending on the language, the information comes from treebanks (e.g., Universal Dependencies) or from automatic parsers trained on treebanks (e.g., UDPipe).
This item contains training, development and test data, as well as the evaluation tools used in the PARSEME Shared Task 1.1 (2018).
The annotation guidelines are available online: http://parsemefr.lif.univ-mrs.fr/parseme-st-guidelines/1.1
Publisher
Collections
Files in this item
- Name
- EN.tgz
- Size
- 2.05 MB
- Format
- application/x-gzip
- Description
- gzip Archive
- MD5
- 561093f4482a52e05f58482e6f98599e

The file preview has not been generated yet. Please try again later or contact the system administrator lindat-help@ufal.mff.cuni.cz
- Name
- ES.tgz
- Size
- 2.55 MB
- Format
- application/x-gzip
- Description
- gzip Archive
- MD5
- 69e4ec35058e366b45a7316987353319

The file preview has not been generated yet. Please try again later or contact the system administrator lindat-help@ufal.mff.cuni.cz
- Name
- BG.tgz
- Size
- 6.38 MB
- Format
- application/x-gzip
- Description
- gzip Archive
- MD5
- 2c42058921f4f2aa09402fcbb7494077

The file preview has not been generated yet. Please try again later or contact the system administrator lindat-help@ufal.mff.cuni.cz
- Name
- HU.tgz
- Size
- 1.86 MB
- Format
- application/x-gzip
- Description
- gzip Archive
- MD5
- a5aa443cc44cdf750af82d1631b12e05

The file preview has not been generated yet. Please try again later or contact the system administrator lindat-help@ufal.mff.cuni.cz
- Name
- HE.tgz
- Size
- 5.55 MB
- Format
- application/x-gzip
- Description
- gzip Archive
- MD5
- 3c9bdcd6ae9372e00f74f90afda741d8

The file preview has not been generated yet. Please try again later or contact the system administrator lindat-help@ufal.mff.cuni.cz
- Name
- PL.tgz
- Size
- 4.36 MB
- Format
- application/x-gzip
- Description
- gzip Archive
- MD5
- 6cfb66bd4176b794ebe5a64c34a342a3

The file preview has not been generated yet. Please try again later or contact the system administrator lindat-help@ufal.mff.cuni.cz
- Name
- TR.tgz
- Size
- 4.39 MB
- Format
- application/x-gzip
- Description
- gzip Archive
- MD5
- 88ed39565e0b5cd38f4bc2ff0cced2bd

The file preview has not been generated yet. Please try again later or contact the system administrator lindat-help@ufal.mff.cuni.cz
- Name
- EL.tgz
- Size
- 3.4 MB
- Format
- application/x-gzip
- Description
- gzip Archive
- MD5
- 36f6602138c4bfb397baea31f8ecf7a7

The file preview has not been generated yet. Please try again later or contact the system administrator lindat-help@ufal.mff.cuni.cz
- Name
- HR.tgz
- Size
- 1.32 MB
- Format
- application/x-gzip
- Description
- gzip Archive
- MD5
- f401a6c094e6e1bf48058bef651cd220

The file preview has not been generated yet. Please try again later or contact the system administrator lindat-help@ufal.mff.cuni.cz
- Name
- SL.tgz
- Size
- 3.24 MB
- Format
- application/x-gzip
- Description
- gzip Archive
- MD5
- 71b95675e197aaa6aa75dcfd07483583

The file preview has not been generated yet. Please try again later or contact the system administrator lindat-help@ufal.mff.cuni.cz
- Name
- bin.tgz
- Size
- 19.6 KB
- Format
- application/x-gzip
- Description
- gzip Archive
- MD5
- 45e16f3ea085b9d64b2839046327a1b6

The file preview has not been generated yet. Please try again later or contact the system administrator lindat-help@ufal.mff.cuni.cz
- Name
- PT.tgz
- Size
- 7.1 MB
- Format
- application/x-gzip
- Description
- gzip Archive
- MD5
- 1b9a5b3a3455e8c3fa9ecb4d7a73a944

The file preview has not been generated yet. Please try again later or contact the system administrator lindat-help@ufal.mff.cuni.cz
- Name
- RO.tgz
- Size
- 12.14 MB
- Format
- application/x-gzip
- Description
- gzip Archive
- MD5
- 7afa2507293f5251b07057d4cec7fab8

The file preview has not been generated yet. Please try again later or contact the system administrator lindat-help@ufal.mff.cuni.cz
- Name
- DE.tgz
- Size
- 2.38 MB
- Format
- application/x-gzip
- Description
- gzip Archive
- MD5
- b28e381834aa389e074f184733fcfb57

The file preview has not been generated yet. Please try again later or contact the system administrator lindat-help@ufal.mff.cuni.cz
- Name
- EU.tgz
- Size
- 2.05 MB
- Format
- application/x-gzip
- Description
- gzip Archive
- MD5
- 7b6178745a43a1a83f778420972ffecd

The file preview has not been generated yet. Please try again later or contact the system administrator lindat-help@ufal.mff.cuni.cz
- Name
- LT.tgz
- Size
- 3.44 MB
- Format
- application/x-gzip
- Description
- gzip Archive
- MD5
- 39a7a0d6f40d7dfb3eef51b86ed066b4

The file preview has not been generated yet. Please try again later or contact the system administrator lindat-help@ufal.mff.cuni.cz
- Name
- HI.tgz
- Size
- 626.27 KB
- Format
- application/x-gzip
- Description
- gzip Archive
- MD5
- 9a02e6003be29d13a546b6af053d2baa

The file preview has not been generated yet. Please try again later or contact the system administrator lindat-help@ufal.mff.cuni.cz
- Name
- FA.tgz
- Size
- 696.13 KB
- Format
- application/x-gzip
- Description
- gzip Archive
- MD5
- 4c8266063c4535c71edd3dd6b7b2383c

The file preview has not been generated yet. Please try again later or contact the system administrator lindat-help@ufal.mff.cuni.cz
- Name
- IT.tgz
- Size
- 4.47 MB
- Format
- application/x-gzip
- Description
- gzip Archive
- MD5
- 5927b51aa913c4d0c863dcbbc256e2ce

The file preview has not been generated yet. Please try again later or contact the system administrator lindat-help@ufal.mff.cuni.cz
- Name
- FR.tgz
- Size
- 6.04 MB
- Format
- application/x-gzip
- Description
- gzip Archive
- MD5
- dcd671a8f7e9737fcb45933b2913daea

The file preview has not been generated yet. Please try again later or contact the system administrator lindat-help@ufal.mff.cuni.cz
- Name
- README.md
- Size
- 5.95 KB
- Format
- application/octet-stream
- Description
- Unknown
- MD5
- 829b0937130ce28abc179a9619e88e9c

The file preview has not been generated yet. Please try again later or contact the system administrator lindat-help@ufal.mff.cuni.cz
- Name
- trial.tgz
- Size
- 4.74 KB
- Format
- application/x-gzip
- Description
- gzip Archive
- MD5
- 8d3fbcfe9a832e66c1a4e61dce8d1f14

The file preview has not been generated yet. Please try again later or contact the system administrator lindat-help@ufal.mff.cuni.cz

