This is not the latest version of this item. The latest version can be found here.
Annotated corpora and tools of the PARSEME Shared Task on Semi-Supervised Identification of Verbal Multiword Expressions (edition 1.2)
Please use the following text to cite this item:
Ramisch, Carlos; et al., 2020,
Annotated corpora and tools of the PARSEME Shared Task on Semi-Supervised Identification of Verbal Multiword Expressions (edition 1.2), LINDAT/CLARIAH-CZ digital library at the Institute of Formal and Applied Linguistics (ÚFAL),
http://hdl.handle.net/11234/1-3367.
Authors
Ramisch, Carlos ; et al.
Date issued
2020-07-09
Size
279785 sentences, 5517910 tokens, 68503 multiword units
Description
This multilingual resource contains corpora in which verbal multiword expressions (VMWEs) have been manually annotated, gathered on the occasion of edition 1.2 of the PARSEME Shared Task on Semi-Supervised Identification of Verbal MWEs (2020).
VMWEs include idioms (let the cat out of the bag), light-verb constructions (make a decision), verb-particle constructions (give up), inherently reflexive verbs (help oneself), and multi-verb constructions (make do).
For edition 1.2 of the shared task, the data cover 14 languages, for which VMWEs were annotated according to the universal guidelines. The corpora are provided in the cupt format, inspired by the CoNLL-U format.
Morphological and syntactic information – not necessarily using UD tagsets – including parts of speech, lemmas, morphological features and/or syntactic dependencies, is also provided. Depending on the language, the information comes from treebanks (e.g., Universal Dependencies) or from automatic parsers trained on treebanks (e.g., UDPipe).
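To illustrate how such files might be read, here is a minimal Python sketch. It rests on assumptions that this page does not state: that each file starts with a `# global.columns = ...` header naming tab-separated columns, that the VMWE annotation sits in a column called `PARSEME:MWE`, that `*` and `_` mean "no annotation", and that the first token of a VMWE carries `id:CATEGORY` while later tokens carry only the id. The function names `read_cupt` and `extract_vmwes` and the sample sentence are hypothetical; check the README.md shipped with the data for the authoritative format description.

```python
from collections import defaultdict


def read_cupt(lines):
    """Yield sentences as lists of token dicts keyed by column name.

    Assumes a '# global.columns = ...' header line (an assumption about
    the cupt format, not something stated on this page).
    """
    columns = None
    sentence = []
    for raw in lines:
        line = raw.rstrip("\n")
        if line.startswith("# global.columns ="):
            columns = line.split("=", 1)[1].split()
        elif line.startswith("#"):
            continue  # other sentence-level comments are ignored here
        elif not line.strip():
            if sentence:  # a blank line closes the current sentence
                yield sentence
                sentence = []
        else:
            if columns is None:
                raise ValueError("missing '# global.columns' header")
            sentence.append(dict(zip(columns, line.split("\t"))))
    if sentence:
        yield sentence


def extract_vmwes(sentence, mwe_column="PARSEME:MWE"):
    """Return {vmwe_id: (category, [token forms])} for one sentence."""
    categories = {}
    members = defaultdict(list)
    for token in sentence:
        tag = token.get(mwe_column, "*")
        if tag in ("*", "_"):  # assumed markers for "no annotation"
            continue
        for part in tag.split(";"):  # a token may belong to several VMWEs
            mwe_id, _, category = part.partition(":")
            if category:  # the category is assumed to appear once per VMWE
                categories[mwe_id] = category
            members[mwe_id].append(token["FORM"])
    return {i: (categories.get(i), forms) for i, forms in members.items()}


# Hypothetical sample sentence, written only for this illustration.
header = ("# global.columns = ID FORM LEMMA UPOS XPOS FEATS HEAD DEPREL "
          "DEPS MISC PARSEME:MWE")
rows = [
    "1 She she PRON _ _ 2 nsubj _ _ *",
    "2 gave give VERB _ _ 0 root _ _ 1:VPC.full",
    "3 up up ADP _ _ 2 compound:prt _ _ 1",
    "4 . . PUNCT _ _ 2 punct _ _ *",
]
sample = [header] + ["\t".join(row.split()) for row in rows] + [""]

for sent in read_cupt(sample):
    print(extract_vmwes(sent))
```

Reading the column names from the header rather than hard-coding positions keeps the sketch tolerant of files whose column order differs from the one assumed here.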
This item contains training, development and test data, as well as the evaluation tools used in the PARSEME Shared Task 1.2 (2020). The annotation guidelines are available online: http://parsemefr.lif.univ-mrs.fr/parseme-st-guidelines/1.2
Files in this item
- Name
- GA.tgz
- Size
- 802.56 KB
- Format
- application/x-gzip
- Description
- gzip Archive
- MD5
- b554c8424df0d989c4580ad7ab43ce56

- Name
- trial.tgz
- Size
- 113.47 KB
- Format
- application/x-gzip
- Description
- gzip Archive
- MD5
- 17c8e72d5cd58194868598f0579ab524

- Name
- SV.tgz
- Size
- 1.22 MB
- Format
- application/x-gzip
- Description
- gzip Archive
- MD5
- 868c003f369f6af324a5699dd4be0726

- Name
- IT.tgz
- Size
- 5.82 MB
- Format
- application/x-gzip
- Description
- gzip Archive
- MD5
- 4b6dc92ccf29768d80e7d136285efa09

- Name
- DE.tgz
- Size
- 2.73 MB
- Format
- application/x-gzip
- Description
- gzip Archive
- MD5
- 55d0f986e358739e56573c221c617282

- Name
- HE.tgz
- Size
- 6.45 MB
- Format
- application/x-gzip
- Description
- gzip Archive
- MD5
- 384e4e150b958ebc98d2d420feb1f4d8

- Name
- EU.tgz
- Size
- 2.95 MB
- Format
- application/x-gzip
- Description
- gzip Archive
- MD5
- a3cdb2376ab2e000820a98e1cf22ba87

- Name
- PT.tgz
- Size
- 10.15 MB
- Format
- application/x-gzip
- Description
- gzip Archive
- MD5
- 4827ddfb24df8f634543be85b5939609

- Name
- TR.tgz
- Size
- 5.21 MB
- Format
- application/x-gzip
- Description
- gzip Archive
- MD5
- 7ed6b16e8fcc30d646d04791ebb54b4a

- Name
- HI.tgz
- Size
- 771.32 KB
- Format
- application/x-gzip
- Description
- gzip Archive
- MD5
- 8ea2dd1a1f9082f53234c597e330eaad

- Name
- ZH.tgz
- Size
- 8.21 MB
- Format
- application/x-gzip
- Description
- gzip Archive
- MD5
- 052a11ca5136136be9d46935765b7a2a

- Name
- EL.tgz
- Size
- 9.41 MB
- Format
- application/x-gzip
- Description
- gzip Archive
- MD5
- 4dfafa58e1f504f3600d54401beccf24

- Name
- FR.tgz
- Size
- 7.43 MB
- Format
- application/x-gzip
- Description
- gzip Archive
- MD5
- a3f1331707d34c31b0dc221799a5e8b6

- Name
- PL.tgz
- Size
- 8.29 MB
- Format
- application/x-gzip
- Description
- gzip Archive
- MD5
- f1db1c5bd299d7d4ed1eaa4d76ab91a8

- Name
- RO.tgz
- Size
- 20.59 MB
- Format
- application/x-gzip
- Description
- gzip Archive
- MD5
- 2972d6276e83a15596ec1d527c8689b0

- Name
- bin.tgz
- Size
- 19.7 KB
- Format
- application/x-gzip
- Description
- gzip Archive
- MD5
- 456e2a812566cc791a0d6be38f507bdd

- Name
- README.md
- Size
- 6.7 KB
- Format
- application/octet-stream
- Description
- Unknown
- MD5
- a8b7e1ba4c2b8b09cf76c040fb5d41ab
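
The MD5 values listed above can be used to check that a download arrived intact. A minimal sketch, assuming the archive has been saved locally under its listed name (the helper names `md5_of_file` and `verify` are hypothetical, not part of the shared task tools):

```python
import hashlib
import os


def md5_of_file(path, chunk_size=1 << 20):
    """Return the hexadecimal MD5 digest of a file, read in chunks."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        # iter() with a sentinel stops once read() returns empty bytes
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def verify(path, expected_md5):
    """Compare a file's MD5 digest with the value listed for it."""
    return md5_of_file(path) == expected_md5.strip().lower()


if __name__ == "__main__":
    # File name and checksum copied from the listing above; the local
    # path is an assumption about where the archive was saved.
    archive = "GA.tgz"
    if os.path.exists(archive):
        print(archive, "OK" if verify(archive, "b554c8424df0d989c4580ad7ab43ce56") else "MISMATCH")
```

MD5 is adequate here for detecting accidental corruption during transfer; it is not a safeguard against deliberate tampering.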


