This is a new version of the repository. Do let us know (lindat-help at ufal.mff.cuni.cz) if you encounter any issues.
 

Annotated corpora and tools of the PARSEME Shared Task on Automatic Identification of Verbal Multiword Expressions (edition 1.1)

Please use the following text to cite this item or export to a predefined format:
Ramisch, Carlos; et al., 2018, Annotated corpora and tools of the PARSEME Shared Task on Automatic Identification of Verbal Multiword Expressions (edition 1.1), LINDAT/CLARIAH-CZ digital library at the Institute of Formal and Applied Linguistics (ÚFAL), http://hdl.handle.net/11372/LRT-2842.
Authors
show everyone
Date issued
2018-04-30
Size
277701 sentences,
5807087 tokens,
75107 multiWordUnits
Description
This multilingual resource contains corpora in which verbal MWEs have been manually annotated. VMWEs include idioms (let the cat out of the bag), light-verb constructions (make a decision), verb-particle constructions (give up), inherently reflexive verbs (help oneself), and multi-verb constructions (make do). VMWEs were annotated according to the universal guidelines in 19 languages. The corpora are provided in the cupt format, inspired by the CONLL-U format. The corpora were used in the 1.1 edition of the PARSEME Shared Task (2018). For most languages, morphological and syntactic information ­­­­– not necessarily using UD tagsets – including parts of speech, lemmas, morphological features and/or syntactic dependencies are also provided. Depending on the language, the information comes from treebanks (e.g., Universal Dependencies) or from automatic parsers trained on treebanks (e.g., UDPipe). This item contains training, development and test data, as well as the evaluation tools used in the PARSEME Shared Task 1.1 (2018). The annotation guidelines are available online: http://parsemefr.lif.univ-mrs.fr/parseme-st-guidelines/1.1
Publisher

Version History

Showing 1 - 4 out of 4 results
VersionDateSummary
2023-05-10 00:00:00
2020-07-09 00:00:00
2*
2018-04-30 00:00:00
2017-01-20 00:00:00
* Selected version
This item isPublicly Available
and licensed under:
 Files in this item
Name
EN.tgz
Size
2.05 MB
Format
application/x-gzip
Description
gzip Archive
MD5
561093f4482a52e05f58482e6f98599e
Preview
  File Preview
Name
ES.tgz
Size
2.55 MB
Format
application/x-gzip
Description
gzip Archive
MD5
69e4ec35058e366b45a7316987353319
Preview
  File Preview
Name
BG.tgz
Size
6.38 MB
Format
application/x-gzip
Description
gzip Archive
MD5
2c42058921f4f2aa09402fcbb7494077
Preview
  File Preview
Name
HU.tgz
Size
1.86 MB
Format
application/x-gzip
Description
gzip Archive
MD5
a5aa443cc44cdf750af82d1631b12e05
Preview
  File Preview
Name
HE.tgz
Size
5.55 MB
Format
application/x-gzip
Description
gzip Archive
MD5
3c9bdcd6ae9372e00f74f90afda741d8
Preview
  File Preview
Name
PL.tgz
Size
4.36 MB
Format
application/x-gzip
Description
gzip Archive
MD5
6cfb66bd4176b794ebe5a64c34a342a3
Preview
  File Preview
Name
TR.tgz
Size
4.39 MB
Format
application/x-gzip
Description
gzip Archive
MD5
88ed39565e0b5cd38f4bc2ff0cced2bd
Preview
  File Preview
Name
EL.tgz
Size
3.4 MB
Format
application/x-gzip
Description
gzip Archive
MD5
36f6602138c4bfb397baea31f8ecf7a7
Preview
  File Preview
Name
HR.tgz
Size
1.32 MB
Format
application/x-gzip
Description
gzip Archive
MD5
f401a6c094e6e1bf48058bef651cd220
Preview
  File Preview
Name
SL.tgz
Size
3.24 MB
Format
application/x-gzip
Description
gzip Archive
MD5
71b95675e197aaa6aa75dcfd07483583
Preview
  File Preview
Name
bin.tgz
Size
19.6 KB
Format
application/x-gzip
Description
gzip Archive
MD5
45e16f3ea085b9d64b2839046327a1b6
Preview
  File Preview
Name
PT.tgz
Size
7.1 MB
Format
application/x-gzip
Description
gzip Archive
MD5
1b9a5b3a3455e8c3fa9ecb4d7a73a944
Preview
  File Preview
Name
RO.tgz
Size
12.14 MB
Format
application/x-gzip
Description
gzip Archive
MD5
7afa2507293f5251b07057d4cec7fab8
Preview
  File Preview
Name
DE.tgz
Size
2.38 MB
Format
application/x-gzip
Description
gzip Archive
MD5
b28e381834aa389e074f184733fcfb57
Preview
  File Preview
Name
EU.tgz
Size
2.05 MB
Format
application/x-gzip
Description
gzip Archive
MD5
7b6178745a43a1a83f778420972ffecd
Preview
  File Preview
Name
LT.tgz
Size
3.44 MB
Format
application/x-gzip
Description
gzip Archive
MD5
39a7a0d6f40d7dfb3eef51b86ed066b4
Preview
  File Preview
Name
HI.tgz
Size
626.27 KB
Format
application/x-gzip
Description
gzip Archive
MD5
9a02e6003be29d13a546b6af053d2baa
Preview
  File Preview
Name
FA.tgz
Size
696.13 KB
Format
application/x-gzip
Description
gzip Archive
MD5
4c8266063c4535c71edd3dd6b7b2383c
Preview
  File Preview
Name
IT.tgz
Size
4.47 MB
Format
application/x-gzip
Description
gzip Archive
MD5
5927b51aa913c4d0c863dcbbc256e2ce
Preview
  File Preview
Name
FR.tgz
Size
6.04 MB
Format
application/x-gzip
Description
gzip Archive
MD5
dcd671a8f7e9737fcb45933b2913daea
Preview
  File Preview
Name
README.md
Size
5.95 KB
Format
application/octet-stream
Description
Unknown
MD5
829b0937130ce28abc179a9619e88e9c
Preview
  File Preview
Name
trial.tgz
Size
4.74 KB
Format
application/x-gzip
Description
gzip Archive
MD5
8d3fbcfe9a832e66c1a4e61dce8d1f14
Preview
  File Preview