This is not the latest version of this item. The latest version can be found here.
Annotated corpora and tools of the PARSEME Shared Task on Automatic Identification of Verbal Multiword Expressions (edition 1.0)
Please use the following text to cite this item or export to a predefined format:
Savary, Agata; et al., 2017,
Annotated corpora and tools of the PARSEME Shared Task on Automatic Identification of Verbal Multiword Expressions (edition 1.0), LINDAT/CLARIAH-CZ digital library at the Institute of Formal and Applied Linguistics (ÚFAL),
http://hdl.handle.net/11372/LRT-2282.
Authors
Savary, Agata ; et al.
Item identifier
Project URL
Date issued
2017-01-20
Size
274376 sentences,
5439204 tokens,
62218 multiWordUnits
Description
The PARSEME shared task aims at identifying verbal MWEs in running texts. Verbal MWEs include idioms (let the cat out of the bag), light verb constructions (make a decision), verb-particle constructions (give up), and inherently reflexive verbs (se suicider 'to suicide' in French). VMWEs were annotated according to the universal guidelines in 18 languages. The corpora are provided in the parsemetsv format, inspired by the CONLL-U format.
For most languages, paired files in the CONLL-U format - not necessarily using UD tagsets - containing parts of speech, lemmas, morphological features and/or syntactic dependencies are also provided. Depending on the language, the information comes from treebanks (e.g., Universal Dependencies) or from automatic parsers trained on treebanks (e.g., UDPipe).
This item contains training and test data, tools and the universal guidelines file.
Publisher
Acknowledgement
COST
Project code:IC1207
Project name:PARSEME: PARSing and Multi-word Expressions
Collections
Files in this item
- Name
- IT.tgz
- Size
- 4.82 MB
- Format
- application/x-gzip
- Description
- gzip Archive
- MD5
- 64ab3ab19e87767e9fa9764130e41046

The file preview has not been generated yet. Please try again later or contact the system administrator lindat-help@ufal.mff.cuni.cz
- Name
- CS.tgz
- Size
- 10.56 MB
- Format
- application/x-gzip
- Description
- gzip Archive
- MD5
- b97b0f5bed1ed94f096be4150ee68049

The file preview has not been generated yet. Please try again later or contact the system administrator lindat-help@ufal.mff.cuni.cz
- Name
- TR.tgz
- Size
- 4.65 MB
- Format
- application/x-gzip
- Description
- gzip Archive
- MD5
- 4d34bb3f81dec21184b9877da2dcf12b

The file preview has not been generated yet. Please try again later or contact the system administrator lindat-help@ufal.mff.cuni.cz
- Name
- ES.tgz
- Size
- 2.16 MB
- Format
- application/x-gzip
- Description
- gzip Archive
- MD5
- e1ae9704c3608f78cf57e09bb9b165dd

The file preview has not been generated yet. Please try again later or contact the system administrator lindat-help@ufal.mff.cuni.cz
- Name
- SV.tgz
- Size
- 499.71 KB
- Format
- application/x-gzip
- Description
- gzip Archive
- MD5
- 52db26a0ba0dbc2e283c4551795b5271

The file preview has not been generated yet. Please try again later or contact the system administrator lindat-help@ufal.mff.cuni.cz
- Name
- FR.tgz
- Size
- 7.89 MB
- Format
- application/x-gzip
- Description
- gzip Archive
- MD5
- 69f9a65d2a6c127573f8b646cc10eeb3

The file preview has not been generated yet. Please try again later or contact the system administrator lindat-help@ufal.mff.cuni.cz
- Name
- EL.tgz
- Size
- 3.59 MB
- Format
- application/x-gzip
- Description
- gzip Archive
- MD5
- 0956143f60ed16a0c01c4ec87e6c07f3

The file preview has not been generated yet. Please try again later or contact the system administrator lindat-help@ufal.mff.cuni.cz
- Name
- FA.tgz
- Size
- 593.52 KB
- Format
- application/x-gzip
- Description
- gzip Archive
- MD5
- cc0686b1f93e0b0782855e514c54823a

The file preview has not been generated yet. Please try again later or contact the system administrator lindat-help@ufal.mff.cuni.cz
- Name
- HE.tgz
- Size
- 804.67 KB
- Format
- application/x-gzip
- Description
- gzip Archive
- MD5
- 6e20cc548086eeb18469841b0c5b1393

The file preview has not been generated yet. Please try again later or contact the system administrator lindat-help@ufal.mff.cuni.cz
- Name
- DE.tgz
- Size
- 2.09 MB
- Format
- application/x-gzip
- Description
- gzip Archive
- MD5
- 94ea5c3f074b783a090946e9e7e208ce

The file preview has not been generated yet. Please try again later or contact the system administrator lindat-help@ufal.mff.cuni.cz
- Name
- Annotation_guidelines_PARSEME_Shared_Task_1.0.pdf
- Size
- 608.46 KB
- Format
- application/pdf
- Description
- Adobe PDF
- MD5
- 7efe5547bd0d85cd3f341f0125a35a6c

The file preview has not been generated yet. Please try again later or contact the system administrator lindat-help@ufal.mff.cuni.cz
- Name
- Description_paper_PARSEME_Shared_Task_1.0.pdf
- Size
- 278.76 KB
- Format
- application/pdf
- Description
- Adobe PDF
- MD5
- 6947539d298d53bbcd9024437bd29939

The file preview has not been generated yet. Please try again later or contact the system administrator lindat-help@ufal.mff.cuni.cz
- Name
- RO.tgz
- Size
- 8 MB
- Format
- application/x-gzip
- Description
- gzip Archive
- MD5
- 83f76629b83fc380facb4e11e98f119e

The file preview has not been generated yet. Please try again later or contact the system administrator lindat-help@ufal.mff.cuni.cz
- Name
- PT.tgz
- Size
- 5.1 MB
- Format
- application/x-gzip
- Description
- gzip Archive
- MD5
- eec1f919ce50aad1099d69381ab1f76e

The file preview has not been generated yet. Please try again later or contact the system administrator lindat-help@ufal.mff.cuni.cz
- Name
- PL.tgz
- Size
- 3.45 MB
- Format
- application/x-gzip
- Description
- gzip Archive
- MD5
- 4f54d970d85b325b4c3b3f621e6c192d

The file preview has not been generated yet. Please try again later or contact the system administrator lindat-help@ufal.mff.cuni.cz
- Name
- HU.tgz
- Size
- 1.3 MB
- Format
- application/x-gzip
- Description
- gzip Archive
- MD5
- 06ce4fa53dcaeda0b1bdce90a24637ea

The file preview has not been generated yet. Please try again later or contact the system administrator lindat-help@ufal.mff.cuni.cz
- Name
- SL.tgz
- Size
- 2.89 MB
- Format
- application/x-gzip
- Description
- gzip Archive
- MD5
- abf463cd1bf7855d35efb581706ebf66

The file preview has not been generated yet. Please try again later or contact the system administrator lindat-help@ufal.mff.cuni.cz
- Name
- MT.tgz
- Size
- 2.89 MB
- Format
- application/x-gzip
- Description
- gzip Archive
- MD5
- 58f1b8bc4dc99429f504df50c59d21e5

The file preview has not been generated yet. Please try again later or contact the system administrator lindat-help@ufal.mff.cuni.cz
- Name
- LT.tgz
- Size
- 1.18 MB
- Format
- application/x-gzip
- Description
- gzip Archive
- MD5
- 4b2e19fdb954c52a1adf1b1dc05de4a0

The file preview has not been generated yet. Please try again later or contact the system administrator lindat-help@ufal.mff.cuni.cz
- Name
- BG.tgz
- Size
- 959.28 KB
- Format
- application/x-gzip
- Description
- gzip Archive
- MD5
- b29b32b039d4b7cfafc02569a9e90dcd

The file preview has not been generated yet. Please try again later or contact the system administrator lindat-help@ufal.mff.cuni.cz
- Name
- README.md
- Size
- 2.67 KB
- Format
- application/octet-stream
- Description
- Unknown
- MD5
- 3b65e76fcb453f3dbe570240b4a0ca3a

The file preview has not been generated yet. Please try again later or contact the system administrator lindat-help@ufal.mff.cuni.cz

