This is not the latest version of this item. The latest version can be found at http://hdl.handle.net/11372/LRT-2842.
Please use the following text to cite this item:
Savary, Agata; et al., 2017,
Annotated corpora and tools of the PARSEME Shared Task on Automatic Identification of Verbal Multiword Expressions (edition 1.0), LINDAT/CLARIAH-CZ digital library at the Institute of Formal and Applied Linguistics (ÚFAL),
http://hdl.handle.net/11372/LRT-2282.
| dc.contributor.author | Savary, Agata |
| dc.contributor.author | Ramisch, Carlos |
| dc.contributor.author | Cordeiro, Silvio Ricardo |
| dc.contributor.author | Sangati, Federico |
| dc.contributor.author | Vincze, Veronika |
| dc.contributor.author | QasemiZadeh, Behrang |
| dc.contributor.author | Candito, Marie |
| dc.contributor.author | Cap, Fabienne |
| dc.contributor.author | Giouli, Voula |
| dc.contributor.author | Stoyanova, Ivelina |
| dc.contributor.author | Doucet, Antoine |
| dc.contributor.author | Adalı, Kübra |
| dc.contributor.author | Barbu Mititelu, Verginica |
| dc.contributor.author | Bejček, Eduard |
| dc.contributor.author | El Maarouf, Ismail |
| dc.contributor.author | Eryiğit, Gülşen |
| dc.contributor.author | Galea, Luke |
| dc.contributor.author | Ha-Cohen Kerner, Yaakov |
| dc.contributor.author | Liebeskind, Chaya |
| dc.contributor.author | Monti, Johanna |
| dc.contributor.author | Parra Escartín, Carla |
| dc.contributor.author | Kovalevskaitė, Jolanta |
| dc.contributor.author | Krek, Simon |
| dc.contributor.author | van der Plas, Lonneke |
| dc.contributor.author | Aceta, Cristina |
| dc.contributor.author | Aduriz, Itziar |
| dc.contributor.author | Antoine, Jean-Yves |
| dc.contributor.author | Attard, Greta |
| dc.contributor.author | Azzopardi, Kirsty |
| dc.contributor.author | Boizou, Loic |
| dc.contributor.author | Bonnici, Janice |
| dc.contributor.author | Boz, Mert |
| dc.contributor.author | Bumbulienė, Ieva |
| dc.contributor.author | Busuttil, Jael |
| dc.contributor.author | Caruso, Valeria |
| dc.contributor.author | Cherchi, Manuela |
| dc.contributor.author | Constant, Matthieu |
| dc.contributor.author | Czerepowicka, Monika |
| dc.contributor.author | De Santis, Anna |
| dc.contributor.author | Dimitrova, Tsvetana |
| dc.contributor.author | Dinç, Tutkum |
| dc.contributor.author | Elyovich, Hevi |
| dc.contributor.author | Fabri, Ray |
| dc.contributor.author | Farrugia, Alison |
| dc.contributor.author | Findlay, Jamie |
| dc.contributor.author | Fotopoulou, Aggeliki |
| dc.contributor.author | Foufi, Vassiliki |
| dc.contributor.author | Galea, Sara Anne |
| dc.contributor.author | Gantar, Polona |
| dc.contributor.author | Gatt, Albert |
| dc.contributor.author | Gatt, Anabelle |
| dc.contributor.author | Herrero, Carlos |
| dc.contributor.author | Iñurrieta, Uxoa |
| dc.contributor.author | Jagfeld, Glorianna |
| dc.contributor.author | Hnátková, Milena |
| dc.contributor.author | Ionescu, Mihaela |
| dc.contributor.author | Klyueva, Natalia |
| dc.contributor.author | Koeva, Svetla |
| dc.contributor.author | Kovács, Viktória |
| dc.contributor.author | Kuzman, Taja |
| dc.contributor.author | Leseva, Svetlozara |
| dc.contributor.author | Louisou, Sevi |
| dc.contributor.author | Lynn, Teresa |
| dc.contributor.author | Malka, Ruth |
| dc.contributor.author | Martínez Alonso, Héctor |
| dc.contributor.author | McCrae, John |
| dc.contributor.author | de Medeiros Caseli, Helena |
| dc.contributor.author | Miral, Ayşenur |
| dc.contributor.author | Muscat, Amanda |
| dc.contributor.author | Nivre, Joakim |
| dc.contributor.author | Oakes, Michael |
| dc.contributor.author | Onofrei, Mihaela |
| dc.contributor.author | Parmentier, Yannick |
| dc.contributor.author | Pasquer, Caroline |
| dc.contributor.author | Pia di Buono, Maria |
| dc.contributor.author | Priego Sanchez, Belem |
| dc.contributor.author | Raffone, Annalisa |
| dc.contributor.author | Ramisch, Renata |
| dc.contributor.author | Rimkutė, Erika |
| dc.contributor.author | Rizea, Monica-Mihaela |
| dc.contributor.author | Simkó, Katalin |
| dc.contributor.author | Spagnol, Michael |
| dc.contributor.author | Stefanova, Valentina |
| dc.contributor.author | Stymne, Sara |
| dc.contributor.author | Sulubacak, Umut |
| dc.contributor.author | Tabone, Nicole |
| dc.contributor.author | Tanti, Marc |
| dc.contributor.author | Todorova, Maria |
| dc.contributor.author | Urešová, Zdenka |
| dc.contributor.author | Villavicencio, Aline |
| dc.contributor.author | Zilio, Leonardo |
| dc.date.accessioned | 2017-06-20T08:39:36Z |
| dc.date.available | 2017-06-20T08:39:36Z |
| dc.date.issued | 2017-01-20 |
| dc.description | The PARSEME shared task aims to identify verbal multiword expressions (VMWEs) in running text. VMWEs include idioms (let the cat out of the bag), light verb constructions (make a decision), verb-particle constructions (give up), and inherently reflexive verbs (se suicider 'to suicide' in French). VMWEs were annotated according to the universal guidelines in 18 languages. The corpora are provided in the parsemetsv format, inspired by the CoNLL-U format. For most languages, paired files in the CoNLL-U format (not necessarily using UD tagsets) containing parts of speech, lemmas, morphological features and/or syntactic dependencies are also provided. Depending on the language, this information comes from treebanks (e.g., Universal Dependencies) or from automatic parsers trained on treebanks (e.g., UDPipe). This item contains training and test data, tools and the universal guidelines file. |
| dc.identifier.uri | http://hdl.handle.net/11372/LRT-2282 |
| dc.language.iso | bul |
| dc.language.iso | ces |
| dc.language.iso | deu |
| dc.language.iso | ell |
| dc.language.iso | spa |
| dc.language.iso | fas |
| dc.language.iso | fra |
| dc.language.iso | heb |
| dc.language.iso | hun |
| dc.language.iso | ita |
| dc.language.iso | lit |
| dc.language.iso | mlt |
| dc.language.iso | pol |
| dc.language.iso | por |
| dc.language.iso | ron |
| dc.language.iso | slv |
| dc.language.iso | swe |
| dc.language.iso | tur |
| dc.publisher | PARSEME |
| dc.relation.isreferencedby | http://multiword.sourceforge.net/mwe2017/proceedings/MWE201704.pdf |
| dc.relation.isreplacedby | http://hdl.handle.net/11372/LRT-2842 |
| dc.rights | PARSEME Shared Task Data (v. 1.0) Agreement |
| dc.rights.label | PUB |
| dc.rights.uri | https://lindat.mff.cuni.cz/repository/static/licence-mwe-1.0.html |
| dc.source.uri | http://multiword.sf.net/sharedtask2017 |
| dc.subject | Multiword expressions |
| dc.subject | verbal multiword expressions |
| dc.subject | idioms |
| dc.subject | light-verb constructions |
| dc.subject | verb-particle constructions |
| dc.subject | inherently reflexive verbs |
| dc.title | Annotated corpora and tools of the PARSEME Shared Task on Automatic Identification of Verbal Multiword Expressions (edition 1.0) |
| dc.type | corpus |
| local.branding | LRT + Open Submissions |
| local.contact.person | Agata Savary agata.savary@univ-tours.fr Université François Rabelais de Tours |
| local.contact.person | Carlos Ramisch carlos.ramisch@lif.univ-mrs.fr Aix Marseille Université |
| local.contact.person | Natalia Klyueva kljueva@ufal.mff.cuni.cz Charles University in Prague, UFAL |
| local.featuredService.kontext | Czech|http://lindat.mff.cuni.cz/services/kontext/first_form?corpname=parseme_cs_a |
| local.featuredService.kontext | German|http://lindat.mff.cuni.cz/services/kontext/first_form?corpname=parseme_de_a |
| local.featuredService.kontext | Greek|http://lindat.mff.cuni.cz/services/kontext/first_form?corpname=parseme_el_a |
| local.featuredService.kontext | Spanish|http://lindat.mff.cuni.cz/services/kontext/first_form?corpname=parseme_es_a |
| local.featuredService.kontext | Persian (Farsi)|http://lindat.mff.cuni.cz/services/kontext/first_form?corpname=parseme_fa_a |
| local.featuredService.kontext | French|http://lindat.mff.cuni.cz/services/kontext/first_form?corpname=parseme_fr_a |
| local.featuredService.kontext | Hungarian|http://lindat.mff.cuni.cz/services/kontext/first_form?corpname=parseme_hu_a |
| local.featuredService.kontext | Italian|http://lindat.mff.cuni.cz/services/kontext/first_form?corpname=parseme_it_a |
| local.featuredService.kontext | Maltese|http://lindat.mff.cuni.cz/services/kontext/first_form?corpname=parseme_mt_a |
| local.featuredService.kontext | Polish|http://lindat.mff.cuni.cz/services/kontext/first_form?corpname=parseme_pl_a |
| local.featuredService.kontext | Portuguese|http://lindat.mff.cuni.cz/services/kontext/first_form?corpname=parseme_pt_a |
| local.featuredService.kontext | Romanian|http://lindat.mff.cuni.cz/services/kontext/first_form?corpname=parseme_ro_a |
| local.featuredService.kontext | Slovenian|http://lindat.mff.cuni.cz/services/kontext/first_form?corpname=parseme_sl_a |
| local.featuredService.kontext | Swedish|http://lindat.mff.cuni.cz/services/kontext/first_form?corpname=parseme_sv_a |
| local.featuredService.kontext | Turkish|http://lindat.mff.cuni.cz/services/kontext/first_form?corpname=parseme_tr_a |
| local.files.count | 21 |
| local.files.size | 67369328 |
| local.has.files | yes |
| local.language.name | Bulgarian |
| local.language.name | Czech |
| local.language.name | German |
| local.language.name | Modern Greek (1453-) |
| local.language.name | Spanish |
| local.language.name | Persian |
| local.language.name | French |
| local.language.name | Hebrew |
| local.language.name | Hungarian |
| local.language.name | Italian |
| local.language.name | Lithuanian |
| local.language.name | Maltese |
| local.language.name | Polish |
| local.language.name | Portuguese |
| local.language.name | Romanian |
| local.language.name | Slovenian |
| local.language.name | Swedish |
| local.language.name | Turkish |
| local.size.info | 274376 sentences |
| local.size.info | 5439204 tokens |
| local.size.info | 62218 multiWordUnits |
| local.sponsor | euFunds IC1207 COST PARSEME: PARSing and Multi-word Expressions |
| metashare.ResourceInfo#ContentInfo.mediaType | text |
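
The description above says the corpora are distributed in the parsemetsv format, inspired by CoNLL-U. This record does not spell out the column layout, so the sketch below rests on an assumption that should be checked against the README.md and guidelines shipped with the item: four tab-separated columns (token rank, surface form, an `nsp` no-space flag, and a `;`-separated list of VMWE codes such as `1:LVC` on the first token of an expression and `1` on its continuation), with blank lines between sentences. The function names and the sample sentence are illustrative only.

```python
def read_parsemetsv(lines):
    """Yield sentences as lists of (rank, token, no_space, mwe_codes).

    Assumed layout (not confirmed by this record): rank, token, nsp flag,
    VMWE codes, separated by tabs; blank lines separate sentences.
    """
    sentence = []
    for raw in lines:
        line = raw.rstrip("\n")
        if not line.strip():
            # A blank line closes the current sentence.
            if sentence:
                yield sentence
                sentence = []
            continue
        if line.startswith("#"):
            # Skip comment lines, if any are present.
            continue
        fields = line.split("\t")
        rank, token = fields[0], fields[1]
        no_space = len(fields) > 2 and fields[2] == "nsp"
        mwe_field = fields[3] if len(fields) > 3 else "_"
        codes = [] if mwe_field in ("_", "") else mwe_field.split(";")
        sentence.append((rank, token, no_space, codes))
    if sentence:
        yield sentence


def collect_vmwes(sentence):
    """Group the tokens of one sentence by VMWE identifier."""
    groups = {}
    for _rank, token, _no_space, codes in sentence:
        for code in codes:
            mwe_id, _, category = code.partition(":")
            entry = groups.setdefault(mwe_id, {"category": None, "tokens": []})
            if category:
                entry["category"] = category
            entry["tokens"].append(token)
    return groups


# Hypothetical sample sentence, written for this sketch (not taken from the corpus).
SAMPLE = (
    "1\tShe\t_\t_\n"
    "2\tmade\t_\t1:LVC\n"
    "3\ta\t_\t_\n"
    "4\tdecision\tnsp\t1\n"
    "5\t.\t_\t_\n"
)

sentences = list(read_parsemetsv(SAMPLE.splitlines()))
vmwes = collect_vmwes(sentences[0])
```

With the sample above, `vmwes` maps the identifier `"1"` to the category `LVC` and the tokens `made` and `decision`. Real files may use conventions this sketch does not handle, so treat it as a starting point rather than a reference reader.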
Files in this item
| Name | Size | Format | Description | MD5 |
| IT.tgz | 4.82 MB | application/x-gzip | gzip Archive | 64ab3ab19e87767e9fa9764130e41046 |
| CS.tgz | 10.56 MB | application/x-gzip | gzip Archive | b97b0f5bed1ed94f096be4150ee68049 |
| TR.tgz | 4.65 MB | application/x-gzip | gzip Archive | 4d34bb3f81dec21184b9877da2dcf12b |
| ES.tgz | 2.16 MB | application/x-gzip | gzip Archive | e1ae9704c3608f78cf57e09bb9b165dd |
| SV.tgz | 499.71 KB | application/x-gzip | gzip Archive | 52db26a0ba0dbc2e283c4551795b5271 |
| FR.tgz | 7.89 MB | application/x-gzip | gzip Archive | 69f9a65d2a6c127573f8b646cc10eeb3 |
| EL.tgz | 3.59 MB | application/x-gzip | gzip Archive | 0956143f60ed16a0c01c4ec87e6c07f3 |
| FA.tgz | 593.52 KB | application/x-gzip | gzip Archive | cc0686b1f93e0b0782855e514c54823a |
| HE.tgz | 804.67 KB | application/x-gzip | gzip Archive | 6e20cc548086eeb18469841b0c5b1393 |
| DE.tgz | 2.09 MB | application/x-gzip | gzip Archive | 94ea5c3f074b783a090946e9e7e208ce |
| Annotation_guidelines_PARSEME_Shared_Task_1.0.pdf | 608.46 KB | application/pdf | Adobe PDF | 7efe5547bd0d85cd3f341f0125a35a6c |
| Description_paper_PARSEME_Shared_Task_1.0.pdf | 278.76 KB | application/pdf | Adobe PDF | 6947539d298d53bbcd9024437bd29939 |
| RO.tgz | 8 MB | application/x-gzip | gzip Archive | 83f76629b83fc380facb4e11e98f119e |
| PT.tgz | 5.1 MB | application/x-gzip | gzip Archive | eec1f919ce50aad1099d69381ab1f76e |
| PL.tgz | 3.45 MB | application/x-gzip | gzip Archive | 4f54d970d85b325b4c3b3f621e6c192d |
| HU.tgz | 1.3 MB | application/x-gzip | gzip Archive | 06ce4fa53dcaeda0b1bdce90a24637ea |
| SL.tgz | 2.89 MB | application/x-gzip | gzip Archive | abf463cd1bf7855d35efb581706ebf66 |
| MT.tgz | 2.89 MB | application/x-gzip | gzip Archive | 58f1b8bc4dc99429f504df50c59d21e5 |
| LT.tgz | 1.18 MB | application/x-gzip | gzip Archive | 4b2e19fdb954c52a1adf1b1dc05de4a0 |
| BG.tgz | 959.28 KB | application/x-gzip | gzip Archive | b29b32b039d4b7cfafc02569a9e90dcd |
| README.md | 2.67 KB | application/octet-stream | Unknown | 3b65e76fcb453f3dbe570240b4a0ca3a |
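
Each file in this item is listed with an MD5 checksum, which can be used to confirm that a download arrived intact. A minimal sketch using Python's standard `hashlib` follows; the function names and the local file path are assumptions made for illustration, and the expected value must be copied from the listing for the file in question.

```python
import hashlib


def md5_of_file(path, chunk_size=1 << 20):
    """Return the hexadecimal MD5 digest of a file, read in chunks."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        # Read fixed-size chunks so large archives do not need to fit in memory.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def verify_download(path, expected_md5):
    """Compare a file's MD5 digest with the value listed in the record."""
    return md5_of_file(path) == expected_md5.strip().lower()


# Example call, assuming IT.tgz was saved to the current directory:
# verify_download("IT.tgz", "64ab3ab19e87767e9fa9764130e41046")
```

A mismatch usually points to a truncated or corrupted download. MD5 is suitable for this kind of integrity check but should not be relied on as protection against deliberate tampering.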

