PARSEME Shared Task Data (v. 1.1) Agreement

(2018/07/30)

License Terms

PARSEME Shared Task VMWE Data (edition 1.1) is a collection of linguistic data and tools. Each of the corpora has its own license terms and you (the “User”) are responsible for complying with the license terms applicable to those parts which you use. If you do not agree with the license terms, you must stop using the corpora and destroy all copies of the data that you have obtained.


The license for every corpus included in the release is specified in the appropriate language directory. The licenses for VMWE annotations (column 11) and morphological/syntactic data (columns 1-10) can be different, which is also indicated in the table below. All files in the bin/ and trial/ folders are licensed under CC BY 4.0.


Overview of the corpora and their license terms

Language Language code VMWEs annotations (column 11) Morphological/syntactic data (columns 1-10)
Bulgarian BG CC BY 4.0 CC BY 4.0
German DE CC BY 4.0 CC BY-NC-SA 3.0 US
Greek EL CC BY-NC-SA 4.0 CC BY-NC-SA 4.0
English EN CC BY 4.0 CC BY-SA 4.0 (UD Original corpus and UD LinES corpus); CC BY-SA 3.0 (UD PUD corpus)
Spanish ES CC BY 4.0 CC BY 4.0 (IXA corpus); GNU GPL 3.0 (Ancora corpus); CC BY-NC-SA 3.0 US (UD corpus)
Basque EU CC BY-NC-SA 4.0 CC BY-NC-SA 4.0
Farsi FA CC BY-NC-SA 4.0 (special license for the MULTEX-East corpus, see README.md file for Farsi) CC BY-NC-SA 4.0 (special license for the MULTEX-East corpus, see README.md file for Farsi)
French FR CC BY 4.0 CC BY-NC-SA 4.0 (UD corpus); LGPL-LR (Sequoia corpus)
Hebrew HE CC BY-NC-SA 4.0 CC BY-NC-SA 4.0
Hindi HI CC BY-NC-SA 4.0 CC BY-NC-SA 4.0
Croatian HR CC BY-NC-SA 4.0 CC BY-NC-SA 4.0
Hungarian HU CC BY 4.0 GNU GPL 3.0
Italian IT CC BY-NC-SA 4.0 CC BY-NC-SA 4.0
Lithuanian LT CC BY-NC-SA 4.0 CC BY-NC-SA 4.0
Polish PL CC BY 4.0 CC BY-SA 4.0 (columns 1-6 of sentences 120-2-*, 310-* and 330-*); GNU GPL 3.0 (columns 1-6 of remainder sentences); CC BY-SA 4.0 (columns 7-9 of all sentences)
Portuguese PT CC BY-NC-SA 4.0 CC BY-NC-SA 4.0
Romanian RO CC BY 4.0 CC BY 4.0
Slovene SL CC BY-NC-SA 4.0 CC BY-NC-SA 4.0
Turkish TR CC BY-NC-SA 4.0 CC BY-NC-SA 4.0

Licenses

License URL
GNU GPL 3.0 http://opensource.org/licenses/GPL-3.0
LGPL-LR http://infolingu.univ-mlv.fr/DonneesLinguistiques/Lexiques-Grammaires/lgpllr.html
CC BY-SA 3.0 http://creativecommons.org/licenses/by-sa/3.0/
CC BY-NC-SA 3.0 http://creativecommons.org/licenses/by-nc-sa/3.0/
CC BY-NC-SA 3.0 US http://creativecommons.org/licenses/by-nc-sa/3.0/us/
CC BY 4.0 http://creativecommons.org/licenses/by/4.0/
CC BY-SA 4.0 http://creativecommons.org/licenses/by-sa/4.0/
CC BY-NC-SA 4.0 http://creativecommons.org/licenses/by-nc-sa/4.0/