dc.contributor.author | Hladká, Barbora |
dc.contributor.author | Mírovský, Jiří |
dc.date.accessioned | 2022-10-17T13:12:22Z |
dc.date.available | 2022-10-17T13:12:22Z |
dc.date.issued | 2022-10-14 |
dc.identifier.uri | http://hdl.handle.net/11234/1-4912 |
dc.description | Preamble 1.0 is a multilingual annotated corpus of the preamble of the EU REGULATION 2020/2092 OF THE EUROPEAN PARLIAMENT AND OF THE COUNCIL. The corpus consists of four language versions of the preamble (Czech, English, French, Polish), each of them annotated with sentence subjects. The data were annotated in the Brat tool (https://brat.nlplab.org/) and are distributed in the Brat native format, i.e. each annotated preamble is represented by the original plain text and a stand-off annotation file. |
dc.language.iso | ces |
dc.language.iso | eng |
dc.language.iso | fra |
dc.language.iso | pol |
dc.publisher | Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL) |
dc.rights | Creative Commons - Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0) |
dc.rights.uri | http://creativecommons.org/licenses/by-nc-sa/4.0/ |
dc.source.uri | https://ufal.mff.cuni.cz/courses/npfl134/subjann |
dc.subject | corpus |
dc.subject | multilingual |
dc.subject | subjects |
dc.title | Preamble 1.0 |
dc.type | corpus |
metashare.ResourceInfo#ContentInfo.mediaType | text |
dc.rights.label | PUB |
has.files | yes |
branding | LINDAT / CLARIAH-CZ |
contact.person | Jiří Mírovský mirovsky@ufal.mff.cuni.cz Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL) |
sponsor | 4EU+ European University Alliance 2021_F3_10 @SWitCH: Crash Course on Data Analytics for Students of Social Studies and Humanities euFunds |
size.info | 10173 words |
size.info | 522 items |
files.size | 37944 |
files.count | 2 |
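
The description above states that each annotated preamble is distributed as a plain-text file plus a Brat stand-off annotation file. As a minimal sketch of how such a pair could be read, the following Python parses the text-bound ("T") lines of the Brat stand-off format and checks their character offsets against the text. The sample sentence, the label name `Subject`, and the names `TextBound` and `parse_text_bound` are assumptions made for illustration; the labels and file names actually used in the corpus are not given in this record.

```python
from typing import List, NamedTuple, Tuple


class TextBound(NamedTuple):
    ann_id: str                   # e.g. "T1"
    label: str                    # annotation type, e.g. "Subject" (assumed name)
    spans: List[Tuple[int, int]]  # one (start, end) pair per fragment
    text: str                     # covered text as stored in the .ann file


def parse_text_bound(ann_content: str) -> List[TextBound]:
    """Parse the text-bound ("T") lines of a Brat stand-off annotation file."""
    result = []
    for line in ann_content.splitlines():
        if not line.startswith("T"):
            continue  # skip relations, attributes, notes and empty lines
        ann_id, info, text = line.split("\t", 2)
        label, offsets = info.split(" ", 1)
        # Discontinuous annotations separate their fragments with ";".
        spans = [
            (int(part.split()[0]), int(part.split()[1]))
            for part in offsets.split(";")
        ]
        result.append(TextBound(ann_id, label, spans, text))
    return result


# Hypothetical sample data, not taken from the corpus.
sample_txt = "The Commission shall assess the situation."
sample_ann = "T1\tSubject 0 14\tThe Commission\n"

annotations = parse_text_bound(sample_ann)
for ann in annotations:
    start, end = ann.spans[0]
    # Offsets index into the plain-text file, so the slice should match.
    assert sample_txt[start:end] == ann.text
```

With real data, the two strings would be read from a matching `.txt` and `.ann` pair extracted from the distribution archive, using UTF-8 encoding.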
Files in this item
This item is Publicly Available and licensed under: Creative Commons - Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0)
- Name: Preamble1.0.zip
- Size: 34.14 KB
- Format: application/zip
- Description: Preamble 1.0 distribution
- MD5: 6d9825ad77f165a80011e331be0a5e61

- Name: README.TXT
- Size: 2.92 KB
- Format: Text file
- Description: Preamble 1.0 description
- MD5: bdf82b43c155d3bf117d509e8080d449
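
The MD5 values listed for the two files can be used to check a download for corruption. The following is a minimal sketch using Python's standard `hashlib` module. The helper names `md5_of_file` and `verify_downloads` are hypothetical, and the sketch assumes the files were saved under the names shown in the listing.

```python
import hashlib
from pathlib import Path

# Checksums copied from the file listing of this item.
EXPECTED_MD5 = {
    "Preamble1.0.zip": "6d9825ad77f165a80011e331be0a5e61",
    "README.TXT": "bdf82b43c155d3bf117d509e8080d449",
}


def md5_of_file(path, chunk_size=65536):
    """Return the hexadecimal MD5 digest of a file, read in chunks."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def verify_downloads(directory="."):
    """Map each expected file name to True if it exists and its MD5 matches."""
    results = {}
    for name, expected in EXPECTED_MD5.items():
        path = Path(directory) / name
        results[name] = path.is_file() and md5_of_file(path) == expected
    return results
```

Calling `verify_downloads("path/to/downloads")` returns a dictionary such as `{"Preamble1.0.zip": True, "README.TXT": True}` when both files are present and intact. A missing or altered file is reported as `False`.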
============
Preamble 1.0
============

Authors
=======
Barbora Hladká (hladka@ufal.mff.cuni.cz)
Jiří Mírovský (mirovsky@ufal.mff.cuni.cz)

Introduction
============
Preamble 1.0 is a multilingual annotated corpus of the preamble of the EU REGULATION 2020/2092 OF THE EUROPEAN PARLIAMENT AND OF THE COUNCIL of 16 December 2020 on a general regime of conditionality for the protection of the Union budget. The corpus consists of four language versions of the preamble (source texts downloaded from the following web pages):

Czech (https://eur-lex.europa.eu/legal-content/CS/TXT/PDF/?uri=CELEX:32020R2092)
English (https://eur-lex.europa.eu/legal-content/EN/TXT/PDF/?uri=CELEX:32020R2092)
French (https://eur-lex.europa.eu/legal-content/FR/TXT/PDF/?uri=CELEX:32020R2092)
Polish (https://eur-lex.europa.eu/legal-content/PL/TXT/PDF/?uri=CELEX:32020R2092)

The language selection is based on languages used in the course NPFL134 (Data Analytics for Students of Social Studies and Humanities) at the I . . .