
 
dc.contributor.author Hladká, Barbora
dc.contributor.author Mírovský, Jiří
dc.date.accessioned 2022-10-17T13:12:22Z
dc.date.available 2022-10-17T13:12:22Z
dc.date.issued 2022-10-14
dc.identifier.uri http://hdl.handle.net/11234/1-4912
dc.description Preamble 1.0 is a multilingual annotated corpus of the preamble of the EU REGULATION 2020/2092 OF THE EUROPEAN PARLIAMENT AND OF THE COUNCIL. The corpus consists of four language versions of the preamble (Czech, English, French, Polish), each of them annotated with sentence subjects. The data were annotated in the Brat tool (https://brat.nlplab.org/) and are distributed in the Brat native format, i.e. each annotated preamble is represented by the original plain text and a stand-off annotation file.
dc.language.iso ces
dc.language.iso eng
dc.language.iso fra
dc.language.iso pol
dc.publisher Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL)
dc.rights Creative Commons - Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0)
dc.rights.uri http://creativecommons.org/licenses/by-nc-sa/4.0/
dc.source.uri https://ufal.mff.cuni.cz/courses/npfl134/subjann
dc.subject corpus
dc.subject multilingual
dc.subject subjects
dc.title Preamble 1.0
dc.type corpus
metashare.ResourceInfo#ContentInfo.mediaType text
dc.rights.label PUB
has.files yes
branding LINDAT / CLARIAH-CZ
contact.person Jiří Mírovský mirovsky@ufal.mff.cuni.cz Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL)
sponsor 4EU+ European University Alliance 2021_F3_10 @SWitCH: Crash Course on Data Analytics for Students of Social Studies and Humanities euFunds
size.info 10173 words
size.info 522 items
files.size 37944
files.count 2


 Files in this item

 Download all files in item (37.05 KB)
Name: Preamble1.0.zip
Size: 34.14 KB
Format: application/zip
Description: Preamble 1.0 distribution
MD5: 6d9825ad77f165a80011e331be0a5e61
File preview:
  • Preamble1.0
    • README.TXT (2 kB)
    • data
      • fr
        • preamble_fr.ann (4 kB)
        • preamble_fr.txt (20 kB)
      • en
        • preamble_en.ann (4 kB)
        • preamble_en.txt (17 kB)
      • pl
        • preamble_pl.ann (3 kB)
        • preamble_pl.txt (18 kB)
      • cs
        • preamble_cs.ann (4 kB)
        • preamble_cs.txt (17 kB)
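
Each language directory listed above pairs a plain-text file (.txt) with a Brat stand-off annotation file (.ann). The following is a minimal sketch of how such a pair might be read in Python. It assumes UTF-8 encoding and handles only text-bound annotation lines (IDs starting with "T"), which in the Brat stand-off format hold a type label and character offsets into the text file. The label "Subject" and the sample sentence are invented for illustration; the actual label names used in Preamble 1.0 are not shown in this record. The helper names parse_ann and load_pair are hypothetical.

```python
from dataclasses import dataclass
from pathlib import Path


@dataclass
class TextBound:
    ann_id: str   # annotation ID, e.g. "T1"
    label: str    # annotation type (corpus label names are an assumption here)
    spans: list   # list of (start, end) character offsets into the .txt file
    text: str     # covered text as stored in the .ann file


def parse_ann(ann_text):
    """Return the text-bound ("T") annotations; other line types are skipped."""
    result = []
    for line in ann_text.splitlines():
        if not line.startswith("T"):
            continue
        parts = line.split("\t")
        if len(parts) < 3:
            continue  # malformed line, ignore
        ann_id, type_and_offsets, covered = parts[0], parts[1], parts[2]
        label, offsets = type_and_offsets.split(" ", 1)
        # Discontinuous annotations separate their fragments with ";".
        spans = [tuple(int(n) for n in frag.split()) for frag in offsets.split(";")]
        result.append(TextBound(ann_id, label, spans, covered))
    return result


def load_pair(directory, stem):
    """Read <stem>.txt and <stem>.ann from one language directory."""
    directory = Path(directory)
    text = (directory / f"{stem}.txt").read_text(encoding="utf-8")
    anns = parse_ann((directory / f"{stem}.ann").read_text(encoding="utf-8"))
    return text, anns


# Made-up example, not taken from the corpus.
sample_text = "The Commission shall assess the situation."
sample_ann = "T1\tSubject 0 14\tThe Commission\n"
anns = parse_ann(sample_ann)
start, end = anns[0].spans[0]
print(anns[0].label, repr(sample_text[start:end]))
```

With the archive unpacked, a call such as load_pair("Preamble1.0/data/en", "preamble_en") would return the English text together with its annotations, and slicing the text with each span should reproduce the covered string stored in the .ann file.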
Name: README.TXT
Size: 2.92 KB
Format: Text file
Description: Preamble 1.0 description
MD5: bdf82b43c155d3bf117d509e8080d449
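
The MD5 values listed for the two files can be used to check a download for corruption. A minimal sketch using Python's standard hashlib module follows; the local file paths are assumptions and depend on where the files were saved, and the helper names md5_of_file and verify are hypothetical.

```python
import hashlib

# Checksums as listed in this record.
EXPECTED_MD5 = {
    "Preamble1.0.zip": "6d9825ad77f165a80011e331be0a5e61",
    "README.TXT": "bdf82b43c155d3bf117d509e8080d449",
}


def md5_of_file(path, chunk_size=65536):
    """Compute the MD5 hex digest of a file, reading it in chunks."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def verify(path, expected_md5):
    """Return True if the file's MD5 digest matches the expected value."""
    return md5_of_file(path) == expected_md5.lower()
```

For example, verify("Preamble1.0.zip", EXPECTED_MD5["Preamble1.0.zip"]) should return True for an intact copy of the archive saved in the current directory.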
File preview:
============
Preamble 1.0
============


Authors
=======

Barbora Hladká (hladka@ufal.mff.cuni.cz)
Jiří Mírovský (mirovsky@ufal.mff.cuni.cz)

Introduction
============

Preamble 1.0 is a multilingual annotated corpus of the preamble of the
EU REGULATION 2020/2092 OF THE EUROPEAN PARLIAMENT AND OF THE COUNCIL
of 16 December 2020 on a general regime of conditionality for the protection
of the Union budget. The corpus consists of four language versions of the
preamble (source texts downloaded from the following web pages):

Czech (https://eur-lex.europa.eu/legal-content/CS/TXT/PDF/?uri=CELEX:32020R2092)
English (https://eur-lex.europa.eu/legal-content/EN/TXT/PDF/?uri=CELEX:32020R2092)
French (https://eur-lex.europa.eu/legal-content/FR/TXT/PDF/?uri=CELEX:32020R2092)
Polish (https://eur-lex.europa.eu/legal-content/PL/TXT/PDF/?uri=CELEX:32020R2092)

The language selection is based on languages used in the course NPFL134 (Data
Analytics for Students of Social Studies and Humanities) at the I . . .
                                            
