|

Subcorpora

Eusko Legebiltzarreko corpusa ParlaMint-ES-PV

Source(s)

Eusko Legebiltzarreko aktak2015-02-01 - 2022-07-01

Location(s)

Becerro de Bengoa kaleaVitoria-GasteizBasque Country, Spain2015-02-01 - 2022-07-01

Names database

Transcriptions

Corpus search (16M tokens)


Linguistically annotated corpus of parliamentary debates ParlaMint-ES-PV

Documentation

Characteristics of the national parliament

The Basque Parliament (Basque: Eusko Legebiltzarra, Spanish: Parlamento Vasco) is the legislative body of the Basque Autonomous Community of Spain and the elected assembly to which the Basque Government is responsible.

The Parliament meets in the Basque capital, Vitoria-Gasteiz, although the first session of the modern assembly, as constituted by the Statute of Autonomy of the Basque Country, was held in Guernica – the symbolic centre of Basque freedoms – on 31 March 1980.

It is composed of seventy-five deputies representing citizens from the three provinces of the Basque autonomous community. Each province (Álava, Gipuzkoa and Biscay) elects the same number of deputies.

Data source and acquisition

The Basque Parliament Office in a session held on March 9, 2021, has adopted to share the transcripts of the Basque Parliament (and their translations, when possible), to that there is a corpus of Basque in the project ParlaMint (2021/1887).

Minutes of the Basque Parliament, Term X, XI and XII (2015 - 2022).

Data encoding process

Corpus-specific metadata

Structure

Linguistic annotation

PoS analysis was done using udpipe2.pl from service: http://lindat.mff.cuni.cz/services/udpipe.

For NER a multilingual xlmroberta was fine-tuned for Basque and Spanish using these tags: PER, LOC, ORG and MISC.


Responsibles

Mikel IruskietaData retrieval and curation
Kike FernandezTechnical support and conversion to XML
Manex AgirrezabalData revision and support
Ekain ArrietaTechnical support and conversion from conll to XML
Rodrigo AgerriTrain the NER model for Basque and Spanish
Tomaž ErjavecTechnical support
Matyáš KoppTechnical support and support for bilingual analysis
Jon AlkortaParla-CLARIN TEI XML corpus encoding
Linguistic annotation