
Russian Media Corpus on the Harris–Trump Debate (RMC_HTD)

Please use the following text to cite this item or export to a predefined format:
Shorokhova, Elena, 2025, Russian Media Corpus on the Harris–Trump Debate (RMC_HTD), LINDAT/CLARIAH-CZ digital library at the Institute of Formal and Applied Linguistics (ÚFAL), http://hdl.handle.net/20.500.12800/1-6038.
Date issued
2025-11-13
Size
19 entries
Language(s)
Description
Russian Media Corpus on the Harris–Trump Debate contains metadata from Russian-language news articles reporting on the presidential debate between Kamala Harris and Donald Trump, which took place on 10 September 2024 and was broadcast by ABC News. The corpus includes articles published by four Russian-language media outlets: Kommersant, Argumenty i Fakty, Meduza, and BBC News Russian. All articles were published on 11 September 2024. The corpus consists of 19 articles written in Russian. The primary purpose of the corpus is to support research in Critical Discourse Analysis and studies on media representation of the event in the Russian-speaking press.
Acknowledgement
 Files in this item
Name
RMC_HTD.csv
Size
3.74 KB
Format
text/csv
Description
MD5
baec92afc706eadf634a9c9771f25dc6
Name
README.txt
Size
2.31 KB
Format
text/plain
Description
MD5
51f949ec4a7dfa968063f671669cdf39
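The MD5 values listed for the two files can be used to check that a download arrived intact. The following is a minimal sketch, assuming the files were saved locally under the names shown on this page; the helper names `md5_of_file` and `verify` are illustrative and not part of the repository.

```python
import hashlib

# Checksums copied from the file listing on this page.
EXPECTED_MD5 = {
    "RMC_HTD.csv": "baec92afc706eadf634a9c9771f25dc6",
    "README.txt": "51f949ec4a7dfa968063f671669cdf39",
}


def md5_of_file(path, chunk_size=65536):
    """Return the hexadecimal MD5 digest of the file at `path`."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        # Read in chunks so the whole file never has to sit in memory.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def verify(path, name):
    """Return True if the file at `path` matches the listed checksum for `name`."""
    return md5_of_file(path) == EXPECTED_MD5[name]
```

For example, `verify("RMC_HTD.csv", "RMC_HTD.csv")` should return `True` for an unmodified copy of the CSV file. A mismatch usually points to an incomplete download or to a file that was re-saved by an editor.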
    README – Russian Media Corpus on the Harris–Trump Debate (RMC_HTD)
    Version 1.0
    1. General Description
    This corpus contains metadata from Russian-language news articles reporting on the presidential debate between Kamala Harris and Donald Trump, which took place on 10 September 2024 and was broadcast by ABC News.
    The corpus includes articles published by four Russian-language media outlets: Kommersant, Argumenty i Fakty, Meduza, and BBC News Russian.
    All articles were published on 11 September 2024.
    The corpus consists of 19 articles written in Russian.
    The primary purpose of the corpus is to support research in Critical Discourse Analysis and studies on media representation of the event in the Russian-speaking press.
    2. Corpus Content
    The corpus contains only metadata of the articles, not the full texts (to comply with copyright regulations).
    Each news item includes the following fields:
    •	ID
    •	Media outlet
    •	Section
    •	Publication date
    •	Author (if any)
    •	URL
    •	Word count
    •	Language
    File format: CSV UTF-8
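The field list above suggests a simple way to load and sanity-check the metadata. The following is a minimal sketch, assuming the CSV header row uses exactly the field labels listed above and a comma delimiter; the actual header spelling and delimiter in RMC_HTD.csv may differ. The function names and the sample rows are made up for illustration and are not taken from the corpus.

```python
import csv
import io
from collections import Counter

# Assumed header names, copied from the field list in the README.
# The real header row of RMC_HTD.csv may be spelled differently.
EXPECTED_FIELDS = [
    "ID", "Media outlet", "Section", "Publication date",
    "Author", "URL", "Word count", "Language",
]


def read_metadata(stream):
    """Return the rows of a metadata CSV as a list of dicts."""
    reader = csv.DictReader(stream)
    header = reader.fieldnames or []
    missing = [field for field in EXPECTED_FIELDS if field not in header]
    if missing:
        raise ValueError(f"missing columns: {missing}")
    return list(reader)


def articles_per_outlet(rows):
    """Count rows per media outlet."""
    return Counter(row["Media outlet"] for row in rows)


# Made-up placeholder rows, NOT taken from the corpus.
SAMPLE = (
    "ID,Media outlet,Section,Publication date,Author,URL,Word count,Language\n"
    "1,Outlet A,World,2024-09-11,,https://example.org/1,500,ru\n"
    "2,Outlet A,Politics,2024-09-11,,https://example.org/2,750,ru\n"
    "3,Outlet B,World,2024-09-11,,https://example.org/3,300,ru\n"
)

rows = read_metadata(io.StringIO(SAMPLE))
counts = articles_per_outlet(rows)
total_words = sum(int(row["Word count"]) for row in rows)
```

To run the same check on the real file, open it with `open("RMC_HTD.csv", encoding="utf-8-sig", newline="")` and pass the file object to `read_metadata`. The `utf-8-sig` codec reads UTF-8 with or without a byte order mark, which spreadsheet exports labelled "CSV UTF-8" often include.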
    
    3. Data Sources and Collection Method
    Articles were collected manually using the following criteria:
    •	keywords related to the debate, such as “Харрис” (Harris), “Трамп” (Trump), and “дебаты” (debate)
    •	publication date of 11 September 2024 (11.09.2024)
    The articles were retrieved from the official digital editions of the four media outlets listed above.
    Only articles specifically addressing the Harris–Trump debate of 10 September 2024 were included.
    4. License and Permitted Use
    This corpus contains metadata only and is distributed under the CC BY 4.0 license.
    5. Corpus Structure
    /corpus/
        metadata.csv
        readme.md
    6. Intended Use
    This corpus is intended for critical discourse analysis, translation studies, pragmatic studies, framing analysis, and related research.
    The corpus must not be used for commercial purposes unless the license explicitly allows it.
    7. Authors and Contact
    Author: Elena Shorokhova
    Affiliation: Universidad Rey Juan Carlos
    Contact: elena.shorokhova@urjc.es
    8. Recommended Citation
    Shorokhova,Ele . . .