This is a new version of the repository. Do let us know (lindat-help at ufal.mff.cuni.cz) if you encounter any issues.

Lexicon of Czech and German Anaphoric Connectives

Please use the following text to cite this item or export to a predefined format:
Rysová, Kateřina; Poláková, Lucie; Rysová, Magdaléna and Mírovský, Jiří, 2019, Lexicon of Czech and German Anaphoric Connectives, LINDAT/CLARIAH-CZ digital library at the Institute of Formal and Applied Linguistics (ÚFAL), http://hdl.handle.net/11234/1-3075.
Date issued
2019-12-23
Size
98 entries
Language(s)
Description
GeCzLex 1.0 is an online electronic resource for translation equivalents of Czech and German discourse connectives. It contains anaphoric connectives for both languages and their possible translations documented in bilingual parallel corpora (not necessarily anaphoric). The entries have been interlinked via semantic annotation of the connectives (taken from monolingual lexicons of connectives CzeDLex and DiMLex) according to the PDTB 3 sense taxonomy and translation possibilities aquired from the Czech and German parallel data of the Intercorp project. The lexicon is the first bilingual inventory of connectives with linkage on the level of individual pairs (connective + discourse sense).
Acknowledgement
 Files in this item
Name
geczlex1.0.zip
Size
95.97 KB
Format
application/zip
Description
GeCzLex 1.0 distribution
MD5
1d652dca56af52a9d75665bc7fa80afc
Preview
  File Preview
  • geczlex1.0
    • index.html3 kB
    • HTML
      • connectives
        • c-dabei.html2 kB
        • c-davor.html913 B
        • c-presto.html2 kB
        • c-indessen.html2 kB
        • c-demnach.html1 kB
        • c-unterdessen.html1 kB
        • c-deswegen.html1 kB
        • c-oproti_tomu.html1 kB
        • c-zato.html2 kB
        • c-seitdem.html1 kB
        • c-potom.html5 kB
        • c-nachher.html1 kB
        • c-prestoze.html2 kB
        • c-damit.html1 kB
        • c-ovsem.html7 kB
        • c-naopak.html3 kB
        • c-pak.html3 kB
        • c-demgegenuber.html1 kB
        • c-mezitim.html3 kB
        • c-k_tomu.html2 kB
        • c-zaroven.html3 kB
        • c-rovnez.html2 kB
        • c-hierdurch.html1 kB
        • c-hingegen.html2 kB
        • c-daraufhin.html2 kB
        • c-darum.html1 kB
        • c-indem.html1 kB
        • c-worauf.html1 kB
        • c-darauf.html1 kB
        • c-pote.html4 kB
        • c-take.html2 kB
        • c-außerdem.html2 kB
        • c-navzdory_tomu.html4 kB
        • c-infolgedessen.html1 kB
        • c-totiz.html4 kB
        • c-ale.html7 kB
        • c-nebenher.html2 kB
        • c-allerdings.html3 kB
        • c-vsak.html7 kB
        • c-dafur.html889 B
        • c-hierzu.html1 kB
        • c-hiernach.html1 kB
        • c-a.html6 kB
        • c-naproti_tomu.html2 kB
        • c-nadto.html2 kB
        • c-nato.html1 kB
        • c-dagegen.html1 kB
        • c-wogegen.html1 kB
        • c-uberdies.html2 kB
        • c-na_rozdil_od_toho.html1 kB
        • c-misto_toho.html1 kB
        • c-vzhledem_k_tomu.html1 kB
        • c-stattdessen.html1 kB
        • c-tedy.html8 kB
        • c-zudem.html2 kB
        • c-natoz.html887 B
        • c-demzufolge.html1 kB
        • c-hiermit.html782 B
        • c-nachdem.html1 kB
        • c-krome_toho.html5 kB
        • c-dazu.html1 kB
        • c-dahingegen.html915 B
        • c-wobei.html2 kB
        • c-dale.html4 kB
        • c-hierauf.html1 kB
        • c-pricemz.html4 kB
        • c-woraufhin.html1 kB
        • c-proto.html2 kB
        • c-i.html3 kB
        • c-mithin.html1 kB
        • c-hieraufhin.html1 kB
        • c-wahrenddessen.html1 kB
        • c-dann.html3 kB
        • c-predtim.html2 kB
        • c-womit.html2 kB
        • c-daneben.html1 kB
        • c-kvuli_tomu.html2 kB
        • c-vedle_toho.html2 kB
        • c-seither.html1023 B
        • c-zatimco.html3 kB
        • c-vorher.html915 B
        • c-dementgegen.html937 B
        • c-pritom.html4 kB
        • c-mimoto.html1 kB
        • c-wohingegen.html1 kB
        • c-deshalb.html1 kB
        • c-hinterher.html1 kB
        • c-hierfur.html1 kB
        • c-protoze.html1 kB
        • c-dadurch.html1 kB
        • c-trotzdem.html2 kB
        • c-wodurch.html2 kB
        • c-navic.html3 kB
        • c-wonach.html1 kB
        • c-danach.html1 kB
        • c-zatim.html1 kB
        • c-diky_tomu.html3 kB
        • c-weswegen.html1 kB
      • tail.html491 B
      • list_ge.html6 kB
      • index.html780 B
      • list_cz.html4 kB
      • geczlex.css6 kB
      • pics
        • logo_ufal.png18 kB
      • header.html933 B
Name
geczlex1.0_index_lindat.html
Size
3.21 KB
Format
text/html
Description
GeCzLex 1.0 README
MD5
395614bc8c0b2812122a938b3906b933
Preview
  File Preview
    Czech–German Lexicon of Anaphoric Connectives 1.0 (GeCzLex 1.0)

    Czech–German Lexicon of Anaphoric Connectives 1.0 (GeCzLex 1.0)

    Introduction

    GeCzLex 1.0 is an online electronic resource for translation equivalents of Czech and German discourse connectives. It is one of the outcomes of a three-year research project on anaphoricity in Czech and German connectives (Anaphoricity in Connectives: Lexical Description and Bilingual Corpus Analysis), so the lexicon enlists anaphoric connectives for both languages and provides their possible translations documented in bilingual parallel corpora (not necessarily anaphoric).

    As a basis, two existing monolingual lexicons of connectives were used:

    • the Lexicon of Czech Discourse Connectives CzeDLex 0.6 and
    • the Lexicon of Discourse Markers DiMLex for German.

    Their relevant entries were interlinked via semantic annotation of the connectives according to the PDTB 3 sense taxonomy and statistical information of translation possibilities from the Czech and German parallel data of the Intercorp project. The lexicon is, as far as we know, the first bilingual inventory of connectives with linkage on the level of individual semantic categories of each connective. In total, there are 42 entries in the Czech part of GeCzLex 1.0, and 56 entries in the German part.

    How to get, open and browse the data

    GeCzLex 1.0 is available on-line as HTML web pages. It can also be downloaded from the LINDAT-Clarin repository under the Creative Commons - Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0) license.

    In the downloaded distribution, the lexicon is available in the HTML format - simply open the file index.html from the directory HTML in a web browser.

    How to Cite

    Kateřina Rysová, Lucie Poláková, Magdaléna Rysová, Jiří Mírovský: Lexicon of Czech and German Anaphoric Connectives. Data/software, Charles University, Prague, Czech Republic, http://hdl.handle.net/11234/1-3075, Dec 2019

    Updates, further information

    For updates and further information, see the on-line documentation to GeCzLex.