
Czech Etymological Lexicon 1.0

Please use the following text to cite this item:
Rejzek, Jiří; Papáček, Aleš; Brezinová, Viktória and Žabokrtský, Zdeněk, 2025, Czech Etymological Lexicon 1.0, LINDAT/CLARIAH-CZ digital library at the Institute of Formal and Applied Linguistics (ÚFAL), http://hdl.handle.net/11234/1-5845.
Date issued
2025-01-28
Size
10502 words
Description
The Czech Etymological Lexicon, version 1.0, contains 10,502 Czech words, each annotated with a sequence of ISO 639-3 language codes representing its etymological origin.

The dataset is provided in a simple tab-separated format with three columns: the lemma, the language codes separated by commas, and a label indicating whether the word is a loanword ("loan") or a native word ("native"). Here "native" refers to inherited words, as opposed to loanwords.

Example entry (columns are separated by tabs):
    architekt    deu,lat,ell    loan
The word architekt originated in Greek and came to Czech through Latin and German. In this example the codes run from the most recent source language (German) back to the earliest (Greek).

The language sequences were extracted from the printed dictionary: REJZEK, Jiří. Český etymologický slovník [Czech etymological dictionary]. LEDA, 2015. The extraction from the dictionary entries was fully automated and may therefore contain errors. Please consult the original dictionary when precise information is needed.
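The three-column layout described above can be read with a few lines of Python. The following is a minimal sketch, not an official loader: the names Entry and parse_lexicon are hypothetical, and it assumes the file is UTF-8 encoded, has no header row, and contains no quoted fields. None of these assumptions is confirmed by the description, and how the language column looks for "native" entries is not stated either, so empty code lists are tolerated.

```python
import csv
import io
from typing import List, NamedTuple


class Entry(NamedTuple):
    """One lexicon row (hypothetical container, not part of the dataset)."""
    lemma: str
    languages: List[str]  # ISO 639-3 codes, in the order given in the file
    status: str           # "loan" or "native", per the description


def parse_lexicon(stream) -> List[Entry]:
    """Parse tab-separated rows of the form: lemma, comma-separated codes, status.

    Assumption: no header row and no quoting. Rows with fewer than
    three columns are skipped rather than guessed at.
    """
    entries = []
    reader = csv.reader(stream, delimiter="\t", quoting=csv.QUOTE_NONE)
    for row in reader:
        if len(row) < 3:
            continue
        lemma, codes, status = row[0], row[1], row[2]
        languages = [code.strip() for code in codes.split(",") if code.strip()]
        entries.append(Entry(lemma, languages, status.strip()))
    return entries


# The example entry from the description, used as in-memory sample data.
sample = "architekt\tdeu,lat,ell\tloan\n"
entries = parse_lexicon(io.StringIO(sample))
print(entries[0])
```

To read the real file, the same function could be called as parse_lexicon(open("czetyl.tsv", encoding="utf-8", newline="")), again assuming UTF-8 encoding.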
 Files in this item
Name
documentation.pdf
Size
38.05 KB
Format
application/pdf
Description
documentation
MD5
155f8d5a24cc12d4f44d6963adf3e95d
Name
czetyl.tsv
Size
187.44 KB
Format
application/octet-stream
Description
data
MD5
42028535a8915355d99a324804b2ab0c
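The MD5 values listed for the two files can be used to check that a download is intact. The sketch below is one way to do so with Python's standard hashlib module. The helper names md5_of_file and matches_listed_md5 are hypothetical, and the local paths are assumed to match the file names shown above.

```python
import hashlib

# MD5 checksums as listed in the file table of this record.
EXPECTED_MD5 = {
    "documentation.pdf": "155f8d5a24cc12d4f44d6963adf3e95d",
    "czetyl.tsv": "42028535a8915355d99a324804b2ab0c",
}


def md5_of_file(path, chunk_size=65536):
    """Return the hexadecimal MD5 digest of a file, read in chunks."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_listed_md5(path, name):
    """Compare a local file against the checksum listed for `name`."""
    return md5_of_file(path) == EXPECTED_MD5[name]


# Example call (assumes czetyl.tsv was downloaded to the working directory):
# matches_listed_md5("czetyl.tsv", "czetyl.tsv")
```

MD5 is adequate for detecting accidental corruption during download, but it does not protect against deliberate tampering.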