Skip to search
Skip to main content
Skip to first result
Search
Search Results
Creator:
Majliš, Martin
Publisher:
Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL)
Type:
text and corpus
Subject:
multilingual corpora
Language:
Afrikaans , Tosk Albanian , Amharic , Arabic , Aragonese , Egyptian Arabic , Asturian , Azerbaijani , Belarusian , Bengali , Bosnian , Bishnupriya , Breton , Buginese , Bulgarian , Catalan , Cebuano , Czech , Chuvash , Corsican , Welsh , Danish , German , Dimli (individual language) , Modern Greek (1453-) , English , Esperanto , Estonian , Basque , Faroese , Persian , Finnish , French , Western Frisian , Gan Chinese , Scottish Gaelic , Irish , Galician , Gilaki , Gujarati , Haitian , Serbo-Croatian , Hebrew , Fiji Hindi , Hindi , Croatian , Upper Sorbian , Hungarian , Armenian , Ido , Interlingua (International Auxiliary Language Association) , Indonesian , Icelandic , Italian , Javanese , Japanese , Kannada , Georgian , Kazakh , Korean , Kurdish , Latin , Latvian , Limburgan , Lithuanian , Lombard , Luxembourgish , Malayalam , Marathi , Macedonian , Malagasy , Mongolian , Maori , Malay (macrolanguage) , Burmese , Neapolitan , Low German , Nepali (macrolanguage) , Newari , Dutch , Norwegian Nynorsk , Norwegian , Occitan (post 1500) , Ossetian , Pampanga , Piemontese , Polish , Portuguese , Quechua , Romanian , Russian , Yakut , Sicilian , Scots , Slovak , Slovenian , Spanish , Albanian , Serbian , Sundanese , Swahili (macrolanguage) , Swedish , Tamil , Tatar , Telugu , Tajik , Tagalog , Thai , Turkish , Ukrainian , Urdu , Uzbek , Venetian , Vietnamese , Volapük , Waray (Philippines) , Walloon , Yiddish , Yoruba , and Chinese
Description:
A set of corpora for 120 languages automatically collected from wikipedia and the web.
Collected using the W2C toolset: http://hdl.handle.net/11858/00-097C-0000-0022-60D6-1
Rights:
Attribution-ShareAlike 3.0 Unported (CC BY-SA 3.0) , http://creativecommons.org/licenses/by-sa/3.0/ , and PUB
Creator:
Muller, Anton (cinny 18. stoleti-19. stoleti) and Buchler, Jan (cinny 1792-1812)
Publisher:
K dostanj v Jana Buchlera, knihkupce
Type:
model:monograph and TEXT
Language:
Czech and Latin
Rights:
http://creativecommons.org/publicdomain/mark/1.0/ and policy:public
Creator:
Müller, Thomas and Schütze, Hinrich
Publisher:
Center for Information and Language Processing, University of Munich
Type:
text and corpus
Subject:
morphological dictionary , morphological analysis , and PoS tagging
Language:
English , German , Latin , Hungarian , Spanish , and Czech
Description:
Dictionaries with different representations for various languages. Representations include brown clusters of different sizes and morphological dictionaries extracted using different morphological analyzers. All representations cover the most frequent 250,000 word types on the Wikipedia version of the respective language.
Analzers used: MAGYARLANC (Hungarian, Zsibrita et al. (2013)), FREELING (English and Spanish, Padro and Stanilovsky (2012)), SMOR (German, Schmid et al. (2004)), an MA from Charles University (Czech, Hajic (2001)) and LATMOR (Latin, Springmann et al. (2014)).
Rights:
Creative Commons - Attribution 3.0 Unported (CC BY 3.0) , http://creativecommons.org/licenses/by/3.0/ , and PUB
Publisher:
University of Leipzig
Type:
corpus
Language:
Afrikaans , Albanian , Bulgarian , Catalan , Chinese , Croatian , Czech , Danish , Dutch , English , Esperanto , Estonian , Finnish , French , German , Hungarian , Icelandic , Indonesian , Italian , Japanese , Korean , Latin , Latvian , Lithuanian , Malay (macrolanguage) , Norwegian , Occitan (post 1500) , Romanian , Russian , Slovak , Slovenian , Spanish , Sundanese , Swedish , Tagalog , Turkish , Vietnamese , and Welsh
Description:
Collected from newspaper texts, webcrawling, etc.: words (+frequency), cooccurrences (+graph), left/right neighbours, example sentences
Rights:
Not specified
Creator:
Václav Dobřenský and Cžerny, Giřij
Publisher:
Giřij Cžerny
Format:
print and [125] ff, 12°
Type:
model:monograph and TEXT
Language:
Czech and Latin
Description:
Černý z Černého Mostu, Jiří
Rights:
http://creativecommons.org/publicdomain/mark/1.0/ and policy:public
Creator:
Adrichem, Christiaan van (1533-1585) , Capella, Petr (asi 1550-1599) , Carolides z Karlsperka, Jiri (1569-1612) , Adam z Veleslavina, Daniel (1546-1599) , and Funk z Olivetu, Jiri (1545-asi 1617)
Publisher:
[v M. Danyele Adama z Weleslawjna]
Type:
model:monograph and TEXT
Language:
Czech and Latin
Rights:
http://creativecommons.org/publicdomain/mark/1.0/ and policy:public
Publisher:
Repronis,
Type:
sborníky
Subject:
Dějiny civilizace. Kulturní dějiny , Oliva, Pavel, , sborníky , věda o antice , antika , dějiny starověkého Řecka , české (československé) sborníky a kolektivní monografie , antický svět , and personální bibliografie
Language:
Czech , Latin , Polish , Russian , and Slovak
Description:
Publikace je věnována profesorovi PhDr. Pavlu Olivovi, DrSc. k jeho životnímu jubileu
Rights:
unknown
Creator:
Bečvář, Jindřich,
Type:
text and monografie
Subject:
Algebra , matematika , dějiny matematiky , algebra , přehledná zpracování světových dějin (chronologicky) , and matematika, kybernetika
Language:
Czech , English , French , German , and Latin
Description:
Nad názvem: katedra didaktiky matematiky, Matematicko-fyzikální fakulta Univerzity Karlovy
Rights:
unknown
Creator:
Sedláček, August,
Type:
text and studie
Subject:
Dějiny Česka a Slovenska , obce , dějiny obcí , přehledná zpracování dějin českých zemí (chronologicky) , and města, obce
Language:
Latin and Czech
Rights:
unknown
Type:
text and listiny
Subject:
Dějiny Česka a Slovenska , Karel , privilegia městská , panovníci , listiny , právo městské , panovníci, panovnické rody, dvory , města, obce , and české země 1306-1419
Language:
Czech and Latin
Description:
Částečně souběžný latinský text, francouzské, německé a anglické resumé
Rights:
unknown