Skip to search
Skip to main content
Skip to first result
Search
Search Results
Creator:
Mareček, David , Yu, Zhiwei , Zeman, Daniel , and Žabokrtský, Zdeněk
Publisher:
Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL)
Type:
text and corpus
Subject:
part of speech , tagging , semi-supervised , and cross-language
Language:
Belarusian , Bosnian , Bulgarian , Czech , Serbo-Croatian , Croatian , Upper Sorbian , Macedonian , Polish , Russian , Slovak , Slovenian , Serbian , Ukrainian , Latvian , Lithuanian , Afrikaans , Danish , German , English , Faroese , Western Frisian , Swiss German , Icelandic , Limburgan , Luxembourgish , Low German , Dutch , Norwegian Nynorsk , Norwegian , Scots , Swedish , Yiddish , Aragonese , Asturian , Catalan , French , Galician , Haitian , Italian , Latin , Lombard , Neapolitan , Piemontese , Portuguese , Romanian , Spanish , Venetian , Walloon , Breton , Welsh , Scottish Gaelic , Irish , Modern Greek (1453-) , Armenian , Albanian , Dimli (individual language) , Persian , Gilaki , Kurdish , Tajik , Bengali , Bishnupriya , Gujarati , Fiji Hindi , Hindi , Marathi , Nepali (macrolanguage) , Urdu , Amharic , Arabic , Egyptian Arabic , Hebrew , Estonian , Finnish , Hungarian , Basque , Georgian , Chuvash , Azerbaijani , Turkish , Uzbek , Kazakh , Tatar , Yakut , Korean , Mongolian , Telugu , Kannada , Malayalam , Tamil , Newari , Vietnamese , Indonesian , Javanese , Malagasy , Maori , Malay (macrolanguage) , Pampanga , Sundanese , Tagalog , Waray (Philippines) , Swahili (macrolanguage) , Esperanto , Ido , Interlingua (International Auxiliary Language Association) , and Volapük
Description:
Texts in 107 languages from the W2C corpus (http://hdl.handle.net/11858/00-097C-0000-0022-6133-9), first 1,000,000 tokens per language, tagged by the delexicalized tagger described in Yu et al. (2016, LREC, Portorož, Slovenia).
Changes in version 1.1:
1. Universal Dependencies tagset instead of the older and smaller Google Universal POS tagset.
2. SVM classifier trained on Universal Dependencies 1.2 instead of HamleDT 2.0.
3. Balto-Slavic languages, Germanic languages and Romance languages were tagged by classifier trained only on the respective group of languages. Other languages were tagged by a classifier trained on all available languages. The "c7" combination from version 1.0 is no longer used.
Rights:
Creative Commons - Attribution-ShareAlike 4.0 International (CC BY-SA 4.0) , http://creativecommons.org/licenses/by-sa/4.0/ , and PUB
Publisher:
Vojenský zeměpisný ústav
Format:
map and 1 mapa : barevná ; 39 x 52 cm na listu 48 x 62 cm
Type:
model:map , cartographic , and IMAGE
Subject:
udc:913(4) , Konspekt:7 , udc:912 , udc:913(437.6) , udc:912.43 , udc:(084.3) , Konspekt:Geografie Evropy, reálie, cestování , Konspekt:Mapy. Atlasy. Glóby , and czenas:Dunajská Streda (Slovensko : oblast)
Language:
Czech , Slovak , and Hungarian
Description:
4859, Legenda, Edice dle kladu listů, and (Language) Místní názvy slovensky a maďarsky
Rights:
http://creativecommons.org/publicdomain/mark/1.0/ and policy:public
Type:
text and adresáře
Subject:
Seriálové publikace. Periodika , dějiny světové , and zahraniční periodika a sborníky
Language:
Hungarian
Rights:
unknown
Creator:
Tarafás, Imre
Type:
text and studie
Subject:
Historická věda. Pomocné vědy historické. Archivnictví , Český časopis historický , historiografie , historiografie maďarská , časopisy maďarské , časopisy vědecké , české země 1848-1918 , Maďarsko , světové dějiny 1789-1918 , dějepisectví, historické vědy, historici , historiografie, vědecké projekty , české časopisy a sborníky (dějiny) , and zahraniční periodika a sborníky
Language:
Hungarian
Description:
In each other's mirror. Hungarian historiography in the Český Časopis Historický, and Czech historiography in the Századok between 1867 and 1918.
Rights:
unknown
Type:
text and bibliografie
Subject:
Dějiny zemí střední Evropy , Bibliografie. Katalogy , bibliografie oborové , historiografie maďarská , bibliografie oborové a tematické, rejstříky časopisů , and Maďarsko
Language:
English and Hungarian
Rights:
unknown
Creator:
Pecina, Pavel and Saleh, Shadi
Publisher:
Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL)
Type:
text and corpus
Subject:
cross-lingual information retrieval and machine translation
Language:
English , Czech , French , German , Hungarian , Polish , Spanish , and Swedish
Description:
This package contains an extended version of the test collection used in the CLEF eHealth Information Retrieval tasks in 2013--2015. Compared to the original version, it provides complete query translations into Czech, French, German, Hungarian, Polish, Spanish and Swedish and additional relevance assessment.
Rights:
Creative Commons - Attribution-NonCommercial 4.0 International (CC BY-NC 4.0) , http://creativecommons.org/licenses/by-nc/4.0/ , and PUB
Creator:
Laczlavik, György
Type:
text and studie
Subject:
Dějiny zemí střední Evropy , bitva u Moháče (1526) , války turecké , Maďarsko , světové dějiny 1492-1648 , vojenské operace, války, bitvy , and Osmanská říše
Language:
Hungarian
Description:
Wendepunkt? Was veränderte sich in den zwei Jahrzehnten nach der Schalcht bei Mohács im Ungarischen Königreich?
Rights:
unknown
Creator:
Horbulák, Zsolt
Type:
text and studie
Subject:
Národní hospodářství a hospodářská politika , Rozsypal, Kurt, , Šik, Ota, , reformy hospodářské , dějiny hospodářské , hospodářství socialistické , ekonomové , Československo 1948-1969 , hospodářské dějiny , and ekonomie, ekonomové
Language:
Hungarian
Description:
Economic Reform Attempts in Socialist Czechoslovakia.
Rights:
unknown
Publisher:
Vojenský zeměpisný ústav
Format:
map and 1 mapa : barevná ; 39 x 51 cm na listu 48 x 63 cm
Type:
model:map , cartographic , and IMAGE
Subject:
udc:913(4) , Konspekt:7 , udc:912 , udc:913(437.6) , udc:912.43 , udc:(084.3) , Konspekt:Geografie Evropy, reálie, cestování , Konspekt:Mapy. Atlasy. Glóby , and czenas:Hajnáčka (Slovensko : oblast)
Language:
Czech , Slovak , and Hungarian
Description:
4764, Legenda, Edice dle kladu listů, and (Language) Místní názvy slovensky a maďarsky
Rights:
http://creativecommons.org/publicdomain/mark/1.0/ and policy:public
Creator:
Zeman, Daniel , Mareček, David , Mašek, Jan , Popel, Martin , Ramasamy, Loganathan , Rosa, Rudolf , Štěpánek, Jan , and Žabokrtský, Zdeněk
Publisher:
Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL)
Type:
text and corpus
Subject:
treebank , Stanford dependencies , Prague dependencies , harmonization , common annotation style , and Interset
Language:
Arabic , Bulgarian , Bengali , Catalan , Czech , Danish , German , Modern Greek (1453-) , English , Spanish , Estonian , Basque , Persian , Finnish , Ancient Greek (to 1453) , Hindi , Hungarian , Italian , Japanese , Latin , Dutch , Portuguese , Romanian , Russian , Slovak , Slovenian , Swedish , Tamil , Telugu , and Turkish
Description:
HamleDT 2.0 is a collection of 30 existing treebanks harmonized into a common annotation style, the Prague Dependencies, and further transformed into Stanford Dependencies, a treebank annotation style that became popular recently. We use the newest basic Universal Stanford Dependencies, without added language-specific subtypes.
Rights:
HamleDT 2.0 Licence Agreement , https://lindat.mff.cuni.cz/repository/xmlui/page/licence-hamledt-2.0 , and ACA