Skip to search
Skip to main content
Skip to first result
Search
Search Results
Creator:
Zeman, Daniel and Droganova, Kira
Publisher:
Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL)
Type:
text and corpus
Subject:
semantic dependency and universal dependencies
Language:
Afrikaans , Assyrian Neo-Aramaic , Akkadian , Amharic , Arabic , Belarusian , Breton , Bulgarian , Russia Buriat , Catalan , Czech , Church Slavic , Mandarin Chinese , Coptic , Welsh , Danish , German , Modern Greek (1453-) , English , Estonian , Basque , Faroese , Finnish , French , Irish , Gothic , Ancient Greek (to 1453) , Mbyá Guaraní , Hebrew , Hindi , Croatian , Upper Sorbian , Hungarian , Armenian , Indonesian , Italian , Japanese , Kazakh , Northern Kurdish , Korean , Komi-Zyrian , Karelian , Latin , Latvian , Lithuanian , Literary Chinese , Marathi , Erzya , Dutch , Norwegian , Old Russian , Nigerian Pidgin , Polish , Portuguese , Romanian , Russian , Sanskrit , Slovak , Slovenian , Northern Sami , Spanish , Serbian , Swedish , Tamil , Tagalog , Turkish , Ukrainian , Urdu , Vietnamese , Warlpiri , Wolof , Yoruba , Galician , Bhojpuri , Komi-Permyak , Livvi , Moksha , Scottish Gaelic , Skolt Sami , Icelandic , Albanian , Persian , Akuntsu , Apurinã , Khunsari , Manx , Mundurukú , Nayini , Soi , South Levantine Arabic , Tupinambá , Beja , Western Frisian , Urubú-Kaapor , Kangri , K'iche' , Low German , Makuráp , Western Armenian , and Central Siberian Yupik
Description:
Deep Universal Dependencies is a collection of treebanks derived semi-automatically from Universal Dependencies (http://hdl.handle.net/11234/1-3687). It contains additional deep-syntactic and semantic annotations. Version of Deep UD corresponds to the version of UD it is based on. Note however that some UD treebanks have been omitted from Deep UD.
Rights:
Licence Universal Dependencies v2.8 , https://lindat.mff.cuni.cz/repository/xmlui/page/license-ud-2.8 , and PUB
Creator:
Mareček, David , Yu, Zhiwei , Zeman, Daniel , and Žabokrtský, Zdeněk
Publisher:
Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL)
Type:
text and corpus
Subject:
part of speech , tagging , semi-supervised , and cross-language
Language:
Belarusian , Bosnian , Bulgarian , Czech , Serbo-Croatian , Croatian , Upper Sorbian , Macedonian , Polish , Russian , Slovak , Slovenian , Serbian , Ukrainian , Latvian , Lithuanian , Afrikaans , Danish , German , English , Faroese , Western Frisian , Swiss German , Icelandic , Limburgan , Luxembourgish , Low German , Dutch , Norwegian Nynorsk , Norwegian , Scots , Swedish , Yiddish , Aragonese , Asturian , Catalan , French , Galician , Haitian , Italian , Latin , Lombard , Neapolitan , Piemontese , Portuguese , Romanian , Spanish , Venetian , Walloon , Breton , Welsh , Scottish Gaelic , Irish , Modern Greek (1453-) , Armenian , Albanian , Dimli (individual language) , Persian , Gilaki , Kurdish , Tajik , Bengali , Bishnupriya , Gujarati , Fiji Hindi , Hindi , Marathi , Nepali (macrolanguage) , Urdu , Amharic , Arabic , Egyptian Arabic , Hebrew , Estonian , Finnish , Hungarian , Basque , Georgian , Chuvash , Azerbaijani , Turkish , Uzbek , Kazakh , Tatar , Yakut , Korean , Mongolian , Telugu , Kannada , Malayalam , Tamil , Newari , Vietnamese , Indonesian , Javanese , Malagasy , Maori , Malay (macrolanguage) , Pampanga , Sundanese , Tagalog , Waray (Philippines) , Swahili (macrolanguage) , Esperanto , Ido , Interlingua (International Auxiliary Language Association) , and Volapük
Description:
Texts in 107 languages from the W2C corpus (http://hdl.handle.net/11858/00-097C-0000-0022-6133-9), first 1,000,000 tokens per language, tagged by the delexicalized tagger described in Yu et al. (2016, LREC, Portorož, Slovenia).
Rights:
Creative Commons - Attribution-ShareAlike 4.0 International (CC BY-SA 4.0) , http://creativecommons.org/licenses/by-sa/4.0/ , and PUB
Creator:
Mareček, David , Yu, Zhiwei , Zeman, Daniel , and Žabokrtský, Zdeněk
Publisher:
Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL)
Type:
text and corpus
Subject:
part of speech , tagging , semi-supervised , and cross-language
Language:
Belarusian , Bosnian , Bulgarian , Czech , Serbo-Croatian , Croatian , Upper Sorbian , Macedonian , Polish , Russian , Slovak , Slovenian , Serbian , Ukrainian , Latvian , Lithuanian , Afrikaans , Danish , German , English , Faroese , Western Frisian , Swiss German , Icelandic , Limburgan , Luxembourgish , Low German , Dutch , Norwegian Nynorsk , Norwegian , Scots , Swedish , Yiddish , Aragonese , Asturian , Catalan , French , Galician , Haitian , Italian , Latin , Lombard , Neapolitan , Piemontese , Portuguese , Romanian , Spanish , Venetian , Walloon , Breton , Welsh , Scottish Gaelic , Irish , Modern Greek (1453-) , Armenian , Albanian , Dimli (individual language) , Persian , Gilaki , Kurdish , Tajik , Bengali , Bishnupriya , Gujarati , Fiji Hindi , Hindi , Marathi , Nepali (macrolanguage) , Urdu , Amharic , Arabic , Egyptian Arabic , Hebrew , Estonian , Finnish , Hungarian , Basque , Georgian , Chuvash , Azerbaijani , Turkish , Uzbek , Kazakh , Tatar , Yakut , Korean , Mongolian , Telugu , Kannada , Malayalam , Tamil , Newari , Vietnamese , Indonesian , Javanese , Malagasy , Maori , Malay (macrolanguage) , Pampanga , Sundanese , Tagalog , Waray (Philippines) , Swahili (macrolanguage) , Esperanto , Ido , Interlingua (International Auxiliary Language Association) , and Volapük
Description:
Texts in 107 languages from the W2C corpus (http://hdl.handle.net/11858/00-097C-0000-0022-6133-9), first 1,000,000 tokens per language, tagged by the delexicalized tagger described in Yu et al. (2016, LREC, Portorož, Slovenia).
Changes in version 1.1:
1. Universal Dependencies tagset instead of the older and smaller Google Universal POS tagset.
2. SVM classifier trained on Universal Dependencies 1.2 instead of HamleDT 2.0.
3. Balto-Slavic languages, Germanic languages and Romance languages were tagged by classifier trained only on the respective group of languages. Other languages were tagged by a classifier trained on all available languages. The "c7" combination from version 1.0 is no longer used.
Rights:
Creative Commons - Attribution-ShareAlike 4.0 International (CC BY-SA 4.0) , http://creativecommons.org/licenses/by-sa/4.0/ , and PUB
Creator:
Kłodnicki, Zygmunt,
Type:
text and studie
Subject:
Kulturní antropologie. Etnologie. Etnografie , démonologie , etnologie , metodologie , Polsko , přehledná zpracování světových dějin (chronologicky) , církevní a náboženské dějiny , and zahraniční národopis
Language:
Polish
Description:
Vorschläge zur Systematik der Volksdämonologie des Polnischen ethnographischen Atlas in Cieszyn.
Rights:
unknown
Creator:
Ptaśnik, Jan,
Type:
text and monografie
Subject:
Dějiny zemí střední Evropy , dějiny polské , haléř svatopetrský , papežství , důchody papežské , Polsko , papežství, církevní politika , and světové dějiny středověku (do r. 1492)
Language:
Polish
Rights:
unknown
Creator:
Koebner, Richard,
Type:
text and monografie
Subject:
Dějiny zemí střední Evropy , Jiří z Poděbrad, , vztahy česko-polské , města polská , Polsko , města, obce , české země 1419-1471 , světové dějiny středověku (do r. 1492) , and dějiny osídlení, regionální dějiny
Language:
Polish
Rights:
unknown
Creator:
Metallmann, Joachim
Publisher:
Polska Akademja Umiejętności
Format:
print and xiv, 424 s.
Type:
text , volume , pojednání , model:monograph , and TEXT
Subject:
Speciální metafyzika , determinismus , přírodní vědy , 123.2 , 5 , (049) , and 122/129
Language:
Polish
Description:
Joachim Metallmann. and KČSN
Rights:
http://creativecommons.org/publicdomain/mark/1.0/ and policy:public
Creator:
Hlaváček, Ivan,
Type:
text and články jubilejní
Subject:
Historická věda. Pomocné vědy historické. Archivnictví , Spunar, Pavel, , historici , kodikologové , paleografie , kultura středověká , jubilea životní , and historici (jubilea, nekrology apod.)
Language:
Polish
Rights:
unknown
Publisher:
[S.n.]
Type:
Text , model:monograph , and TEXT
Subject:
054
Language:
Latin and Polish
Description:
Chybí listy, 3 poslední kapitoly 2. knihy. Neúplné. Čísl. vrstvami. Sign. Písmo gotické. Rubriky. Sazba ve vokabuláři 4-sloupcová. Linky. Viněta na fol. 4a propriová. Vazba původní, renesanční, žlutá kůže s intarsiemi, poškozená. Hřbet natřen barvou světle hnědou. Pův. maj. býv. kl. frant. v Uh. Hradišti. and Pův. maj. býv. kl. frant. v Uh. Hradišti
Rights:
http://creativecommons.org/publicdomain/mark/1.0/ and policy:public
Creator:
Sappok, Gerhard
Type:
text and monografie
Subject:
Dějiny Evropy , biskupství poznaňské , biskupové poznaňští , Polsko , světové dějiny středověku (do r. 1492) , and církevní správa a hospodářství
Language:
Polish
Rights:
unknown