Skip to search
Skip to main content
Skip to first result
Search
Search Results
Type:
text and sborníky
Subject:
Ruská literatura (o ní) , Puškin, Aleksandr Sergejevič, , básníci ruští , literatura ruská , and české (československé) sborníky a kolektivní monografie
Language:
Czech and Russian
Rights:
unknown
Creator:
Droganova, Kira , Zeman, Daniel , Kanerva, Jenna , and Ginter, Filip
Publisher:
Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL)
Type:
text and corpus
Subject:
universal dependencies , ellipsis , and gapping
Language:
English , Czech , Finnish , Russian , and Slovak
Description:
Artificially created treebank of elliptical constructions (gapping), in the annotation style of Universal Dependencies. Data taken from UD 2.1 release, and from large web corpora parsed by two parsers. Input data are filtered, sentences are identified where gapping could be applied, then those sentences are transformed, one or more words are omitted, resulting in a sentence with gapping. Details in Droganova et al.: Parse Me if You Can: Artificial Treebanks for Parsing Experiments on Elliptical Constructions, LREC 2018, Miyazaki, Japan.
Rights:
Licence Universal Dependencies v2.1 , https://lindat.mff.cuni.cz/repository/xmlui/page/licence-UD-2.1 , and PUB
Creator:
Čeněk Zíbrt and Česká akademie císaře Františka Josefa pro vědy, slovesnost a umění
Publisher:
Nákladem České akademie císaře Františka Josefa pro vědy, slovesnost a umění
Format:
print , svazek , and 326 stran.
Type:
model:monograph and TEXT
Subject:
Vokální hudba , Bibliografie. Katalogy , české lidové písně , historické prameny , Česko , 784.4(=162.3) , (016) , (437.3) , 9 , 12 , 784 , and 01
Language:
Czech , English , French , German , Italian , Latin , Polish , and Russian
Description:
sestavil Čeněk Zíbrt., Obsahuje rejstříky., Částečně souběžný anglický, francouzský, německý, italský, latinský, polský a ruský text, and Vydává III. třída České akademie císaře Františka Josefa pro vědy, slovesnost a umění v Praze
Rights:
http://creativecommons.org/publicdomain/mark/1.0/ and policy:public
Creator:
Fedor Michajlovič Dostojevskij and Jaromír Hrubý
Type:
model:monograph and TEXT
Language:
Czech and Russian
Rights:
http://creativecommons.org/publicdomain/mark/1.0/ and policy:public
Type:
model:periodicalitem and TEXT
Language:
Czech , English , and Russian
Description:
15 and Sborník statí věnovaných k šedesátinám člena korespondenta Josefa Lihnarta
Rights:
http://creativecommons.org/publicdomain/mark/1.0/ and policy:public
Type:
model:periodicalitem and TEXT
Language:
Czech and Russian
Description:
1
Rights:
http://creativecommons.org/publicdomain/mark/1.0/ and policy:public
Type:
model:periodicalitem and TEXT
Language:
Czech , English , and Russian
Description:
17, Sborník statí věnovaných k šedesátinám člena korespondenta Josefa Lihnarta, and Označení čísla nebylo na předloze uvedeno, pořadí je dopočítáno.
Rights:
http://creativecommons.org/publicdomain/mark/1.0/ and policy:public
Creator:
Linhart, J.
Type:
model:periodicalitem and TEXT
Language:
Czech and Russian
Description:
3 and Cognitive processes and learning
Rights:
http://creativecommons.org/publicdomain/mark/1.0/ and policy:public
Creator:
Gurevych, Iryna , Habernal, Ivan , and Zayed, Omnia
Publisher:
Technische Universität Darmstadt
Type:
text and corpus
Subject:
CommonCrawl , Creative Commons , Web corpus , and Amazon Web Services
Language:
Afrikaans , Arabic , Bengali , Bulgarian , Czech , Danish , German , Modern Greek (1453-) , English , Estonian , Persian , Finnish , French , Hebrew , Hindi , Croatian , Hungarian , Indonesian , Italian , Japanese , Kannada , Korean , Latvian , Lithuanian , Malayalam , Macedonian , Nepali (macrolanguage) , Dutch , Norwegian , Panjabi , Polish , Portuguese , Romanian , Russian , Slovak , Slovenian , Somali , Spanish , Albanian , Swahili (macrolanguage) , Swedish , Tamil , Telugu , Tagalog , Thai , Turkish , Ukrainian , Undetermined , Vietnamese , and Chinese
Description:
A large web corpus (over 10 billion tokens) licensed under CreativeCommons license family in 50+ languages that has been extracted from CommonCrawl, the largest publicly available general Web crawl to date with about 2 billion crawled URLs.
Rights:
Creative Commons - Attribution-NonCommercial 4.0 International (CC BY-NC 4.0) , http://creativecommons.org/licenses/by-nc/4.0/ , and PUB
Creator:
Gurevych, Iryna , Habernal, Ivan , and Zayed, Omnia
Publisher:
Technische Universität Darmstadt
Type:
text and corpus
Subject:
CommonCrawl , Creative Commons , Web corpus , and Amazon Web Services
Language:
Afrikaans , Arabic , Bengali , Bulgarian , Czech , Danish , German , Modern Greek (1453-) , English , Estonian , Persian , Finnish , French , Gujarati , Hebrew , Hindi , Croatian , Hungarian , Indonesian , Italian , Japanese , Kannada , Korean , Latvian , Lithuanian , Malayalam , Marathi , Macedonian , Nepali (macrolanguage) , Dutch , Norwegian , Polish , Portuguese , Romanian , Russian , Slovak , Slovenian , Somali , Spanish , Albanian , Swahili (macrolanguage) , Swedish , Tamil , Telugu , Tagalog , Thai , Turkish , Ukrainian , Undetermined , Urdu , Vietnamese , and Chinese
Description:
A large web corpus (over 10 billion tokens) licensed under CreativeCommons license family in 50+ languages that has been extracted from CommonCrawl, the largest publicly available general Web crawl to date with about 2 billion crawled URLs.
Rights:
Creative Commons - Attribution-NonCommercial-NoDerivatives 4.0 International (CC BY-NC-ND 4.0) , http://creativecommons.org/licenses/by-nc-nd/4.0/ , and PUB