Creator: Čermák, František - LINDAT/CLARIAH-CZ Catalog Search Results

1. Lexikon korpusu Orální historie (Příběhy)

Creator:: Cvrček, Václav, Čermák, František, Chlumská, Lucie, and Mácha, Jiří
Format:: unmediated and volume
Type:: model:article and TEXT
Subject:: lexicon, keywords, collocations, oral history, lexikon, klíčová slova, kolokace, and orální historie
Language:: Czech
Description:: This paper is based on a study which was conducted within the research grant "Institutions in Life Stories. Multilevel Comparative Analysis of Biographical Narratives of Three Groups of Participants in Czech Society in 20th Century". The aim of this research was both to describe one possible way of using a corpus to identify relevant differences between three types of text (in this case biographical narratives of three groups of speakers: communist officials, dissidents and so-called common people) and to serve as a basis for further analysis (be it linguistic, sociological or historical). We tried to point out typical features of the language of each group based on the most frequent expressions (nouns, adjectives, etc.) and especially collocations. We also compared the corpus Příběhy (Stories) as a whole with the ORAL2008 corpus of synchronic spoken Czech, the SYN2005 corpus of synchronic written Czech and the Totalita corpus (a corpus of communist propaganda).
Rights:: http://creativecommons.org/publicdomain/mark/1.0/ and policy:public

2. Minulost, přítomnost a budoucnost česko-anglických slovníků

Creator:: Čermák, František and Klégr, Aleš
Format:: print
Type:: model:internalpart and TEXT
Language:: Czech
Rights:: http://creativecommons.org/licenses/by-nc-sa/4.0/ and policy:public

3. Monokolokabilní slova v češtině: jejich hlavní aspekty

Creator:: Čermák, František
Format:: unmediated and volume
Type:: model:article and TEXT
Subject:: monocollocable words, language periphery, combination, monokolokabilní slova, periferie jazyka, and kombinace
Language:: Czech
Description:: Monocollocable words are words and word forms that occur in only one lexical combination, or in very few combinations whose number is severely restricted and fixed. In practice, they are found as parts of set idioms and multi-word terms. They occur in many other languages as well, cf. English tenterhooks or Russian bakluši. Czech examples such as dát/dostat najevo, na viděnou, je mi líto, říct/mluvit/hrát nahlas, je známo, je zapotřebí, být třešničkou na dortu, není divu, jít/chodit pěšky and dát/dostat zadarmo illustrate this in more detail. They show that limited variation may occur but, above all, that these are not full-fledged words, since they lack most of the basic characteristics of words, such as meaning or word-class membership. Because of their severely limited combinatorial capacity, these words, known under less common alternative labels such as cranberry words, form a substantial and irregular periphery of language and its lexicon. The contribution briefly comments on some of their aspects and suggests that some broad classes or types can be recognized.
Rights:: http://creativecommons.org/publicdomain/mark/1.0/ and policy:public

4. O geologii a dolování na Rýmařovsku /

Creator:: Čermák, František
Type:: text and informational publication
Subject:: Economic geology, geology, mining, mines, ore mining, silver mining, gold mining, survey histories of the Czech lands (chronological), and industry, manufactories, mining, breweries
Language:: Czech
Description:: Terminological dictionary
Rights:: unknown

5. Přehled archivních fondů do znárodnění podnikovém archívu n. p. Juta Dvůr Králové n. L. /

Creator:: Čermák, František
Type:: text and study
Subject:: Textile industry, archival fonds, company archives, Czech lands 1792-1918, Czechoslovakia 1918-1992, Czechoslovakia 1945-1948, economic history, and Czech and Czechoslovak archives, archival fonds
Language:: Czech
Rights:: unknown

6. Přehled vývoje národního podniku Juta, Dvůr Králové nad Labem, od založení do roku 1980 /

Creator:: Čermák, František
Type:: text and study
Subject:: Textile industry, industrial enterprises, company histories, jute industry, Czechoslovakia 1945-1992, and industry, manufactories, mining, breweries
Language:: Czech
Rights:: unknown

7. Some current problems of corpus and computational linguistics, or Fifteen commandments and general truths

Creator:: Čermák, František
Format:: unmediated and volume
Type:: model:article and TEXT
Subject:: corpus, corpus linguistics, computational linguistics, methodology, type of data, type of information, representativeness of corpora, systems of tagging, lemmatizers, ir/regularity in language, collocations, meaning, aligners, korpus, korpusová lingvistika, komputační lingvistika, metodologie, typy dat, typy informace, reprezentativnost korpusu, systémy taggování, lemmatizátory, ne/pravidelnost v jazyce, kolokace, význam, and alignery
Language:: Czech
Description:: This contribution critically presents, in a brief, succinct and almost aphoristic way, a number of problems of today's corpus and computational linguistics together with their unsatisfactory solutions, while trying to dispel a number of myths and simplified opinions in the field.
Rights:: http://creativecommons.org/publicdomain/mark/1.0/ and policy:public

8. SYN2005: balanced corpus of written Czech

Creator:: Čermák, František, Hlaváčová, Jaroslava, Hnátková, Milena, Jelínek, Tomáš, Kocek, Jan, Kopřivová, Marie, Křen, Michal, Novotná, Renata, Petkevič, Vladimír, Schmiedtová, Věra, Skoumalová, Hana, Spoustová, Johanka, Šulc, Michal, and Velíšek, Zdeněk
Publisher:: Faculty of Arts, Institute of the Czech National Corpus, Charles University in Prague
Type:: text and corpus
Subject:: balanced corpus and written language
Language:: Czech
Description:: Balanced corpus of contemporary written Czech with a size of 100 MW (million words). It was created as a representation of written language from 2000–2004 and therefore contains a wide range of text types and genres (fiction, professional literature, newspapers, etc.) in balanced proportions. The corpus is lemmatized and morphologically tagged by a combination of stochastic and rule-based methods. It is provided in a (semi-XML) vertical format used as input to the Manatee query engine. The data thus correspond to the corpus available via the query interface to registered users of the CNC, with one important exception: they are shuffled, i.e. divided into blocks of at most 100 words (respecting sentence boundaries) whose order was randomized within each document. and MSM0021620823 – Český národní korpus a korpusy dalších jazyků
Rights:: Czech National Corpus (Shuffled Corpus Data), https://lindat.mff.cuni.cz/repository/xmlui/page/license-cnc, and ACA
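
Note: The shuffling described in the record above (blocks of at most 100 words, sentence boundaries respected, block order randomized within a document) can be illustrated with a minimal Python sketch. This is not the CNC's actual procedure, which the record does not specify. The function names, the representation of a sentence as a list of tokens, and the choice to let an over-long sentence form its own block are all assumptions made for illustration.

```python
import random
from typing import List, Optional

Sentence = List[str]  # assumption: a sentence is a list of word tokens


def split_into_blocks(sentences: List[Sentence], max_words: int = 100) -> List[List[Sentence]]:
    """Group consecutive sentences into blocks of at most max_words tokens.

    Sentences are never split. A single sentence longer than max_words
    ends up alone in its own (oversized) block; this is an assumption.
    """
    blocks: List[List[Sentence]] = []
    current: List[Sentence] = []
    current_len = 0
    for sent in sentences:
        # Close the current block if adding this sentence would exceed the limit.
        if current and current_len + len(sent) > max_words:
            blocks.append(current)
            current, current_len = [], 0
        current.append(sent)
        current_len += len(sent)
    if current:
        blocks.append(current)
    return blocks


def shuffle_document(sentences: List[Sentence], max_words: int = 100,
                     seed: Optional[int] = None) -> List[Sentence]:
    """Return the document's sentences with block order randomized."""
    blocks = split_into_blocks(sentences, max_words)
    rng = random.Random(seed)  # a fixed seed makes the shuffle reproducible
    rng.shuffle(blocks)
    # Sentence order inside each block is preserved.
    return [sent for block in blocks for sent in block]
```

Shuffling is applied per document, so text stays attributed to its source document while longer passages cannot be read in their original order.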

9. SYN2006PUB: corpus of Czech newspapers

Creator:: Čermák, František, Hlaváčová, Jaroslava, Hnátková, Milena, Jelínek, Tomáš, Kocek, Jan, Kopřivová, Marie, Křen, Michal, Novotná, Renata, Petkevič, Vladimír, Schmiedtová, Věra, Skoumalová, Hana, Spoustová, Johanka, Šulc, Michal, and Velíšek, Zdeněk
Publisher:: Faculty of Arts, Institute of the Czech National Corpus, Charles University in Prague
Type:: text and corpus
Subject:: corpus and written language
Language:: Czech
Description:: Corpus of contemporary Czech newspapers and magazines with a size of 300 MW (million words). It contains various titles published between the end of 1989 and 2004. The corpus is lemmatized and morphologically tagged by a combination of stochastic and rule-based methods. It is provided in a (semi-XML) vertical format used as input to the Manatee query engine. The data thus correspond to the corpus available via the query interface to registered users of the CNC, with one important exception: they are shuffled, i.e. divided into blocks of at most 100 words (respecting sentence boundaries) whose order was randomized within each document. and MSM0021620823 – Český národní korpus a korpusy dalších jazyků
Rights:: Czech National Corpus (Shuffled Corpus Data), https://lindat.mff.cuni.cz/repository/xmlui/page/license-cnc, and ACA

10. Travaux du Cercle linguistique de Prague n. s. Prague Linguistic Circle Papers, Vol. 3. Eds. E. Hajičová - T. Hoskovec - O. Leška - P. Sgall - Z. Skoumalová. John Benjamins Publishing Company, Amsterdam - Philadelphia 1999. 310 s.

Creator:: Čermák, František
Format:: print
Type:: model:internalpart and TEXT
Language:: Czech
Rights:: http://creativecommons.org/licenses/by-nc-sa/4.0/ and policy:public
