LINDAT/CLARIAH-CZ logo
  • Catalog
  • Repository
  • Education
  • Projects
  • Tools
  • Services
  • About
    Partners Mission Statement CLARIN DARIAH Service integrations Project partnerships
  • DARIAH logo
  • CLARIN logo
  •  Login
  • English čeština
  • LINDAT/CLARIAH-CZ Repository Home
  • View Item
  •  
  • LINDAT/CLARIAH-CZ logo
    CLARIN logo
  •   Browse  
    •    All of the Repository  
      •   Issue Date
      •   Authors
      •   Titles
      •   Subjects
      •   Publisher
      •   Language
      •   Type
      •   Rights Label
  •   My Account  
    •    Login
  •   Statistics  
    •    StatisticsBETA
  •   General Information  
    •    Deposit
    •    Cite
    •    Submission Lifecycle
    •    FAQ
    •    About
    •    Help Desk
 
 

Kacenka : parallel corpus of English and Czech texts

 
LRT + Open Submissions
  Authors
Rambousek, Jiri
  Item identifier
http://hdl.handle.net/11372/LRT-891
 Project URL
http://www.phil.muni.cz/angl/kacenka/kachna.html
 Date issued
1997
 Type
corpus
 Language(s)
Czech , English
 Description
Parallel corpus, 3,297,283 words. The idea was to create a small parallel corpus which would enable to work with entire texts in translation analysis rather then short extracts. At the same time it aimed at acquiring experience that could be used in creating a larger parallel corpus of English and Czech in the future. Although the main part of work has been completed -- and the aims of the KACENKA grant met -- we keep improving and enlarging KACENKA gradually. Currently, it has the size of 3,297,283 words (out of which, 1,689,513 have been acquired by means of scanning). Most of the English texts for KACENKA have been retrieved from the Internet resources. The rest -- and nearly all the Czech texts -- had to be scanned with the use of an OCR programme. KACENKA is stored on a single CD-ROM; its use is limited by copyright restrictions.
 Publisher
Masaryk University, Brno
 Collection(s)
LRT + Open Submissions Data & Tools
Show full item record
 
 

LINDAT/CLARIAH-CZ

  • Mission Statement
  • Advisory Board
  • Events
  • CLARIN Participation
  • DARIAH Participation

  • FAQ
  • Helpdesk
  • User Feedback Form

  • Acknowledge LINDAT/CLARIAH-CZ

Partners

  • Charles University
    • Faculty of Mathematics and Physics
    • Faculty of Arts
  • Masaryk University
    • Faculty of Arts
    • Faculty of Informatics
  • University of West Bohemia
    • Faculty of Applied Sciences
  • Terezín
    • Terezín Initiative Institute
    • Terezín Memorial
  • Czech Academy of Sciences
    • Czech Language Institute
    • Library of Academy
    • Institute of History
    • Institute of Philosophy
    • Masaryk Institute and Archives
  • Archives, Libraries and Galleries
    • National Library of the Czech Republic
    • Moravian Library in Brno
    • National Gallery Prague
    • National Film Archive
    • National Archives

Services

  • Service Status
  • About and Policies
  • Terms of Use
CLARIN CENTRE B CLARIN CENTRE K CoreTrustSeal Certification
Follow us on Twitter Link to Profile Home Page
THE LINDAT/CLARIAH-CZ PROJECT (LM2023062; formerly LM2010013, LM2015071, LM2018101) IS FULLY SUPPORTED BY THE MINISTRY OF EDUCATION, SPORTS AND YOUTH OF THE CZECH REPUBLIC UNDER THE PROGRAMME LM OF "LARGE INFRASTRUCTURES"
Icons © Smashicons and Freepik from flaticon.com licensed by CC 3.0 BY
website © 2023 by ÚFAL