EdUKate translation models 2025
Please use the following text to cite this item or export to a predefined format:
Hrabal, Miroslav; et al., 2025,
EdUKate translation models 2025, LINDAT/CLARIAH-CZ digital library at the Institute of Formal and Applied Linguistics (ÚFAL),
http://hdl.handle.net/11234/1-6032.
Authors
Hrabal, Miroslav ; et al.
Item identifier
Project URL
Demo URL
Referenced by
Date issued
2025
Type
Description
This package includes three models adapted for sentence-level machine translation in educational domain: Czech-to-Ukrainian, Czech-to-English and Czech-to-German. The models are provided as LoRA adapters on top of EuroLLM-9B-Instruct LLM and can be used in the Charles Translator service (https://translator.cuni.cz) and in the web portal Škola s nadhledem (https://skolasnadhledem.cz/). The models were developed within the EdUKate project, which aims to help mitigate language barriers between non-Czech-speaking children in the Czech Republic and the education in the Czech school system. The project focuses on the development and dissemination of multilingual digital learning materials for students in primary and secondary schools.
Acknowledgement
Technologická agentura ČR
Project code:TQ01000458
Project name:EdUKate: Podpora digitálního vzdělávání cizojazyčných dětí prostřednictvím počítačového překladu
Subject(s)
Collections
This item isPublicly Available
and licensed under:
Files in this item
- Name
- cs-en.zip
- Size
- 722.5 MB
- Format
- application/zip
- Description
- Zip
- MD5
- 7403373a6dc490b69224c1214a0db8bb

- csen
- tokenizer.model2 MB
- tokenizer.json15 MB
- special_tokens_map.json557 B
- tokenizer_config.json45 kB
- adapter_config.json945 B
- generation_config.json163 B
- adapter_model.safetensors777 MB
- chat_template.jinja295 B
- Name
- cs-de.zip
- Size
- 363.5 MB
- Format
- application/zip
- Description
- Zip
- MD5
- 88583e490c1afb8ccc6c79afcf4455a1

- csde
- tokenizer.model2 MB
- tokenizer.json15 MB
- special_tokens_map.json557 B
- tokenizer_config.json45 kB
- adapter_config.json944 B
- generation_config.json163 B
- adapter_model.safetensors388 MB
- chat_template.jinja295 B
- Name
- cs-uk.zip
- Size
- 727.34 MB
- Format
- application/zip
- Description
- Zip
- MD5
- 94d4f7dd447eb71ec7672e17fcd11baa

- csuk
- tokenizer.model2 MB
- tokenizer.json15 MB
- tokenizer_config.json45 kB
- special_tokens_map.json557 B
- adapter_config.json506 B
- generation_config.json163 B
- adapter_model.safetensors777 MB
- chat_template.jinja295 B

