This is a new version of the repository. Do let us know (lindat-help at ufal.mff.cuni.cz) if you encounter any issues.

BulTreeBank Text Archive

Please use the following text to cite this item or export to a predefined format:
2014, BulTreeBank Text Archive, LINDAT/CLARIAH-CZ digital library at the Institute of Formal and Applied Linguistics (ÚFAL), http://hdl.handle.net/11372/LRT-175.
Date issued
2014-07-30
Type
Language(s)
Description
72 000 000 tokens, 15% fiction, 78% newspapers and 7% legal texts, government bulletins and others