iula_tokenizer

Please use the following text to cite this item:
Institut Universitari de Lingüística Aplicada, Universitat Pompeu Fabra, 2014, iula_tokenizer, LINDAT/CLARIAH-CZ digital library at the Institute of Formal and Applied Linguistics (ÚFAL), http://hdl.handle.net/11372/LRT-1416.
Date issued
2014-07-30
Description
Text tokenizer. The input text must be in plain-text format (file.txt) and UTF-8 encoded.
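
The input requirement above can be checked before running the tokenizer. The following is a minimal sketch in Python and is not part of iula_tokenizer. The helper name `is_utf8_plain_text` is hypothetical, and treating NUL bytes as a sign of binary content is an assumed heuristic rather than a rule stated by the tool.

```python
from pathlib import Path


def is_utf8_plain_text(path):
    """Return True if the file decodes as UTF-8 and has no NUL bytes.

    Hypothetical pre-flight check; not provided by iula_tokenizer itself.
    """
    data = Path(path).read_bytes()
    # Assumed heuristic: plain text should not contain NUL bytes.
    if b"\x00" in data:
        return False
    try:
        data.decode("utf-8")
    except UnicodeDecodeError:
        return False
    return True
```

A file that fails this check (for example, one saved as Latin-1) would need to be converted to UTF-8 before it is passed to the tokenizer.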