huntoken - tokenizer and sentence splitter

Please use the following text to cite this item:
Németh, László; Halácsy, Péter and Kornai, András, 2014, huntoken - tokenizer and sentence splitter, LINDAT/CLARIAH-CZ digital library at the Institute of Formal and Applied Linguistics (ÚFAL), http://hdl.handle.net/11372/LRT-1338.
Date issued
2014-07-30
Description
HunToken is a rule-based tokenizer and sentence boundary detector for Hungarian (and English) texts.
Subject(s)
This item is Publicly Available
and licensed under:
Files in this item
Name
huntoken-1.6.tgz
Size
409.95 KB
Format
application/x-gzip
Description
Huntoken
MD5
f2e24178f2ed18bba994c0ec5e2c7fe4
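The MD5 value listed above can be used to check that a downloaded copy of the archive is intact. The following is a minimal sketch in Python, not part of the huntoken distribution: the helper names `md5_of_file` and `matches_expected` are hypothetical, and the script assumes the archive was saved under its listed name in the current directory.

```python
import hashlib
import os

# Values copied from the file listing above.
EXPECTED_MD5 = "f2e24178f2ed18bba994c0ec5e2c7fe4"
# Assumption: the archive was saved under this name in the current directory.
ARCHIVE_NAME = "huntoken-1.6.tgz"


def md5_of_file(path, chunk_size=65536):
    """Return the hex MD5 digest of the file at `path`, reading it in chunks."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_expected(path, expected=EXPECTED_MD5):
    """Return True if the file's MD5 digest equals `expected` (case-insensitive)."""
    return md5_of_file(path) == expected.lower()


if __name__ == "__main__":
    if not os.path.exists(ARCHIVE_NAME):
        print(f"{ARCHIVE_NAME} not found in the current directory")
    elif matches_expected(ARCHIVE_NAME):
        print("MD5 matches the value listed for this item")
    else:
        print("MD5 mismatch: the download may be incomplete or corrupted")
```

A matching MD5 digest guards against accidental corruption during download; it is not a guarantee of authenticity.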