This is a new version of the repository. Do let us know (lindat-help at ufal.mff.cuni.cz) if you encounter any issues.

Szeged Corpus 2.0

Please use the following text to cite this item or export to a predefined format:
Department of Informatics, Human Language Technology Group, University of Szeged, 2003, Szeged Corpus 2.0, LINDAT/CLARIAH-CZ digital library at the Institute of Formal and Applied Linguistics (ÚFAL), http://hdl.handle.net/11372/LRT-347.
Date issued
2003
Type
Language(s)
Description
written, monolingual, general, manually POS annotated reference corpus; 1,459,288 tokens; MSD tagset, XML (TEI P4) files