BulTreeBank Morphosyntactic Corpus
- Autoři
- Simov, Kiril
- Identifikátor
- http://hdl.handle.net/11372/LRT-221
- URL projektu
- http://www.bultreebank.org/btbmorf/
- Datum vydání
- 2014-07-30
- Typ
- corpus
- Jazyky
- Bulgarian
- Popis
- Written, synchronic, general, manually annotated, 1 000 000 tokens divided in three sets: 215 000 tokens used in BulTreeBank HPSG Treebank (see below), additionally 300 000 checked second time, rest about 480 000 checked by the annotators. Morphosyntactic annotation with the BulTreeBank Tagset (http://www.bultreebank.org/TechRep/BTB-TR03.pdf), XML, annotation description in technical reports of BulTreeBank project http://www.bultreebank.org/TechRep