STYX 1.0 (2017-10-03)

Please use the following text to cite this item:
Hladká, Barbora; Kučera, Ondřej and Kuchyňová, Karolína, 2017, STYX 1.0 (2017-10-03), LINDAT/CLARIAH-CZ digital library at the Institute of Formal and Applied Linguistics (ÚFAL), http://hdl.handle.net/11234/1-2467.
Date issued
2017
Size
11655 sentences
Language(s)
Czech
Description
STYX 1.0 is a corpus of Czech sentences selected from the Prague Dependency Treebank (PDT). Sentences were included in STYX based on their suitability for practicing Czech morphology and syntax in elementary schools. Each sentence carries both the PDT annotations and a school sentence analysis. The school sentence analyses were created by transforming the PDT annotations using handcrafted rules. Altogether, the STYX 1.0 corpus contains 11 655 sentences. Originally, the STYX 1.0 corpus was an inseparable part of the Styx system (http://hdl.handle.net/11858/00-097C-0000-0001-48FB-F).
Files in this item
Name
STYX-1.0.zip
Size
2.64 MB
Format
application/zip
Description
Unknown
MD5
0daf2a3b1f5f7f28f7f89fb77181f7e1
Preview
  File Preview
  • STYX-1.0
    • img
      • STYX-2-2-1.png (249 kB)
      • STYX-1.png (628 kB)
      • STYX-2-2.png (60 kB)
      • STYX-2-1-1.png (193 kB)
      • STYX-2-1.png (58 kB)
    • bin
      • PDTVisual.exe (35 kB)
    • README.CZ (4 kB)
    • data
      • etest.o (814 kB)
      • train-1.o (747 kB)
      • train-6.o (736 kB)
      • train-2.o (835 kB)
      • dtest.o (868 kB)
      • train-7.o (796 kB)
      • train-3.o (810 kB)
      • train-8.o (829 kB)
      • train-4.o (704 kB)
      • train-5.o (849 kB)
    • README.EN (4 kB)
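
The MD5 value listed for STYX-1.0.zip can be used to check that a download arrived intact. The following is a minimal sketch of such a check, not part of the STYX distribution: the helper names `md5_of_file` and `matches_expected` are hypothetical, and the archive is assumed to have been saved as `STYX-1.0.zip` in the current directory.

```python
import hashlib
from pathlib import Path

# MD5 value as listed in the file record for STYX-1.0.zip.
EXPECTED_MD5 = "0daf2a3b1f5f7f28f7f89fb77181f7e1"


def md5_of_file(path, chunk_size=1 << 20):
    """Return the hexadecimal MD5 digest of the file at `path`.

    The file is read in chunks so that large archives do not have to
    fit in memory at once.
    """
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_expected(path, expected=EXPECTED_MD5):
    """Return True if the file's MD5 digest equals `expected` (case-insensitive)."""
    return md5_of_file(path) == expected.lower()


if __name__ == "__main__":
    # Assumed local download location; adjust to wherever the archive was saved.
    archive = Path("STYX-1.0.zip")
    if archive.exists():
        print("checksum OK" if matches_expected(archive) else "checksum MISMATCH")
    else:
        print(f"{archive} not found")
```

A mismatch most likely indicates an incomplete or corrupted download. MD5 is suitable for detecting accidental corruption only, not for verifying authenticity.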