
Prague Dependency Treebank of Spoken Language (PDTSL) 0.5

Please use the following text to cite this item:
Hajič, Jan; et al., 2009, Prague Dependency Treebank of Spoken Language (PDTSL) 0.5, LINDAT/CLARIAH-CZ digital library at the Institute of Formal and Applied Linguistics (ÚFAL), http://hdl.handle.net/11858/00-097C-0000-0001-4914-D.
Date issued
2009-11-02T10:40:55Z
Size
120000 words
Language(s)
Czech, English
Description
The first edition of a speech corpus with a speech reconstruction layer (edited transcript). The project of speech reconstruction of Czech and English was started at ÚFAL together with the PIRE project in 2005 and has gradually grown from ideas to a (first) annotation specification, annotation software, and actual annotation. It is part of the Prague Dependency Treebank family of annotated corpus resources and tools, to which it adds the spoken language layer(s).
Acknowledgement
This item is Academic Use
and licensed under:
 Files in this item
Name
pdtsc-snapshot-2008-08-19.zip
Size
4.42 MB
Format
application/zip
Description
Czech part
MD5
cbd2f7ee0f4f42a4f63675b3f111c9c3
Name
pdtse-snapshot-2008-08-19.zip
Size
2.18 MB
Format
application/zip
Description
English part
MD5
a290fdc805a1d4f6bc6db1ca5431cdb0
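
The MD5 values listed for the two archives can be used to check that a download arrived intact. The following is a minimal sketch, not part of the original record: it assumes both zip files have been saved under their listed names in the current working directory, and the helper names md5_of_file and verify_download are hypothetical, introduced only for illustration.

```python
import hashlib
from pathlib import Path

# Checksums copied from the file listing in this record.
EXPECTED_MD5 = {
    "pdtsc-snapshot-2008-08-19.zip": "cbd2f7ee0f4f42a4f63675b3f111c9c3",  # Czech part
    "pdtse-snapshot-2008-08-19.zip": "a290fdc805a1d4f6bc6db1ca5431cdb0",  # English part
}


def md5_of_file(path, chunk_size=1 << 20):
    """Return the hexadecimal MD5 digest of a file, read in chunks."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        # Read fixed-size chunks until read() returns an empty bytes object.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def verify_download(path, expected_md5):
    """Return True if the file's MD5 matches the expected value (case-insensitive)."""
    return md5_of_file(path) == expected_md5.lower()


if __name__ == "__main__":
    for name, expected in EXPECTED_MD5.items():
        archive = Path(name)  # assumption: archives are in the current directory
        if not archive.exists():
            print(f"{name}: not found, skipping")
            continue
        status = "OK" if verify_download(archive, expected) else "MISMATCH"
        print(f"{name}: {status}")
```

A mismatch most likely indicates an incomplete or corrupted download, in which case fetching the archive again is the usual remedy. MD5 is adequate for detecting accidental corruption but is not a safeguard against deliberate tampering.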