
Vystadial 2013 – scripts

Please use the following text to cite this item:
Korvas, Matěj; Plátek, Ondřej; Dušek, Ondřej; Žilka, Lukáš and Jurčíček, Filip, 2014, Vystadial 2013 – scripts, LINDAT/CLARIAH-CZ digital library at the Institute of Formal and Applied Linguistics (ÚFAL), http://hdl.handle.net/11858/00-097C-0000-0023-466F-C.
Date issued
2014-02-21
Language(s)
Description
Vystadial 2013 is a dataset of telephone conversations in English and Czech, developed for training acoustic models for automatic speech recognition in spoken dialogue systems. It ships in three parts: Czech data, English data, and scripts. The data comprise over 41 hours of speech in English and over 15 hours in Czech, plus orthographic transcriptions. The scripts implement data pre-processing and building acoustic models using the HTK and Kaldi toolkits. This is the scripts part of the dataset.
Acknowledgement
This item is Publicly Available
and licensed under:
Files in this item
Name
scripts.tgz
Size
95.64 MB
Format
application/x-gzip
Description
Vystadial 2013 scripts and models, tgz archive
MD5
1697afb04360e9ac2a6c4a5a55c7f63f
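The MD5 value above can be used to check that a downloaded copy of the archive is intact. The following is a minimal sketch, not part of the dataset itself: it assumes the archive was saved as scripts.tgz in the current directory, and the helper names md5_of_file and verify_archive are hypothetical, introduced here only for illustration.

```python
import hashlib
from pathlib import Path

# MD5 checksum as listed for scripts.tgz in this record.
EXPECTED_MD5 = "1697afb04360e9ac2a6c4a5a55c7f63f"


def md5_of_file(path, chunk_size=1 << 20):
    """Return the hex MD5 digest of a file, reading it in chunks
    so that a large archive is never loaded into memory at once."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def verify_archive(path, expected=EXPECTED_MD5):
    """Return True if the file's MD5 digest matches the expected value."""
    return md5_of_file(path) == expected.lower()


if __name__ == "__main__":
    # Assumption: the download was saved under this name in the working directory.
    archive = Path("scripts.tgz")
    if archive.exists():
        print("checksum OK" if verify_archive(archive) else "checksum MISMATCH")
    else:
        print(f"{archive} not found; download it first")
```

A mismatch usually points to an incomplete or corrupted download, in which case fetching the archive again is the simplest remedy. MD5 is adequate for detecting accidental corruption but should not be relied on as protection against deliberate tampering.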