dc.contributor.author Korvas, Matěj
dc.contributor.author Plátek, Ondřej
dc.contributor.author Dušek, Ondřej
dc.contributor.author Žilka, Lukáš
dc.contributor.author Jurčíček, Filip
dc.date.accessioned 2014-02-21T10:45:40Z
dc.date.available 2014-02-21T10:45:40Z
dc.date.issued 2014-02-21
dc.identifier.uri http://hdl.handle.net/11858/00-097C-0000-0023-4671-4
dc.description Vystadial 2013 is a dataset of telephone conversations in English and Czech, developed for training acoustic models for automatic speech recognition in spoken dialogue systems. It ships in three parts: Czech data, English data, and scripts. The data comprise over 41 hours of speech in English and over 15 hours in Czech, plus orthographic transcriptions. The scripts implement data pre-processing and acoustic model building with the HTK and Kaldi toolkits. This is the English data part of the dataset.
dc.description.sponsorship This research was funded by the Ministry of Education, Youth and Sports of the Czech Republic under the grant agreement LK11221.
dc.language.iso eng
dc.publisher Faculty of Mathematics and Physics, Charles University in Prague
dc.rights Attribution-ShareAlike 3.0 Unported (CC BY-SA 3.0)
dc.rights.uri http://creativecommons.org/licenses/by-sa/3.0/
dc.source.uri https://ufal.mff.cuni.cz/grants/vystadial
dc.subject acoustic data
dc.subject speech corpus
dc.subject spoken corpus
dc.subject orthographic transcriptions
dc.subject telephone speech
dc.subject voip
dc.subject dialogue system
dc.title Vystadial 2013 – English data
dc.type corpus
metashare.ResourceInfo#ContactInfo#PersonInfo.surname Korvas
metashare.ResourceInfo#ContactInfo#PersonInfo.givenName Matěj
metashare.ResourceInfo#ContactInfo#PersonInfo#OrganizationInfo.organizationName Faculty of Mathematics and Physics, Charles University in Prague, UFAL
metashare.ResourceInfo#DistributionInfo.availability unrestrictedUse
metashare.ResourceInfo#DistributionInfo#LicenseInfo.restrictionsOfUse evaluationUse
metashare.ResourceInfo#DistributionInfo#LicenseInfo.restrictionsOfUse commercialUse
metashare.ResourceInfo#DistributionInfo#LicenseInfo.restrictionsOfUse attribution
metashare.ResourceInfo#DistributionInfo#LicenseInfo.restrictionsOfUse shareAlike
metashare.ResourceInfo#DistributionInfo#LicenseInfo.distributionAccessMedium downloadable
metashare.ResourceInfo#ValidationInfo.validated True
metashare.ResourceInfo#ResourceCreationInfo#FundingInfo#ProjectInfo.projectName MŠMT LK11221 (Vývoj metod pro návrh statistických mluvených dialogových systémů)
metashare.ResourceInfo#ResourceCreationInfo#FundingInfo#ProjectInfo.fundingType National
metashare.ResourceInfo#ContentInfo.mediaType audio
metashare.ResourceInfo#TextInfo#SizeInfo.size 45
metashare.ResourceInfo#TextInfo#SizeInfo.sizeUnit hours
metashare.ResourceInfo#ContactInfo#PersonInfo#OrganizationInfo#CommunicationInfo.email korvas@ufal.mff.cuni.cz
dc.rights.label PUB
has.files yes
branding LINDAT / CLARIN
sponsor Ministerstvo školství, mládeže a tělovýchovy České republiky LK11221 Vývoj metod pro návrh statistických mluvených dialogových systémů nationalFunds
files.size 2793418303
files.count 1
featuredService.kontext search|http://lindat.mff.cuni.cz/services/kontext/run.cgi/first_form?corpname=vystadial_2013_en_w


 Files in this item

This item is Publicly Available and licensed under: Attribution-ShareAlike 3.0 Unported (CC BY-SA 3.0)
Name: data_voip_en.tgz
Size: 2.6 GB
Format: application/x-gzip
Description: Vystadial 2013 English data, tgz archive
MD5: 1d11887e54b8798de856a3bc80d22843
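Because the archive is about 2.6 GB, it can be worth checking a finished download against the MD5 digest and the byte size (files.size) given in this record. The following is a minimal sketch, not part of the dataset's own scripts; the function names `md5_of_file` and `verify_download` are hypothetical, and the archive is assumed to sit in the current working directory.

```python
import hashlib
import os

# Values copied from this record (MD5 field and files.size).
EXPECTED_MD5 = "1d11887e54b8798de856a3bc80d22843"
EXPECTED_SIZE = 2793418303  # bytes


def md5_of_file(path, chunk_size=1024 * 1024):
    """Return the hex MD5 digest of a file, reading it in chunks
    so the whole archive never has to fit in memory."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def verify_download(path, expected_md5=EXPECTED_MD5, expected_size=EXPECTED_SIZE):
    """Return True only if both the file size and the MD5 digest match."""
    # The size check is cheap and catches truncated downloads early.
    if os.path.getsize(path) != expected_size:
        return False
    return md5_of_file(path) == expected_md5.lower()


if __name__ == "__main__":
    # Hypothetical location: the archive name from this record,
    # assumed to be in the current directory.
    archive = "data_voip_en.tgz"
    print("OK" if verify_download(archive) else "MISMATCH")
```

A mismatch usually indicates an incomplete or corrupted transfer, in which case downloading the file again is the simplest remedy.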
