Show simple item record

 
dc.contributor.author Korvas, Matěj
dc.contributor.author Plátek, Ondřej
dc.contributor.author Dušek, Ondřej
dc.contributor.author Žilka, Lukáš
dc.contributor.author Jurčíček, Filip
dc.date.accessioned 2014-02-21T10:42:18Z
dc.date.available 2014-02-21T10:42:18Z
dc.date.issued 2014-02-21
dc.identifier.uri http://hdl.handle.net/11858/00-097C-0000-0023-4670-6
dc.description Vystadial 2013 is a dataset of telephone conversations in English and Czech, developed for training acoustic models for automatic speech recognition in spoken dialogue systems. It ships in three parts: Czech data, English data, and scripts. The data comprise over 41 hours of speech in English and over 15 hours in Czech, plus orthographic transcriptions. The scripts implement data pre-processing and building acoustic models using the HTK and Kaldi toolkits. This is the Czech data part of the dataset.
dc.description.sponsorship This research was funded by the Ministry of Education, Youth and Sports of the Czech Republic under the grant agreement LK11221.
dc.language.iso ces
dc.publisher Faculty of Mathematics and Physics, Charles University in Prague
dc.relation.isreplacedby http://hdl.handle.net/11234/1-1740
dc.rights Attribution-ShareAlike 3.0 Unported (CC BY-SA 3.0)
dc.rights.uri http://creativecommons.org/licenses/by-sa/3.0/
dc.source.uri https://ufal.mff.cuni.cz/grants/vystadial
dc.subject acoustic data
dc.subject speech corpus
dc.subject spoken corpus
dc.subject orthographic transcriptions
dc.subject telephone speech
dc.subject voip
dc.subject dialogue system
dc.title Vystadial 2013 – Czech data
dc.type corpus
metashare.ResourceInfo#ContactInfo#PersonInfo.surname Korvas
metashare.ResourceInfo#ContactInfo#PersonInfo.givenName Matěj
metashare.ResourceInfo#ContactInfo#PersonInfo#OrganizationInfo.organizationName Faculty of Mathematics and Physics, Charles University in Prague, UFAL
metashare.ResourceInfo#DistributionInfo.availability unrestrictedUse
metashare.ResourceInfo#DistributionInfo#LicenseInfo.restrictionsOfUse evaluationUse
metashare.ResourceInfo#DistributionInfo#LicenseInfo.restrictionsOfUse commercialUse
metashare.ResourceInfo#DistributionInfo#LicenseInfo.restrictionsOfUse attribution
metashare.ResourceInfo#DistributionInfo#LicenseInfo.restrictionsOfUse shareAlike
metashare.ResourceInfo#DistributionInfo#LicenseInfo.distributionAccessMedium downloadable
metashare.ResourceInfo#ValidationInfo.validated True
metashare.ResourceInfo#ResourceCreationInfo#FundingInfo#ProjectInfo.projectName MŠMT LK11221 (Development of Methods for the Design of Statistical Spoken Dialogue Systems)
metashare.ResourceInfo#ResourceCreationInfo#FundingInfo#ProjectInfo.fundingType National
metashare.ResourceInfo#ContentInfo.mediaType audio
metashare.ResourceInfo#TextInfo#SizeInfo.size 18
metashare.ResourceInfo#TextInfo#SizeInfo.sizeUnit hours
metashare.ResourceInfo#ContactInfo#PersonInfo#OrganizationInfo#CommunicationInfo.email korvas@ufal.mff.cuni.cz
dc.rights.label PUB
has.files yes
branding LINDAT / CLARIN
sponsor Ministry of Education, Youth and Sports of the Czech Republic LK11221 Development of Methods for the Design of Statistical Spoken Dialogue Systems nationalFunds
files.size 1580742931
files.count 1
featuredService.kontext search|http://lindat.mff.cuni.cz/services/kontext/run.cgi/first_form?corpname=vystadial_2013_cs_w


 Files in this item

This item is publicly available and licensed under Attribution-ShareAlike 3.0 Unported (CC BY-SA 3.0).
Name: data_voip_cs.tgz
Size: 1.47 GB
Format: application/x-gzip
Description: Vystadial 2013 Czech data, tgz archive
MD5: 514b38e657bdd52309e80f22a773d6cc
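
The byte size and MD5 checksum listed in this record can be used to check a downloaded copy of the archive. The following is a minimal sketch in Python, not part of the dataset or its scripts; the function names `md5_of_file` and `verify_download` are hypothetical, and the local file path is whatever location the archive was saved to.

```python
import hashlib
import os

# Values copied from this record (MD5 and files.size).
EXPECTED_MD5 = "514b38e657bdd52309e80f22a773d6cc"
EXPECTED_SIZE = 1580742931  # bytes


def md5_of_file(path, chunk_size=1 << 20):
    """Return the hex MD5 digest of a file, reading it in chunks to bound memory use."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def verify_download(path, expected_md5=EXPECTED_MD5, expected_size=EXPECTED_SIZE):
    """Return True only if both the byte size and the MD5 digest match."""
    if os.path.getsize(path) != expected_size:
        return False  # cheap check first: a truncated download fails here
    return md5_of_file(path) == expected_md5.lower()


# Hypothetical usage, assuming the archive was saved to the working directory:
#   verify_download("data_voip_cs.tgz")
```

The size is compared first because it is cheap and catches truncated downloads before the whole file is hashed. The listed size of 1580742931 bytes corresponds to the displayed 1.47 GB when a gigabyte is taken as 2^30 bytes.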