Vystadial 2013 – Czech data

Creator: Jurčíček, Filip
Creator: Korvas, Matěj
Creator: Žilka, Lukáš
Creator: Dušek, Ondřej
Creator: Plátek, Ondřej
Date: 2014-02-21T10:42:18Z
dc.date.accessioned: 2021-07-25T12:04:00Z
dc.date.available: 2021-07-25T12:04:00Z
Identifier: http://hdl.handle.net/11858/00-097C-0000-0023-4670-6
dc.identifier.uri: https://linghub.org/handle/123456789/1042038
Description: This research was funded by the Ministry of Education, Youth and Sports of the Czech Republic under the grant agreement LK11221.
Description: Vystadial 2013 is a dataset of telephone conversations in English and Czech, developed for training acoustic models for automatic speech recognition in spoken dialogue systems. It ships in three parts: Czech data, English data, and scripts. The data comprise over 41 hours of speech in English and over 15 hours in Czech, plus orthographic transcriptions. The scripts implement data pre-processing and building acoustic models using the HTK and Kaldi toolkits. This is the Czech data part of the dataset.
Publisher: Charles University, Faculty of Mathematics and Physics
Rights: Attribution-ShareAlike 3.0 Unported (CC BY-SA 3.0)
Rights: http://creativecommons.org/licenses/by-sa/3.0/
Subject: spoken corpus
Subject: voip
Subject: orthographic transcriptions
Subject: dialogue system
Subject: acoustic data
Subject: telephone speech
Subject: speech corpus
Title: Vystadial 2013 – Czech data
Type: corpus
Type: Text
dcterms.available: 2014-02-21T10:42:18Z
dcterms.bibliographicCitation: http://hdl.handle.net/11858/00-097C-0000-0023-4670-6
dcterms.creator: Jurčíček, Filip
dcterms.creator: Korvas, Matěj
dcterms.creator: Žilka, Lukáš
dcterms.creator: Dušek, Ondřej
dcterms.creator: Plátek, Ondřej
dcterms.date: 2014-02-21T10:42:18Z
dcterms.description: This research was funded by the Ministry of Education, Youth and Sports of the Czech Republic under the grant agreement LK11221.
dcterms.description: Vystadial 2013 is a dataset of telephone conversations in English and Czech, developed for training acoustic models for automatic speech recognition in spoken dialogue systems. It ships in three parts: Czech data, English data, and scripts. The data comprise over 41 hours of speech in English and over 15 hours in Czech, plus orthographic transcriptions. The scripts implement data pre-processing and building acoustic models using the HTK and Kaldi toolkits. This is the Czech data part of the dataset.
dcterms.identifier: http://hdl.handle.net/11858/00-097C-0000-0023-4670-6
dcterms.isReplacedBy: http://hdl.handle.net/11234/1-1740
dcterms.publisher: Charles University, Faculty of Mathematics and Physics
dcterms.rights: Attribution-ShareAlike 3.0 Unported (CC BY-SA 3.0)
dcterms.rights: http://creativecommons.org/licenses/by-sa/3.0/
dcterms.subject: spoken corpus
dcterms.subject: voip
dcterms.subject: orthographic transcriptions
dcterms.subject: dialogue system
dcterms.subject: acoustic data
dcterms.subject: telephone speech
dcterms.subject: speech corpus
dcterms.title: Vystadial 2013 – Czech data
dcterms.type: corpus
dcterms.type: Text
odrl.Policy: http://purl.org/net/rdflicense/cc-by-sa3.0


This item appears in the following Collection(s)

  • OLAC
    Main data from the OLAC dataset

Copyright © 2020 All Rights Reserved by Prêt-à-LLOD Project.

Horizon 2020

This project has received funding from the European Union's Horizon 2020 research and innovation programme under grant agreement No 825182.