Creator | Jurčíček, Filip | |
Creator | Korvas, Matěj | |
Creator | Žilka, Lukáš | |
Creator | Dušek, Ondřej | |
Creator | Plátek, Ondřej | |
Date | 2014-02-21T10:42:18Z | |
dc.date.accessioned | 2021-07-25T12:04:00Z | |
dc.date.available | 2021-07-25T12:04:00Z | |
Identifier | http://hdl.handle.net/11858/00-097C-0000-0023-4670-6 | |
dc.identifier.uri | https://linghub.org/handle/123456789/1042038 | |
Description | This research was funded by the Ministry of
Education, Youth and Sports of the Czech Republic under the grant agreement
LK11221. | |
Description | Vystadial 2013 is a dataset of telephone conversations in English and Czech, developed for training acoustic models for automatic speech recognition in spoken dialogue systems. It ships in three parts: Czech data, English data, and scripts.
The data comprise over 41 hours of speech in English and over 15 hours in Czech, plus orthographic transcriptions. The scripts implement data pre-processing and building acoustic models using the HTK and Kaldi toolkits.
This is the Czech data part of the dataset. | |
Publisher | Charles University, Faculty of Mathematics and Physics | |
Rights | Attribution-ShareAlike 3.0 Unported (CC BY-SA 3.0) | |
Rights | http://creativecommons.org/licenses/by-sa/3.0/ | |
Subject | spoken corpus | |
Subject | voip | |
Subject | orthographic transcriptions | |
Subject | dialogue system | |
Subject | acoustic data | |
Subject | telephone speech | |
Subject | speech corpus | |
Title | Vystadial 2013 – Czech data | |
Type | corpus | |
Type | Text | |
dcterms.available | 2014-02-21T10:42:18Z | |
dcterms.bibliographicCitation | http://hdl.handle.net/11858/00-097C-0000-0023-4670-6 | |
dcterms.creator | Jurčíček, Filip | |
dcterms.creator | Korvas, Matěj | |
dcterms.creator | Žilka, Lukáš | |
dcterms.creator | Dušek, Ondřej | |
dcterms.creator | Plátek, Ondřej | |
dcterms.date | 2014-02-21T10:42:18Z | |
dcterms.description | This research was funded by the Ministry of
Education, Youth and Sports of the Czech Republic under the grant agreement
LK11221. | |
dcterms.description | Vystadial 2013 is a dataset of telephone conversations in English and Czech, developed for training acoustic models for automatic speech recognition in spoken dialogue systems. It ships in three parts: Czech data, English data, and scripts.
The data comprise over 41 hours of speech in English and over 15 hours in Czech, plus orthographic transcriptions. The scripts implement data pre-processing and building acoustic models using the HTK and Kaldi toolkits.
This is the Czech data part of the dataset. | |
dcterms.identifier | http://hdl.handle.net/11858/00-097C-0000-0023-4670-6 | |
dcterms.isReplacedBy | http://hdl.handle.net/11234/1-1740 | |
dcterms.publisher | Charles University, Faculty of Mathematics and Physics | |
dcterms.rights | Attribution-ShareAlike 3.0 Unported (CC BY-SA 3.0) | |
dcterms.rights | http://creativecommons.org/licenses/by-sa/3.0/ | |
dcterms.subject | spoken corpus | |
dcterms.subject | voip | |
dcterms.subject | orthographic transcriptions | |
dcterms.subject | dialogue system | |
dcterms.subject | acoustic data | |
dcterms.subject | telephone speech | |
dcterms.subject | speech corpus | |
dcterms.title | Vystadial 2013 – Czech data | |
dcterms.type | corpus | |
dcterms.type | Text | |
odrl.Policy | http://purl.org/net/rdflicense/cc-by-sa3.0 | |