dc.contributor.author |
Davel, MH
|
|
dc.contributor.author |
Van Heerden, C
|
|
dc.contributor.author |
Kleynhans, N
|
|
dc.contributor.author |
Barnard, E
|
|
dc.date.accessioned |
2012-04-16T15:22:53Z |
|
dc.date.available |
2012-04-16T15:22:53Z |
|
dc.date.issued |
2011-08 |
|
dc.identifier.citation |
Davel, MH, Van Heerden, C, Kleynhans, N and Barnard, E. Efficient harvesting of Internet audio for resource-scarce ASR. 12 Annual Conference of the International Speech Communication Association (Interspeech 2011), Florence, Italy, 27-31 August 2011 |
en_US |
dc.identifier.isbn |
9781618392701 |
|
dc.identifier.uri |
http://hdl.handle.net/10204/5769
|
|
dc.description |
12 Annual Conference of the International Speech Communication Association (Interspeech 2011), Florence, Italy, 27-31 August 2011 |
en_US |
dc.description.abstract |
Spoken recordings that have been transcribed for human reading (e.g. as captions for audiovisual material, or to provide alternative modes of access to recordings) are widely available in many languages. Such recordings and transcriptions have proven to be a valuable source of ASR data in well-resourced languages, but have not been exploited to a significant extent in under-resourced languages or dialects. Techniques used to harvest such data typically assume the availability of a fairly accurate ASR system, which is generally not available when working with resourcescarce languages. In this work, the authors define a process whereby an ASR corpus is bootstrapped using unmatched ASR models in conjunction with speech and approximate transcriptions sourced from the Internet. They introduce a new segmentation technique based on the use of a phone-internal garbage model, and demonstrate how this technique (combined with limited filtering) can be used to develop a large, high-quality corpus in an underresourced dialect with minimal effort. |
en_US |
dc.language.iso |
en |
en_US |
dc.publisher |
The International Speech Communication Association |
en_US |
dc.relation.ispartofseries |
Workflow;7187 |
|
dc.subject |
Speech recognition |
en_US |
dc.subject |
Under-resourced languages |
en_US |
dc.subject |
Garbage modeling |
en_US |
dc.subject |
Automatic speech recognition (ASR) |
en_US |
dc.title |
Efficient harvesting of Internet audio for resource-scarce ASR |
en_US |
dc.type |
Conference Presentation |
en_US |
dc.identifier.apacitation |
Davel, M., Van Heerden, C., Kleynhans, N., & Barnard, E. (2011). Efficient harvesting of Internet audio for resource-scarce ASR. The International Speech Communication Association. http://hdl.handle.net/10204/5769 |
en_ZA |
dc.identifier.chicagocitation |
Davel, MH, C Van Heerden, N Kleynhans, and E Barnard. "Efficient harvesting of Internet audio for resource-scarce ASR." (2011): http://hdl.handle.net/10204/5769 |
en_ZA |
dc.identifier.vancouvercitation |
Davel M, Van Heerden C, Kleynhans N, Barnard E, Efficient harvesting of Internet audio for resource-scarce ASR; The International Speech Communication Association; 2011. http://hdl.handle.net/10204/5769 . |
en_ZA |
dc.identifier.ris |
TY - Conference Presentation
AU - Davel, MH
AU - Van Heerden, C
AU - Kleynhans, N
AU - Barnard, E
AB - Spoken recordings that have been transcribed for human reading (e.g. as captions for audiovisual material, or to provide alternative modes of access to recordings) are widely available in many languages. Such recordings and transcriptions have proven to be a valuable source of ASR data in well-resourced languages, but have not been exploited to a significant extent in under-resourced languages or dialects. Techniques used to harvest such data typically assume the availability of a fairly accurate ASR system, which is generally not available when working with resourcescarce languages. In this work, the authors define a process whereby an ASR corpus is bootstrapped using unmatched ASR models in conjunction with speech and approximate transcriptions sourced from the Internet. They introduce a new segmentation technique based on the use of a phone-internal garbage model, and demonstrate how this technique (combined with limited filtering) can be used to develop a large, high-quality corpus in an underresourced dialect with minimal effort.
DA - 2011-08
DB - ResearchSpace
DP - CSIR
KW - Speech recognition
KW - Under-resourced languages
KW - Garbage modeling
KW - Automatic speech recognition (ASR)
LK - https://researchspace.csir.co.za
PY - 2011
SM - 9781618392701
T1 - Efficient harvesting of Internet audio for resource-scarce ASR
TI - Efficient harvesting of Internet audio for resource-scarce ASR
UR - http://hdl.handle.net/10204/5769
ER -
|
en_ZA |