ResearchSpace

Efficient data selection for ASR

Show simple item record

dc.contributor.author Kleynhans, NT
dc.contributor.author Barnard, E
dc.date.accessioned 2015-10-15T08:00:36Z
dc.date.available 2015-10-15T08:00:36Z
dc.date.issued 2014-10
dc.identifier.citation Kleynhans, NT and Barnard, E. 2014. Efficient data selection for ASR. Language Resources and Evaluation, Vol 49(2), pp 327-353 en_US
dc.identifier.issn 1574-020X
dc.identifier.uri http://download.springer.com/static/pdf/560/art%253A10.1007%252Fs10579-014-9285-0.pdf?originUrl=http%3A%2F%2Flink.springer.com%2Farticle%2F10.1007%2Fs10579-014-9285-0&token2=exp=1444732418~acl=%2Fstatic%2Fpdf%2F560%2Fart%25253A10.1007%25252Fs10579-014-9285-0.pdf%3ForiginUrl%3Dhttp%253A%252F%252Flink.springer.com%252Farticle%252F10.1007%252Fs10579-014-9285-0*~hmac=b704e15194418c482b6a9f86ab0634f1b34ef560c7b6430923fa9ddde81a7d5b
dc.identifier.uri http://hdl.handle.net/10204/8181
dc.description 1. Copyright: 2014 Springer Verlag. Due to copyright restrictions, the attached PDF file only contains the abstract of the full text item. For access to the full text item, please consult the publisher's website. The definitive version of the work is published in Lang Resources & Evaluation journal, Vol 49(2), pp 327-353 en_US
dc.description.abstract Automatic speech recognition (ASR) technology has matured over the past few decades and has made significant impacts in a variety of fields, from assistive technologies to commercial products. However, ASR system development is a resource intensive activity and requires language resources in the form of text annotated audio recordings and pronunciation dictionaries. Unfortunately, many languages found in the developing world fall into the resource-scarce category and due to this resource scarcity the deployment of ASR systems in the developing world is severely inhibited. One approach to assist with resource-scarce ASR system development, is to select ‘‘useful’’ training samples which could reduce the resources needed to collect new corpora. In this work, we propose a new data selection framework which can be used to design a speech recognition corpus. We show for limited data sets, independent of language and bandwidth, the most effective strategy for data selection is frequency-matched selection and that the widely-used maximum entropy methods generally produced the least promising results. In our model, the frequency-matched selection method corresponds to a logarithmic relationship between accuracy and corpus size; we also investigated other model relationships, and found that a hyperbolic relationship (as suggested from simple asymptotic arguments in learning theory) may lead to somewhat better performance under certain conditions. en_US
dc.language.iso en en_US
dc.publisher Springer Verlag en_US
dc.relation.ispartofseries Workflow;14504
dc.subject Automatic speech recognition en_US
dc.subject ASR en_US
dc.subject Resource-scarce en_US
dc.subject Data selection en_US
dc.subject Corpus design en_US
dc.title Efficient data selection for ASR en_US
dc.type Article en_US
dc.identifier.apacitation Kleynhans, N., & Barnard, E. (2014). Efficient data selection for ASR. http://hdl.handle.net/10204/8181 en_ZA
dc.identifier.chicagocitation Kleynhans, NT, and E Barnard "Efficient data selection for ASR." (2014) http://hdl.handle.net/10204/8181 en_ZA
dc.identifier.vancouvercitation Kleynhans N, Barnard E. Efficient data selection for ASR. 2014; http://hdl.handle.net/10204/8181. en_ZA
dc.identifier.ris TY - Article AU - Kleynhans, NT AU - Barnard, E AB - Automatic speech recognition (ASR) technology has matured over the past few decades and has made significant impacts in a variety of fields, from assistive technologies to commercial products. However, ASR system development is a resource intensive activity and requires language resources in the form of text annotated audio recordings and pronunciation dictionaries. Unfortunately, many languages found in the developing world fall into the resource-scarce category and due to this resource scarcity the deployment of ASR systems in the developing world is severely inhibited. One approach to assist with resource-scarce ASR system development, is to select ‘‘useful’’ training samples which could reduce the resources needed to collect new corpora. In this work, we propose a new data selection framework which can be used to design a speech recognition corpus. We show for limited data sets, independent of language and bandwidth, the most effective strategy for data selection is frequency-matched selection and that the widely-used maximum entropy methods generally produced the least promising results. In our model, the frequency-matched selection method corresponds to a logarithmic relationship between accuracy and corpus size; we also investigated other model relationships, and found that a hyperbolic relationship (as suggested from simple asymptotic arguments in learning theory) may lead to somewhat better performance under certain conditions. DA - 2014-10 DB - ResearchSpace DP - CSIR KW - Automatic speech recognition KW - ASR KW - Resource-scarce KW - Data selection KW - Corpus design LK - https://researchspace.csir.co.za PY - 2014 SM - 1574-020X T1 - Efficient data selection for ASR TI - Efficient data selection for ASR UR - http://hdl.handle.net/10204/8181 ER - en_ZA


Files in this item

This item appears in the following Collection(s)

Show simple item record