Efficient data selection for ASR

Kleynhans, NT; Barnard, E

dc.contributor.author	Kleynhans, NT
dc.contributor.author	Barnard, E
dc.date.accessioned	2015-10-15T08:00:36Z
dc.date.available	2015-10-15T08:00:36Z
dc.date.issued	2014-10
dc.identifier.citation	Kleynhans, NT and Barnard, E. 2014. Efficient data selection for ASR. Language Resources and Evaluation, Vol 49(2), pp 327-353	en_US
dc.identifier.issn	1574-020X
dc.identifier.uri	http://download.springer.com/static/pdf/560/art%253A10.1007%252Fs10579-014-9285-0.pdf?originUrl=http%3A%2F%2Flink.springer.com%2Farticle%2F10.1007%2Fs10579-014-9285-0&token2=exp=1444732418~acl=%2Fstatic%2Fpdf%2F560%2Fart%25253A10.1007%25252Fs10579-014-9285-0.pdf%3ForiginUrl%3Dhttp%253A%252F%252Flink.springer.com%252Farticle%252F10.1007%252Fs10579-014-9285-0*~hmac=b704e15194418c482b6a9f86ab0634f1b34ef560c7b6430923fa9ddde81a7d5b
dc.identifier.uri	http://hdl.handle.net/10204/8181
dc.description	1. Copyright: 2014 Springer Verlag. Due to copyright restrictions, the attached PDF file only contains the abstract of the full text item. For access to the full text item, please consult the publisher's website. The definitive version of the work is published in Lang Resources & Evaluation journal, Vol 49(2), pp 327-353	en_US
dc.description.abstract	Automatic speech recognition (ASR) technology has matured over the past few decades and has made significant impacts in a variety of fields, from assistive technologies to commercial products. However, ASR system development is a resource intensive activity and requires language resources in the form of text annotated audio recordings and pronunciation dictionaries. Unfortunately, many languages found in the developing world fall into the resource-scarce category and due to this resource scarcity the deployment of ASR systems in the developing world is severely inhibited. One approach to assist with resource-scarce ASR system development, is to select ‘‘useful’’ training samples which could reduce the resources needed to collect new corpora. In this work, we propose a new data selection framework which can be used to design a speech recognition corpus. We show for limited data sets, independent of language and bandwidth, the most effective strategy for data selection is frequency-matched selection and that the widely-used maximum entropy methods generally produced the least promising results. In our model, the frequency-matched selection method corresponds to a logarithmic relationship between accuracy and corpus size; we also investigated other model relationships, and found that a hyperbolic relationship (as suggested from simple asymptotic arguments in learning theory) may lead to somewhat better performance under certain conditions.	en_US
dc.language.iso	en	en_US
dc.publisher	Springer Verlag	en_US
dc.relation.ispartofseries	Workflow;14504
dc.subject	Automatic speech recognition	en_US
dc.subject	ASR	en_US
dc.subject	Resource-scarce	en_US
dc.subject	Data selection	en_US
dc.subject	Corpus design	en_US
dc.title	Efficient data selection for ASR	en_US
dc.type	Article	en_US
dc.identifier.apacitation	Kleynhans, N., & Barnard, E. (2014). Efficient data selection for ASR. http://hdl.handle.net/10204/8181	en_ZA
dc.identifier.chicagocitation	Kleynhans, NT, and E Barnard "Efficient data selection for ASR." (2014) http://hdl.handle.net/10204/8181	en_ZA
dc.identifier.vancouvercitation	Kleynhans N, Barnard E. Efficient data selection for ASR. 2014; http://hdl.handle.net/10204/8181.	en_ZA
dc.identifier.ris	TY - Article AU - Kleynhans, NT AU - Barnard, E AB - Automatic speech recognition (ASR) technology has matured over the past few decades and has made significant impacts in a variety of fields, from assistive technologies to commercial products. However, ASR system development is a resource intensive activity and requires language resources in the form of text annotated audio recordings and pronunciation dictionaries. Unfortunately, many languages found in the developing world fall into the resource-scarce category and due to this resource scarcity the deployment of ASR systems in the developing world is severely inhibited. One approach to assist with resource-scarce ASR system development, is to select ‘‘useful’’ training samples which could reduce the resources needed to collect new corpora. In this work, we propose a new data selection framework which can be used to design a speech recognition corpus. We show for limited data sets, independent of language and bandwidth, the most effective strategy for data selection is frequency-matched selection and that the widely-used maximum entropy methods generally produced the least promising results. In our model, the frequency-matched selection method corresponds to a logarithmic relationship between accuracy and corpus size; we also investigated other model relationships, and found that a hyperbolic relationship (as suggested from simple asymptotic arguments in learning theory) may lead to somewhat better performance under certain conditions. DA - 2014-10 DB - ResearchSpace DP - CSIR KW - Automatic speech recognition KW - ASR KW - Resource-scarce KW - Data selection KW - Corpus design LK - https://researchspace.csir.co.za PY - 2014 SM - 1574-020X T1 - Efficient data selection for ASR TI - Efficient data selection for ASR UR - http://hdl.handle.net/10204/8181 ER -	en_ZA

Files in this item

Name: Kleynhans8_2014.pdf

Size: 285.4Kb

Format: PDF

View/Open

This item appears in the following Collection(s)

Journal Articles

Show simple item record

Browse

All of ResearchSpace
This Collection
- By Issue Date
- Authors
- Titles
- Subjects
- Publication Type
- Cluster
- Impact Area

Quick Links

Legislation and compliance

General Enquiries

Tel: + 27 12 841 2911
Email: callcentre@csir.co.za

Physical Address
Meiring Naudé Road
Brummeria
Pretoria
South Africa

Postal Address
PO Box 395
Pretoria 0001
South Africa

Social Connect

Resources on this site are free to download and reuse according to associated licensing provision. Please read the terms and conditions of usage of each resource.