Efficient data selection for ASR

Kleynhans, NT; Barnard, E

Efficient data selection for ASR

http://download.springer.com/static/pdf/560/art%253A10.1007%252Fs10579-014-9285-0.pdf?originUrl=http%3A%2F%2Flink.springer.com%2Farticle%2F10.1007%2Fs10579-014-9285-0&token2=exp=1444732418~acl=%2Fstatic%2Fpdf%2F560%2Fart%25253A10.1007%25252Fs10579-014-9285-0.pdf%3ForiginUrl%3Dhttp%253A%252F%252Flink.springer.com%252Farticle%252F10.1007%252Fs10579-014-9285-0*~hmac=b704e15194418c482b6a9f86ab0634f1b34ef560c7b6430923fa9ddde81a7d5b
http://hdl.handle.net/10204/8181

Abstract:

Automatic speech recognition (ASR) technology has matured over the past few decades and has made significant impacts in a variety of fields, from assistive technologies to commercial products. However, ASR system development is a resource intensive activity and requires language resources in the form of text annotated audio recordings and pronunciation dictionaries. Unfortunately, many languages found in the developing world fall into the resource-scarce category and due to this resource scarcity the deployment of ASR systems in the developing world is severely inhibited. One approach to assist with resource-scarce ASR system development, is to select ‘‘useful’’ training samples which could reduce the resources needed to collect new corpora. In this work, we propose a new data selection framework which can be used to design a speech recognition corpus. We show for limited data sets, independent of language and bandwidth, the most effective strategy for data selection is frequency-matched selection and that the widely-used maximum entropy methods generally produced the least promising results. In our model, the frequency-matched selection method corresponds to a logarithmic relationship between accuracy and corpus size; we also investigated other model relationships, and found that a hyperbolic relationship (as suggested from simple asymptotic arguments in learning theory) may lead to somewhat better performance under certain conditions.

Reference:

Kleynhans, NT and Barnard, E. 2014. Efficient data selection for ASR. Language Resources and Evaluation, Vol 49(2), pp 327-353

Kleynhans, N., & Barnard, E. (2014). Efficient data selection for ASR. http://hdl.handle.net/10204/8181

Kleynhans, NT, and E Barnard "Efficient data selection for ASR." (2014) http://hdl.handle.net/10204/8181

Kleynhans N, Barnard E. Efficient data selection for ASR. 2014; http://hdl.handle.net/10204/8181.

Download RIS

1. Copyright: 2014 Springer Verlag. Due to copyright restrictions, the attached PDF file only contains the abstract of the full text item. For access to the full text item, please consult the publisher's website. The definitive version of the work is published in Lang Resources & Evaluation journal, Vol 49(2), pp 327-353

Kleynhans, NT
Barnard, E

Oct 2014

Automatic speech recognition
ASR
Resource-scarce
Data selection
Corpus design

Show full item record

Files in this item

Kleynhans8_2014.pdf

This item appears in the following Collection(s)

Journal Articles

Browse

All of ResearchSpace
This Collection
- By Issue Date
- Authors
- Titles
- Subjects
- Publication Type
- Cluster
- Impact Area

Quick Links

Legislation and compliance

General Enquiries

Tel: + 27 12 841 2911
Email: callcentre@csir.co.za

Physical Address
Meiring Naudé Road
Brummeria
Pretoria
South Africa

Postal Address
PO Box 395
Pretoria 0001
South Africa

Social Connect

Resources on this site are free to download and reuse according to associated licensing provision. Please read the terms and conditions of usage of each resource.

Efficient data selection for ASR

Efficient data selection for ASR

This item appears in the following Collection(s)

Browse

All of ResearchSpace

This Collection

Quick Links

Legislation and compliance

General Enquiries

Social Connect