Cross-bandwidth adaptation for ASR systems

Kleynhans, N; Barnard, E

dc.contributor.author	Kleynhans, N
dc.contributor.author	Barnard, E
dc.date.accessioned	2014-03-07T10:32:50Z
dc.date.available	2014-03-07T10:32:50Z
dc.date.issued	2013-12
dc.identifier.citation	Kleynhans, N and Barnard, E. 2013. Cross-bandwidth adaptation for ASR systems. In: Conference Proceedings of the 24th Annual Symposium of the Pattern Recognition Association of South Africa, Johannesburg, South Africa, 3 December 2013	en_US
dc.identifier.uri	http://www.prasa.org/proceedings/2013/prasa2013-06.pdf
dc.identifier.uri	http://hdl.handle.net/10204/7271
dc.description	Conference Proceedings of the 24th Annual Symposium of the Pattern Recognition Association of South Africa, Johannesburg, South Africa, 3 December 2013	en_US
dc.description.abstract	Mismatches between application and training data greatly reduce the performance of automatic speech recognition (ASR) systems. However, collecting suitable amounts of in-domain and application-specific data for training is resource intensive and may not be feasible for resource-scarce environments. Utilising limited amounts of in-domain data and a combination of feature normalisation and acoustic model adaptation techniques has therefore found wide use in ASR systems. Various approaches have been proposed, and it is not clear when to make use of a particular approach given a specific amount of adaptation data. In this work we investigate the use of standard feature normalisation and model adaptation techniques, for the scenario where adaptation between narrow- and wide-band environments must be performed. Our investigation focuses on the dependence of the adaptation data amount and various adaptation techniques by systematically varying the adaptation data amount and com- paring the performance of various adaptation techniques. From this we establish a guideline which can be used by an ASR developer to choose the best adaptation technique given a size constraint on the adaptation data. In addition, we investigate the effectiveness of a novel channel normalisation technique and compare the performance with standard normalisation and adaptation techniques.	en_US
dc.language.iso	en	en_US
dc.publisher	PRASA 2013 Proceedings	en_US
dc.relation.ispartofseries	Workflow;12231
dc.subject	Automatic speech recognition	en_US
dc.subject	ASR	en_US
dc.subject	Speech Technologies	en_US
dc.title	Cross-bandwidth adaptation for ASR systems	en_US
dc.type	Conference Presentation	en_US
dc.identifier.apacitation	Kleynhans, N., & Barnard, E. (2013). Cross-bandwidth adaptation for ASR systems. PRASA 2013 Proceedings. http://hdl.handle.net/10204/7271	en_ZA
dc.identifier.chicagocitation	Kleynhans, N, and E Barnard. "Cross-bandwidth adaptation for ASR systems." (2013): http://hdl.handle.net/10204/7271	en_ZA
dc.identifier.vancouvercitation	Kleynhans N, Barnard E, Cross-bandwidth adaptation for ASR systems; PRASA 2013 Proceedings; 2013. http://hdl.handle.net/10204/7271 .	en_ZA
dc.identifier.ris	TY - Conference Presentation AU - Kleynhans, N AU - Barnard, E AB - Mismatches between application and training data greatly reduce the performance of automatic speech recognition (ASR) systems. However, collecting suitable amounts of in-domain and application-specific data for training is resource intensive and may not be feasible for resource-scarce environments. Utilising limited amounts of in-domain data and a combination of feature normalisation and acoustic model adaptation techniques has therefore found wide use in ASR systems. Various approaches have been proposed, and it is not clear when to make use of a particular approach given a specific amount of adaptation data. In this work we investigate the use of standard feature normalisation and model adaptation techniques, for the scenario where adaptation between narrow- and wide-band environments must be performed. Our investigation focuses on the dependence of the adaptation data amount and various adaptation techniques by systematically varying the adaptation data amount and com- paring the performance of various adaptation techniques. From this we establish a guideline which can be used by an ASR developer to choose the best adaptation technique given a size constraint on the adaptation data. In addition, we investigate the effectiveness of a novel channel normalisation technique and compare the performance with standard normalisation and adaptation techniques. DA - 2013-12 DB - ResearchSpace DP - CSIR KW - Automatic speech recognition KW - ASR KW - Speech Technologies LK - https://researchspace.csir.co.za PY - 2013 T1 - Cross-bandwidth adaptation for ASR systems TI - Cross-bandwidth adaptation for ASR systems UR - http://hdl.handle.net/10204/7271 ER -	en_ZA

Files in this item

Name: Kleynhans_2014.pdf

Size: 4.967Mb

Format: PDF

View/Open

This item appears in the following Collection(s)

Conference Publications

Show simple item record

Browse

All of ResearchSpace
This Collection
- By Issue Date
- Authors
- Titles
- Subjects
- Publication Type
- Cluster
- Impact Area

Quick Links

Legislation and compliance

General Enquiries

Tel: + 27 12 841 2911
Email: callcentre@csir.co.za

Physical Address
Meiring Naudé Road
Brummeria
Pretoria
South Africa

Postal Address
PO Box 395
Pretoria 0001
South Africa

Social Connect

Resources on this site are free to download and reuse according to associated licensing provision. Please read the terms and conditions of usage of each resource.