ResearchSpace

Cross-bandwidth adaptation for ASR systems

Show simple item record

dc.contributor.author Kleynhans, N
dc.contributor.author Barnard, E
dc.date.accessioned 2014-03-07T10:32:50Z
dc.date.available 2014-03-07T10:32:50Z
dc.date.issued 2013-12
dc.identifier.citation Kleynhans, N and Barnard, E. 2013. Cross-bandwidth adaptation for ASR systems. In: Conference Proceedings of the 24th Annual Symposium of the Pattern Recognition Association of South Africa, Johannesburg, South Africa, 3 December 2013 en_US
dc.identifier.uri http://www.prasa.org/proceedings/2013/prasa2013-06.pdf
dc.identifier.uri http://hdl.handle.net/10204/7271
dc.description Conference Proceedings of the 24th Annual Symposium of the Pattern Recognition Association of South Africa, Johannesburg, South Africa, 3 December 2013 en_US
dc.description.abstract Mismatches between application and training data greatly reduce the performance of automatic speech recognition (ASR) systems. However, collecting suitable amounts of in-domain and application-specific data for training is resource intensive and may not be feasible for resource-scarce environments. Utilising limited amounts of in-domain data and a combination of feature normalisation and acoustic model adaptation techniques has therefore found wide use in ASR systems. Various approaches have been proposed, and it is not clear when to make use of a particular approach given a specific amount of adaptation data. In this work we investigate the use of standard feature normalisation and model adaptation techniques, for the scenario where adaptation between narrow- and wide-band environments must be performed. Our investigation focuses on the dependence of the adaptation data amount and various adaptation techniques by systematically varying the adaptation data amount and com- paring the performance of various adaptation techniques. From this we establish a guideline which can be used by an ASR developer to choose the best adaptation technique given a size constraint on the adaptation data. In addition, we investigate the effectiveness of a novel channel normalisation technique and compare the performance with standard normalisation and adaptation techniques. en_US
dc.language.iso en en_US
dc.publisher PRASA 2013 Proceedings en_US
dc.relation.ispartofseries Workflow;12231
dc.subject Automatic speech recognition en_US
dc.subject ASR en_US
dc.subject Speech Technologies en_US
dc.title Cross-bandwidth adaptation for ASR systems en_US
dc.type Conference Presentation en_US
dc.identifier.apacitation Kleynhans, N., & Barnard, E. (2013). Cross-bandwidth adaptation for ASR systems. PRASA 2013 Proceedings. http://hdl.handle.net/10204/7271 en_ZA
dc.identifier.chicagocitation Kleynhans, N, and E Barnard. "Cross-bandwidth adaptation for ASR systems." (2013): http://hdl.handle.net/10204/7271 en_ZA
dc.identifier.vancouvercitation Kleynhans N, Barnard E, Cross-bandwidth adaptation for ASR systems; PRASA 2013 Proceedings; 2013. http://hdl.handle.net/10204/7271 . en_ZA
dc.identifier.ris TY - Conference Presentation AU - Kleynhans, N AU - Barnard, E AB - Mismatches between application and training data greatly reduce the performance of automatic speech recognition (ASR) systems. However, collecting suitable amounts of in-domain and application-specific data for training is resource intensive and may not be feasible for resource-scarce environments. Utilising limited amounts of in-domain data and a combination of feature normalisation and acoustic model adaptation techniques has therefore found wide use in ASR systems. Various approaches have been proposed, and it is not clear when to make use of a particular approach given a specific amount of adaptation data. In this work we investigate the use of standard feature normalisation and model adaptation techniques, for the scenario where adaptation between narrow- and wide-band environments must be performed. Our investigation focuses on the dependence of the adaptation data amount and various adaptation techniques by systematically varying the adaptation data amount and com- paring the performance of various adaptation techniques. From this we establish a guideline which can be used by an ASR developer to choose the best adaptation technique given a size constraint on the adaptation data. In addition, we investigate the effectiveness of a novel channel normalisation technique and compare the performance with standard normalisation and adaptation techniques. DA - 2013-12 DB - ResearchSpace DP - CSIR KW - Automatic speech recognition KW - ASR KW - Speech Technologies LK - https://researchspace.csir.co.za PY - 2013 T1 - Cross-bandwidth adaptation for ASR systems TI - Cross-bandwidth adaptation for ASR systems UR - http://hdl.handle.net/10204/7271 ER - en_ZA


Files in this item

This item appears in the following Collection(s)

Show simple item record