ResearchSpace

Under-resourced speech recognition based on the speech manifold

Show simple item record

dc.contributor.author Sahraeian, R
dc.contributor.author van Compernolle, D
dc.contributor.author De Wet, Febe
dc.date.accessioned 2016-05-16T10:17:32Z
dc.date.available 2016-05-16T10:17:32Z
dc.date.issued 2015-09
dc.identifier.citation Sahraeian, R, van Compernolle, D and de Wet, F. Under-resourced speech recognition based on the speech manifold. In: 16th Annual Conference of the International Speech Communication Association (Interspeech 2015), Dresden, Germany, September 6-10, 2015, 1255-1259 en_US
dc.identifier.uri https://lirias.kuleuven.be/bitstream/123456789/510516/1/3955_final.pdf
dc.identifier.uri http://hdl.handle.net/10204/8542
dc.description 16th Annual Conference of the International Speech Communication Association (Interspeech 2015), Dresden, Germany, September 6-10, 2015. Due to copyright restrictions, the attached PDF file only contains the abstract of the full text item. For access to the full text item, please consult the publisher's website en_US
dc.description.abstract Conventional acoustic modeling involves estimating many parameters to effectively model feature distributions. The sparseness of speech and text data, however, degrades the reliability of the estimation process and makes speech recognition a challenging task. In this paper, we propose to use a nonlinear feature transformation based on the speech manifold called Intrinsic Spectral Analysis (ISA) for under-resourced speech recognition. First, the authors investigate the usefulness of ISA features in low resource scenarios for both Gaussian mixture and deep neural network (DNN) acoustic modeling. Moreover, due to the connection of ISA features to the articulatory configuration space, this feature space is potentially less language dependent than other typical spectral-based features, and therefore exploiting out-of-language data in this feature space is beneficial. They demonstrate the positive effect of ISA in the frame work of multilingual DNN systems where Flemish and Afrikaans are used as donor and under-resourced target languages respectively. The authors compare the performance of ISA with conventional features in both multilingual and under-resourced monolingual conditions. en_US
dc.language.iso en en_US
dc.publisher International Speech Communication Association en_US
dc.relation.ispartofseries Workflow;15696
dc.subject Under-resourced speech recognition en_US
dc.subject Intrinsic spectral analysis en_US
dc.subject Multilingual deep neural network en_US
dc.title Under-resourced speech recognition based on the speech manifold en_US
dc.type Conference Presentation en_US
dc.identifier.apacitation Sahraeian, R., van Compernolle, D., & De Wet, F. (2015). Under-resourced speech recognition based on the speech manifold. International Speech Communication Association. http://hdl.handle.net/10204/8542 en_ZA
dc.identifier.chicagocitation Sahraeian, R, D van Compernolle, and Febe De Wet. "Under-resourced speech recognition based on the speech manifold." (2015): http://hdl.handle.net/10204/8542 en_ZA
dc.identifier.vancouvercitation Sahraeian R, van Compernolle D, De Wet F, Under-resourced speech recognition based on the speech manifold; International Speech Communication Association; 2015. http://hdl.handle.net/10204/8542 . en_ZA
dc.identifier.ris TY - Conference Presentation AU - Sahraeian, R AU - van Compernolle, D AU - De Wet, Febe AB - Conventional acoustic modeling involves estimating many parameters to effectively model feature distributions. The sparseness of speech and text data, however, degrades the reliability of the estimation process and makes speech recognition a challenging task. In this paper, we propose to use a nonlinear feature transformation based on the speech manifold called Intrinsic Spectral Analysis (ISA) for under-resourced speech recognition. First, the authors investigate the usefulness of ISA features in low resource scenarios for both Gaussian mixture and deep neural network (DNN) acoustic modeling. Moreover, due to the connection of ISA features to the articulatory configuration space, this feature space is potentially less language dependent than other typical spectral-based features, and therefore exploiting out-of-language data in this feature space is beneficial. They demonstrate the positive effect of ISA in the frame work of multilingual DNN systems where Flemish and Afrikaans are used as donor and under-resourced target languages respectively. The authors compare the performance of ISA with conventional features in both multilingual and under-resourced monolingual conditions. DA - 2015-09 DB - ResearchSpace DP - CSIR KW - Under-resourced speech recognition KW - Intrinsic spectral analysis KW - Multilingual deep neural network LK - https://researchspace.csir.co.za PY - 2015 T1 - Under-resourced speech recognition based on the speech manifold TI - Under-resourced speech recognition based on the speech manifold UR - http://hdl.handle.net/10204/8542 ER - en_ZA


Files in this item

This item appears in the following Collection(s)

Show simple item record