Under-resourced speech recognition based on the speech manifold

Sahraeian, R; van Compernolle, D; De Wet, Febe

Under-resourced speech recognition based on the speech manifold

https://lirias.kuleuven.be/bitstream/123456789/510516/1/3955_final.pdf
http://hdl.handle.net/10204/8542

Abstract:

Conventional acoustic modeling involves estimating many parameters to effectively model feature distributions. The sparseness of speech and text data, however, degrades the reliability of the estimation process and makes speech recognition a challenging task. In this paper, we propose to use a nonlinear feature transformation based on the speech manifold called Intrinsic Spectral Analysis (ISA) for under-resourced speech recognition. First, the authors investigate the usefulness of ISA features in low resource scenarios for both Gaussian mixture and deep neural network (DNN) acoustic modeling. Moreover, due to the connection of ISA features to the articulatory configuration space, this feature space is potentially less language dependent than other typical spectral-based features, and therefore exploiting out-of-language data in this feature space is beneficial. They demonstrate the positive effect of ISA in the frame work of multilingual DNN systems where Flemish and Afrikaans are used as donor and under-resourced target languages respectively. The authors compare the performance of ISA with conventional features in both multilingual and under-resourced monolingual conditions.

Reference:

Sahraeian, R, van Compernolle, D and de Wet, F. Under-resourced speech recognition based on the speech manifold. In: 16th Annual Conference of the International Speech Communication Association (Interspeech 2015), Dresden, Germany, September 6-10, 2015, 1255-1259

Sahraeian, R., van Compernolle, D., & De Wet, F. (2015). Under-resourced speech recognition based on the speech manifold. International Speech Communication Association. http://hdl.handle.net/10204/8542

Sahraeian, R, D van Compernolle, and Febe De Wet. "Under-resourced speech recognition based on the speech manifold." (2015): http://hdl.handle.net/10204/8542

Sahraeian R, van Compernolle D, De Wet F, Under-resourced speech recognition based on the speech manifold; International Speech Communication Association; 2015. http://hdl.handle.net/10204/8542 .

Download RIS

16th Annual Conference of the International Speech Communication Association (Interspeech 2015), Dresden, Germany, September 6-10, 2015. Due to copyright restrictions, the attached PDF file only contains the abstract of the full text item. For access to the full text item, please consult the publisher's website

Sahraeian, R
van Compernolle, D
De Wet, Febe

Sep 2015

Under-resourced speech recognition
Intrinsic spectral analysis
Multilingual deep neural network

Show full item record

Files in this item

de Wet1_2015_ABSTRACT.pdf

This item appears in the following Collection(s)

Conference Publications

Browse

All of ResearchSpace
This Collection
- By Issue Date
- Authors
- Titles
- Subjects
- Publication Type
- Cluster
- Impact Area

Quick Links

Legislation and compliance

General Enquiries

Tel: + 27 12 841 2911
Email: callcentre@csir.co.za

Physical Address
Meiring Naudé Road
Brummeria
Pretoria
South Africa

Postal Address
PO Box 395
Pretoria 0001
South Africa

Social Connect

Resources on this site are free to download and reuse according to associated licensing provision. Please read the terms and conditions of usage of each resource.

Under-resourced speech recognition based on the speech manifold

Under-resourced speech recognition based on the speech manifold

This item appears in the following Collection(s)

Browse

All of ResearchSpace

This Collection

Quick Links

Legislation and compliance

General Enquiries

Social Connect