Comparing grapheme-based and phoneme-based speech recognition for Afrikaans

Basson, WD; Davel, MH

http://www.prasa.org/index.php/2012-03-07-10-55-15
http://hdl.handle.net/10204/6492

Abstract:

This paper compares the recognition accuracy of a phoneme-based automatic speech recognition (ASR) system with that of a grapheme-based system, using Afrikaans as a case study. The first system is developed using a conventional pronunciation dictionary, while the latter uses the letters of each word directly as the acoustic units to be modelled. We ensure that the pronunciation dictionary we use is highly accurate and then investigate the extent to which ASR performance degrades when the dictionary is removed. We analyse this effect at different data set sizes and classify the causes of performance degradation. Grapheme-based ASR outperforms phoneme-based ASR in certain word categories, and we find that relative error rates are highly dependent on word category, which points towards strategies for compensating for grapheme-based inaccuracies.
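
The grapheme-based setup described in the abstract can be illustrated with a minimal sketch. It is not taken from the paper: it only shows how a word list might be expanded into letter sequences that stand in for the entries of a pronunciation dictionary. The function name `grapheme_lexicon`, the lowercasing, and the dropping of non-letter characters are assumptions made for illustration.

```python
def grapheme_lexicon(words):
    """Map each word to the sequence of its letters (hypothetical helper)."""
    lexicon = {}
    for word in words:
        # Assumption: lowercase the word and keep alphabetic characters only,
        # so hyphens and apostrophes do not become acoustic units.
        units = [ch for ch in word.lower() if ch.isalpha()]
        if units:
            lexicon[word] = units
    return lexicon


# Print one lexicon line per word: the word followed by its grapheme units.
for word, units in grapheme_lexicon(["kat", "hond", "na-aap"]).items():
    print(word, " ".join(units))
```

In a phoneme-based system the right-hand side of each entry would instead come from a hand-checked pronunciation dictionary, which is the resource the grapheme-based approach avoids.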

Reference:

Basson, W.D. and Davel, M.H. 2012. Comparing grapheme-based and phoneme-based speech recognition for Afrikaans. In: PRASA 2012, CSIR International Convention Centre, Pretoria, 29-30 November 2012.

Conference:

PRASA 2012, CSIR International Convention Centre, Pretoria, 29-30 November 2012

Authors:

Basson, WD
Davel, MH

Date:

Nov 2012

Keywords:

Automatic speech recognition
ASR
Grapheme-based system
