ResearchSpace

Perceptual evaluation of corpus-based speech synthesis techniques in under-resourced environments

Show simple item record

dc.contributor.author Van Niekerk, DR
dc.contributor.author Barnard, E
dc.contributor.author Schlunz, Georg I
dc.date.accessioned 2010-01-08T07:49:21Z
dc.date.available 2010-01-08T07:49:21Z
dc.date.issued 2009-11
dc.identifier.citation Van Niekerk, DR, Barnard, E and Schlunz, G. 2009. Perceptual evaluation of corpus-based speech synthesis techniques in under-resourced environments. 20th Annual Symposium of the Pattern Recognition Association of South Africa (PRASA). Stellenbosch, South Africa, 30 November - 01 December 2009, pp 71-75 en
dc.identifier.uri http://hdl.handle.net/10204/3852
dc.description 20th Annual Symposium of the Pattern Recognition Association of South Africa (PRASA). Stellenbosch, South Africa, 30 November - 01 December 2009 en
dc.description.abstract With the increasing prominence and maturity of corpus-based techniques for speech synthesis, the process of system development has in some ways been simplified considerably. However, the dependence on sufficient amounts of relevant speech data of high quality remains a central challenge in under-resourced environments. In this paper the authors investigate the quality implications when building baseline synthesis systems with reduced amounts of speech data. This is done through a perceptual evaluation of synthesis systems based on unit-selection and statistical parametric synthesis techniques. The authors show that - although it is possible to build an acceptable unit-selection synthesizer with as little as 27 minutes of carefully recorded speech data - synthesis quality obtainable from Hidden Markov Model-based synthesis is more consistent and requires significantly less speech data. en
dc.language.iso en en
dc.publisher PRASA 2009 en
dc.subject Speech synthesis techniques en
dc.subject Under-resourced environments en
dc.subject Perceptual evaluation en
dc.subject Speech data en
dc.subject Hidden markov models en
dc.subject PRASA 2009 en
dc.title Perceptual evaluation of corpus-based speech synthesis techniques in under-resourced environments en
dc.type Conference Presentation en
dc.identifier.apacitation Van Niekerk, D., Barnard, E., & Schlunz, G. I. (2009). Perceptual evaluation of corpus-based speech synthesis techniques in under-resourced environments. PRASA 2009. http://hdl.handle.net/10204/3852 en_ZA
dc.identifier.chicagocitation Van Niekerk, DR, E Barnard, and Georg I Schlunz. "Perceptual evaluation of corpus-based speech synthesis techniques in under-resourced environments." (2009): http://hdl.handle.net/10204/3852 en_ZA
dc.identifier.vancouvercitation Van Niekerk D, Barnard E, Schlunz GI, Perceptual evaluation of corpus-based speech synthesis techniques in under-resourced environments; PRASA 2009; 2009. http://hdl.handle.net/10204/3852 . en_ZA
dc.identifier.ris TY - Conference Presentation AU - Van Niekerk, DR AU - Barnard, E AU - Schlunz, Georg I AB - With the increasing prominence and maturity of corpus-based techniques for speech synthesis, the process of system development has in some ways been simplified considerably. However, the dependence on sufficient amounts of relevant speech data of high quality remains a central challenge in under-resourced environments. In this paper the authors investigate the quality implications when building baseline synthesis systems with reduced amounts of speech data. This is done through a perceptual evaluation of synthesis systems based on unit-selection and statistical parametric synthesis techniques. The authors show that - although it is possible to build an acceptable unit-selection synthesizer with as little as 27 minutes of carefully recorded speech data - synthesis quality obtainable from Hidden Markov Model-based synthesis is more consistent and requires significantly less speech data. DA - 2009-11 DB - ResearchSpace DP - CSIR KW - Speech synthesis techniques KW - Under-resourced environments KW - Perceptual evaluation KW - Speech data KW - Hidden markov models KW - PRASA 2009 LK - https://researchspace.csir.co.za PY - 2009 T1 - Perceptual evaluation of corpus-based speech synthesis techniques in under-resourced environments TI - Perceptual evaluation of corpus-based speech synthesis techniques in under-resourced environments UR - http://hdl.handle.net/10204/3852 ER - en_ZA


Files in this item

This item appears in the following Collection(s)

Show simple item record