Speaker specific phrase break modeling with conditional random fields for text-to-speech

Louw, Johannes A; Moodley, Avashlin

https://ieeexplore.ieee.org/document/7813163
DOI: 10.1109/RoboMech.2016.7813163
http://hdl.handle.net/10204/10966

Abstract:

In this paper we present a new cascading conditional random field-based phrase break model for text-to-speech systems, trained on the same speaker-specific acoustic data that the text-to-speech voices are trained on. The training phase does not require any manually labeled phrase break tags, as these are derived directly from the speaker-specific recordings used to build the synthetic voices. We present objective evaluations on various corpora and show that the proposed model compares well with state-of-the-art data-driven phrase break models, with the added benefit of being in a unified framework.

Reference:

Louw, J.A. & Moodley, A. 2016. Speaker specific phrase break modeling with conditional random fields for text-to-speech. In: 2016 Pattern Recognition Association of South Africa and Robotics and Mechatronics International Conference, 30 November - 2 December 2016, Stellenbosch, South Africa

Presented in: 2016 Pattern Recognition Association of South Africa and Robotics and Mechatronics International Conference, 30 November - 2 December 2016, Stellenbosch, South Africa. Due to copyright restrictions, the attached PDF file contains only the abstract of the full-text item while the post-print or published PDF document is awaited from the publisher. For access to the full-text item, please consult the publisher's website.

Dec 2016

Text-to-speech systems
Phrase breaks
Prosodic phrasing
