Neural network based generation of fundamental frequency contours

Michael S. Scordilis, John N. Gowdy

Research output: Contribution to journalConference articlepeer-review

33 Scopus citations

Abstract

Although a number of algorithms exist for the generation of the fundamental frequency contour in automatic text-to-speech conversion systems, the absence of a general theory of intonation still prevents the correct derivation of this important feature in unrestricted text applications. A parallel distributed approach is presented in which two neural networks were designed to learn the F0 values for each phoneme and the F0 fluctuations within each phoneme for words that correspond to a small training set. The neural networks used for this task have demonstrated the ability to generalize their properties on new text, and their level of success depends on the composition and size of the training corpus.

Original languageEnglish (US)
Pages (from-to)219-222
Number of pages4
JournalICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings
Volume1
StatePublished - Dec 1 1989
Event1989 International Conference on Acoustics, Speech, and Signal Processing - Glasgow, Scotland
Duration: May 23 1989May 26 1989

ASJC Scopus subject areas

  • Software
  • Signal Processing
  • Electrical and Electronic Engineering

Fingerprint Dive into the research topics of 'Neural network based generation of fundamental frequency contours'. Together they form a unique fingerprint.

Cite this