Spoken emotion classification using ToBI features and GMM

Alexander I. Iliev, Yongxin Zhang, Michael S. Scordilis

Research output: Chapter in Book/Report/Conference proceedingConference contribution

9 Scopus citations

Abstract

This study investigated the usefulness of ToBI marks in determining the emotional state conveyed in speech. The Gaussian Mixture Model GMM used was as the classifier structure. A total of three different classification systems were developed based on the use of three different feature vectors. They were: (a) the classical approach that used signal pitch and energy features; (b) a ToBI-only feature based on tone and break tiers; and (c) a system that used the features of both (a) and (b). In ToBI, tone tier elements were automatically determined using pitch information. Three emotional states were investigated: Happiness, Anger, and Sadness. The overall success rate achieved for the combined system was between 75% and 100%. This work indicated that the ToBI features alone were very useful for the classification of emotion, and detection improves when classical features are used in conjunction with ToBI.

Original languageEnglish (US)
Title of host publication2007 IWSSIP and EC-SIPMCS - Proc. 2007 14th Int. Workshop on Systems, Signals and Image Processing, and 6th EURASIP Conf. Focused on Speech and Image Processing, Multimedia Communications and Services
Pages495-498
Number of pages4
DOIs
StatePublished - Dec 1 2007
Event14th International Conference on Systems Signals and Image Processing, IWSSIP 2007 and 6th EURASIP Conference Focused on Speech and Image Processing, Multimedia Communications and Services, EC-SIPMCS 2007 - Maribor, Slovenia
Duration: Jun 27 2007Jun 30 2007

Publication series

Name2007 IWSSIP and EC-SIPMCS - Proc. 2007 14th Int. Workshop on Systems, Signals and Image Processing, and 6th EURASIP Conf. Focused on Speech and Image Processing, Multimedia Communications and Services

Other

Other14th International Conference on Systems Signals and Image Processing, IWSSIP 2007 and 6th EURASIP Conference Focused on Speech and Image Processing, Multimedia Communications and Services, EC-SIPMCS 2007
CountrySlovenia
CityMaribor
Period6/27/076/30/07

ASJC Scopus subject areas

  • Computer Vision and Pattern Recognition
  • Signal Processing
  • Control and Systems Engineering

Fingerprint Dive into the research topics of 'Spoken emotion classification using ToBI features and GMM'. Together they form a unique fingerprint.

  • Cite this

    Iliev, A. I., Zhang, Y., & Scordilis, M. S. (2007). Spoken emotion classification using ToBI features and GMM. In 2007 IWSSIP and EC-SIPMCS - Proc. 2007 14th Int. Workshop on Systems, Signals and Image Processing, and 6th EURASIP Conf. Focused on Speech and Image Processing, Multimedia Communications and Services (pp. 495-498). [4381149] (2007 IWSSIP and EC-SIPMCS - Proc. 2007 14th Int. Workshop on Systems, Signals and Image Processing, and 6th EURASIP Conf. Focused on Speech and Image Processing, Multimedia Communications and Services). https://doi.org/10.1109/IWSSIP.2007.4381149