Paralinguistic analysis of children's speech in natural environments

Hrishikesh Rao, Mark A. Clements, Yin Li, Meghan R. Swanson, Joseph Piven, Daniel S Messinger

Research output: Chapter in Book/Report/Conference proceedingChapter

Abstract

Paralinguistic cues are the non-phonemic aspects of human speech that convey information about the affective state of the speaker. In children's speech, these events are also important markers for the detection of early developmental disorders. Detecting these events in hours of audio data would be beneficial for clinicians to analyze the social behaviors of children. The chapter focuses on the use of spectral and prosodic baseline acoustic features to classify instances of children's laughter and fussing/crying while interacting with their caregivers in naturalistic settings. In conjunction with baseline features, long-term intensity-based features, that capture the periodic structure of laughter, enable in detecting instances of laughter to a reasonably high degree of accuracy in a variety of classification tasks.

Original languageEnglish (US)
Title of host publicationMobile Health
Subtitle of host publicationSensors, Analytic Methods, and Applications
PublisherSpringer International Publishing
Pages219-238
Number of pages20
ISBN (Electronic)9783319513942
ISBN (Print)9783319513935
DOIs
StatePublished - Jul 12 2017

Fingerprint

Laughter
Periodic structures
Crying
Social Behavior
Acoustics
Caregivers
Cues

ASJC Scopus subject areas

  • Medicine(all)
  • Computer Science(all)

Cite this

Rao, H., Clements, M. A., Li, Y., Swanson, M. R., Piven, J., & Messinger, D. S. (2017). Paralinguistic analysis of children's speech in natural environments. In Mobile Health: Sensors, Analytic Methods, and Applications (pp. 219-238). Springer International Publishing. https://doi.org/10.1007/978-3-319-51394-2_12

Paralinguistic analysis of children's speech in natural environments. / Rao, Hrishikesh; Clements, Mark A.; Li, Yin; Swanson, Meghan R.; Piven, Joseph; Messinger, Daniel S.

Mobile Health: Sensors, Analytic Methods, and Applications. Springer International Publishing, 2017. p. 219-238.

Research output: Chapter in Book/Report/Conference proceedingChapter

Rao, H, Clements, MA, Li, Y, Swanson, MR, Piven, J & Messinger, DS 2017, Paralinguistic analysis of children's speech in natural environments. in Mobile Health: Sensors, Analytic Methods, and Applications. Springer International Publishing, pp. 219-238. https://doi.org/10.1007/978-3-319-51394-2_12
Rao H, Clements MA, Li Y, Swanson MR, Piven J, Messinger DS. Paralinguistic analysis of children's speech in natural environments. In Mobile Health: Sensors, Analytic Methods, and Applications. Springer International Publishing. 2017. p. 219-238 https://doi.org/10.1007/978-3-319-51394-2_12
Rao, Hrishikesh ; Clements, Mark A. ; Li, Yin ; Swanson, Meghan R. ; Piven, Joseph ; Messinger, Daniel S. / Paralinguistic analysis of children's speech in natural environments. Mobile Health: Sensors, Analytic Methods, and Applications. Springer International Publishing, 2017. pp. 219-238
@inbook{372a24b6ee2c4f4aa68dd4de5048031d,
title = "Paralinguistic analysis of children's speech in natural environments",
abstract = "Paralinguistic cues are the non-phonemic aspects of human speech that convey information about the affective state of the speaker. In children's speech, these events are also important markers for the detection of early developmental disorders. Detecting these events in hours of audio data would be beneficial for clinicians to analyze the social behaviors of children. The chapter focuses on the use of spectral and prosodic baseline acoustic features to classify instances of children's laughter and fussing/crying while interacting with their caregivers in naturalistic settings. In conjunction with baseline features, long-term intensity-based features, that capture the periodic structure of laughter, enable in detecting instances of laughter to a reasonably high degree of accuracy in a variety of classification tasks.",
author = "Hrishikesh Rao and Clements, {Mark A.} and Yin Li and Swanson, {Meghan R.} and Joseph Piven and Messinger, {Daniel S}",
year = "2017",
month = "7",
day = "12",
doi = "10.1007/978-3-319-51394-2_12",
language = "English (US)",
isbn = "9783319513935",
pages = "219--238",
booktitle = "Mobile Health",
publisher = "Springer International Publishing",

}

TY - CHAP

T1 - Paralinguistic analysis of children's speech in natural environments

AU - Rao, Hrishikesh

AU - Clements, Mark A.

AU - Li, Yin

AU - Swanson, Meghan R.

AU - Piven, Joseph

AU - Messinger, Daniel S

PY - 2017/7/12

Y1 - 2017/7/12

N2 - Paralinguistic cues are the non-phonemic aspects of human speech that convey information about the affective state of the speaker. In children's speech, these events are also important markers for the detection of early developmental disorders. Detecting these events in hours of audio data would be beneficial for clinicians to analyze the social behaviors of children. The chapter focuses on the use of spectral and prosodic baseline acoustic features to classify instances of children's laughter and fussing/crying while interacting with their caregivers in naturalistic settings. In conjunction with baseline features, long-term intensity-based features, that capture the periodic structure of laughter, enable in detecting instances of laughter to a reasonably high degree of accuracy in a variety of classification tasks.

AB - Paralinguistic cues are the non-phonemic aspects of human speech that convey information about the affective state of the speaker. In children's speech, these events are also important markers for the detection of early developmental disorders. Detecting these events in hours of audio data would be beneficial for clinicians to analyze the social behaviors of children. The chapter focuses on the use of spectral and prosodic baseline acoustic features to classify instances of children's laughter and fussing/crying while interacting with their caregivers in naturalistic settings. In conjunction with baseline features, long-term intensity-based features, that capture the periodic structure of laughter, enable in detecting instances of laughter to a reasonably high degree of accuracy in a variety of classification tasks.

UR - http://www.scopus.com/inward/record.url?scp=85055379839&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=85055379839&partnerID=8YFLogxK

U2 - 10.1007/978-3-319-51394-2_12

DO - 10.1007/978-3-319-51394-2_12

M3 - Chapter

AN - SCOPUS:85055379839

SN - 9783319513935

SP - 219

EP - 238

BT - Mobile Health

PB - Springer International Publishing

ER -