Articulation, acoustics and perception of Mandarin Chinese emotional speech

Donna Erickson, Chunyue Zhu, Shigeto Kawahara, Atsuo Suemitsu

Research output: Contribution to journalArticle

4 Citations (Scopus)

Abstract

This paper studies articulatory, acoustic and perceptual characteristics of Mandarin Chinese emotional utterances as produced by two speakers, expressing Neutral, Angry, Sad and Happy emotions. Articulatory patterns were recorded using ElectroMagnetic Articulography (EMA), together with acoustic recordings. The acoustic and articulatory analysis revealed that Happy and Angry were generally higher-pitched, louder, and produced with a more open mouth than Neutral or Sad. Sad is produced with low back tongue dorsum position and Happy, with a forward position, and for one speaker, duration was longer for Angry and Sad. Moreover, F1 and F2 are more dispersed (i.e., hyperarticulated) in emotional speech than Neutral speech. Perception tests conducted with 18 native listeners suggest that listeners were able to perceive the expressed emotions far above chance level. The louder and higher pitched the utterance, the more emotional the speech tends to be perceived. We also explore specific articulatory and acoustic correlates of each type of emotional speech, and how they impact perception.

Original languageEnglish
Pages (from-to)620-635
Number of pages16
JournalOpen Linguistics
Volume2
Issue number1
DOIs
Publication statusPublished - 2016 Jan 1

Fingerprint

acoustics
listener
emotion
recording
Acoustics
Emotion
Mandarin Chinese
Articulation
Utterance
Listeners

Keywords

  • Acoustics
  • Articulation
  • Duration
  • Emotion
  • F0
  • F1
  • F2
  • Intensity
  • Jaw displacement
  • Mandarin Chinese
  • Perception
  • Tongue dorsum

ASJC Scopus subject areas

  • Language and Linguistics
  • Linguistics and Language

Cite this

Articulation, acoustics and perception of Mandarin Chinese emotional speech. / Erickson, Donna; Zhu, Chunyue; Kawahara, Shigeto; Suemitsu, Atsuo.

In: Open Linguistics, Vol. 2, No. 1, 01.01.2016, p. 620-635.

Research output: Contribution to journalArticle

Erickson, Donna ; Zhu, Chunyue ; Kawahara, Shigeto ; Suemitsu, Atsuo. / Articulation, acoustics and perception of Mandarin Chinese emotional speech. In: Open Linguistics. 2016 ; Vol. 2, No. 1. pp. 620-635.
@article{d715030b030248509e1bbe421fb3f65b,
title = "Articulation, acoustics and perception of Mandarin Chinese emotional speech",
abstract = "This paper studies articulatory, acoustic and perceptual characteristics of Mandarin Chinese emotional utterances as produced by two speakers, expressing Neutral, Angry, Sad and Happy emotions. Articulatory patterns were recorded using ElectroMagnetic Articulography (EMA), together with acoustic recordings. The acoustic and articulatory analysis revealed that Happy and Angry were generally higher-pitched, louder, and produced with a more open mouth than Neutral or Sad. Sad is produced with low back tongue dorsum position and Happy, with a forward position, and for one speaker, duration was longer for Angry and Sad. Moreover, F1 and F2 are more dispersed (i.e., hyperarticulated) in emotional speech than Neutral speech. Perception tests conducted with 18 native listeners suggest that listeners were able to perceive the expressed emotions far above chance level. The louder and higher pitched the utterance, the more emotional the speech tends to be perceived. We also explore specific articulatory and acoustic correlates of each type of emotional speech, and how they impact perception.",
keywords = "Acoustics, Articulation, Duration, Emotion, F0, F1, F2, Intensity, Jaw displacement, Mandarin Chinese, Perception, Tongue dorsum",
author = "Donna Erickson and Chunyue Zhu and Shigeto Kawahara and Atsuo Suemitsu",
year = "2016",
month = "1",
day = "1",
doi = "10.1515/opli-2016-0034",
language = "English",
volume = "2",
pages = "620--635",
journal = "Open Linguistics",
issn = "2300-9969",
publisher = "Walter de Gruyter GmbH",
number = "1",

}

TY - JOUR

T1 - Articulation, acoustics and perception of Mandarin Chinese emotional speech

AU - Erickson, Donna

AU - Zhu, Chunyue

AU - Kawahara, Shigeto

AU - Suemitsu, Atsuo

PY - 2016/1/1

Y1 - 2016/1/1

N2 - This paper studies articulatory, acoustic and perceptual characteristics of Mandarin Chinese emotional utterances as produced by two speakers, expressing Neutral, Angry, Sad and Happy emotions. Articulatory patterns were recorded using ElectroMagnetic Articulography (EMA), together with acoustic recordings. The acoustic and articulatory analysis revealed that Happy and Angry were generally higher-pitched, louder, and produced with a more open mouth than Neutral or Sad. Sad is produced with low back tongue dorsum position and Happy, with a forward position, and for one speaker, duration was longer for Angry and Sad. Moreover, F1 and F2 are more dispersed (i.e., hyperarticulated) in emotional speech than Neutral speech. Perception tests conducted with 18 native listeners suggest that listeners were able to perceive the expressed emotions far above chance level. The louder and higher pitched the utterance, the more emotional the speech tends to be perceived. We also explore specific articulatory and acoustic correlates of each type of emotional speech, and how they impact perception.

AB - This paper studies articulatory, acoustic and perceptual characteristics of Mandarin Chinese emotional utterances as produced by two speakers, expressing Neutral, Angry, Sad and Happy emotions. Articulatory patterns were recorded using ElectroMagnetic Articulography (EMA), together with acoustic recordings. The acoustic and articulatory analysis revealed that Happy and Angry were generally higher-pitched, louder, and produced with a more open mouth than Neutral or Sad. Sad is produced with low back tongue dorsum position and Happy, with a forward position, and for one speaker, duration was longer for Angry and Sad. Moreover, F1 and F2 are more dispersed (i.e., hyperarticulated) in emotional speech than Neutral speech. Perception tests conducted with 18 native listeners suggest that listeners were able to perceive the expressed emotions far above chance level. The louder and higher pitched the utterance, the more emotional the speech tends to be perceived. We also explore specific articulatory and acoustic correlates of each type of emotional speech, and how they impact perception.

KW - Acoustics

KW - Articulation

KW - Duration

KW - Emotion

KW - F0

KW - F1

KW - F2

KW - Intensity

KW - Jaw displacement

KW - Mandarin Chinese

KW - Perception

KW - Tongue dorsum

UR - http://www.scopus.com/inward/record.url?scp=85047250536&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=85047250536&partnerID=8YFLogxK

U2 - 10.1515/opli-2016-0034

DO - 10.1515/opli-2016-0034

M3 - Article

AN - SCOPUS:85047250536

VL - 2

SP - 620

EP - 635

JO - Open Linguistics

JF - Open Linguistics

SN - 2300-9969

IS - 1

ER -