TY - GEN
T1 - Emotional speech classification with prosodic prameters by using neural networks
AU - Sato, H.
AU - Mitsukura, Y.
AU - Fukumi, M.
AU - Akamatsu, N.
N1 - Publisher Copyright:
© 2001 ARCME, Univ of WA.
PY - 2001
Y1 - 2001
N2 - Interestingly, in order to achieve a new Human Interface such that digital computers can deal with the KASEI information, the study of the KANSEI information processing recently has been approached. In this paper, we propose a new classification method of emotional speech by analyzing feature parameters obtained from the emotional speech and by learning them using neural networks, which is regarded as a KANSEI information processing. In the present research, KANSEI information is usually human emotion. The emotion is classified broadly into four patterns such as neutral, anger, sad and joy. The pitch as one of feature parameters governs voice modulation, and can be sensitive to change of emotion. The pitch is extracted from each emotional speech by the cepstrum method. Input values of neural networks (NNs) are then emotional pitch patterns, which are time-varying. It is shown that NNs can achieve classification of emotion by learning each emotional pitch pattern by means of computer simulations.
AB - Interestingly, in order to achieve a new Human Interface such that digital computers can deal with the KASEI information, the study of the KANSEI information processing recently has been approached. In this paper, we propose a new classification method of emotional speech by analyzing feature parameters obtained from the emotional speech and by learning them using neural networks, which is regarded as a KANSEI information processing. In the present research, KANSEI information is usually human emotion. The emotion is classified broadly into four patterns such as neutral, anger, sad and joy. The pitch as one of feature parameters governs voice modulation, and can be sensitive to change of emotion. The pitch is extracted from each emotional speech by the cepstrum method. Input values of neural networks (NNs) are then emotional pitch patterns, which are time-varying. It is shown that NNs can achieve classification of emotion by learning each emotional pitch pattern by means of computer simulations.
UR - http://www.scopus.com/inward/record.url?scp=1542314530&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=1542314530&partnerID=8YFLogxK
U2 - 10.1109/ANZIIS.2001.974111
DO - 10.1109/ANZIIS.2001.974111
M3 - Conference contribution
AN - SCOPUS:1542314530
T3 - ANZIIS 2001 - Proceedings of the 7th Australian and New Zealand Intelligent Information Systems Conference
SP - 395
EP - 398
BT - ANZIIS 2001 - Proceedings of the 7th Australian and New Zealand Intelligent Information Systems Conference
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 7th Australian and New Zealand Intelligent Information Systems Conference, ANZIIS 2001
Y2 - 18 November 2001 through 21 November 2001
ER -