TY - GEN
T1 - Lip reading system using novel Japanese visemes classification and hierarchical weighted discrimination
AU - Okita, Shinsuke
AU - Mitsukura, Yasue
AU - Hamada, Nozomu
PY - 2013/12/1
Y1 - 2013/12/1
N2 - In recent years, automatic lip reading based on 'visemes' have been studied by researchers for realizing human-machine interactive communication system in many applications. However there are a lot of problems such as the definition of the number of viseme classes, discrimination method of visemes, speech recognition method based on visemes, and so on. In this paper, a novel classification of Japanese visemes and hierarchical weighted discrimination method for speech recognition are proposed to address these problems. We augmented the classification number of visemes from 6(conventional) to 9 to represent the words in more detailed by visemes. In addition, considering the difficulty in discriminating with increase of the number of visemes, the hierarchical weighted discrimination method is proposed. For the purpose of comparing with the conventional method, the ATR phonetically balanced word group, which is large vocabulary and includes various visemes, was used and applied to word recognition experiments. From these results, we confirmed the proposed method worked well.
AB - In recent years, automatic lip reading based on 'visemes' have been studied by researchers for realizing human-machine interactive communication system in many applications. However there are a lot of problems such as the definition of the number of viseme classes, discrimination method of visemes, speech recognition method based on visemes, and so on. In this paper, a novel classification of Japanese visemes and hierarchical weighted discrimination method for speech recognition are proposed to address these problems. We augmented the classification number of visemes from 6(conventional) to 9 to represent the words in more detailed by visemes. In addition, considering the difficulty in discriminating with increase of the number of visemes, the hierarchical weighted discrimination method is proposed. For the purpose of comparing with the conventional method, the ATR phonetically balanced word group, which is large vocabulary and includes various visemes, was used and applied to word recognition experiments. From these results, we confirmed the proposed method worked well.
KW - Image processing
KW - Pattern recognition
KW - lip reading
KW - visemes mouth-shape code
KW - visual speech recognition
UR - http://www.scopus.com/inward/record.url?scp=84894158868&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84894158868&partnerID=8YFLogxK
U2 - 10.1109/ISPACS.2013.6704608
DO - 10.1109/ISPACS.2013.6704608
M3 - Conference contribution
AN - SCOPUS:84894158868
SN - 9781467363617
T3 - ISPACS 2013 - 2013 International Symposium on Intelligent Signal Processing and Communication Systems
SP - 531
EP - 536
BT - ISPACS 2013 - 2013 International Symposium on Intelligent Signal Processing and Communication Systems
T2 - 2013 21st International Symposium on Intelligent Signal Processing and Communication Systems, ISPACS 2013
Y2 - 12 November 2013 through 15 November 2013
ER -