Non-monologue HMM-based speech synthesis for service robots: A cloud robotics approach

Komei Sugiura, Yoshinori Shiga, Hisashi Kawai, Teruhisa Misu, Chiori Hori

研究成果: Conference article査読

12 被引用数 (Scopus)


Robot utterances generally sound monotonous, unnatural, and unfriendly because their Text-to-Speech (TTS) systems are not optimized for communication but for text-reading. Here we present a non-monologue speech synthesis for robots. We collected a speech corpus in a non-monologue style in which two professional voice talents read scripted dialogues. Hidden Markov models (HMMs) were then trained with the corpus and used for speech synthesis. We conducted experiments in which the proposed method was evaluated by 24 subjects in three scenarios: text-reading, dialogue, and domestic service robot (DSR) scenarios. In the DSR scenario, we used a physical robot and compared our proposed method with a baseline method using the standard Mean Opinion Score (MOS) criterion. Our experimental results showed that our proposed method's performance was (1) at the same level as the baseline method in the text-reading scenario and (2) exceeded it in the DSR scenario. We deployed our proposed system as a cloud-based speech synthesis service so that it can be used without any cost.

ジャーナルProceedings - IEEE International Conference on Robotics and Automation
出版ステータスPublished - 2014 9 22
イベント2014 IEEE International Conference on Robotics and Automation, ICRA 2014 - Hong Kong, China
継続期間: 2014 5 312014 6 7

ASJC Scopus subject areas

  • ソフトウェア
  • 制御およびシステム工学
  • 人工知能
  • 電子工学および電気工学


「Non-monologue HMM-based speech synthesis for service robots: A cloud robotics approach」の研究トピックを掘り下げます。これらがまとまってユニークなフィンガープリントを構成します。