Machine Lip-Reading of Japanese Vowels Utilizing A Stereoscopic Vision System

Masami Nakano, Tomio Watanabe

研究成果: Article査読

1 被引用数 (Scopus)


The lip-shape recognition of Japanese vowels utilizing a stereoscopic vision system has been performed to enhance the discrimination of five vowels by machine lip-reading. The opening angle P4 between the upper lip and the lower lip is selected as the typical three-dimensional feature parameter of lip shape, in addition to the usual feature parameters such as the width P1 and the height P2 of the lip shape and distance P3 between the tips of the upper lip and the chin. Compared with the 3 two-dimensional parameters P1, P2 and P3 used in a single-vision system, the 3 and 4 three-dimensional parameters led to the significant increase of discrimination rate, and in particular, the discrimination rate by the 4 three-dimensional parameters was more than 90% for every tested subject. Recognition for an unspecified subject was also examined, based on the selections of the variance-covariance matrix and the mean vector of feature variables, by which the Mahalanobis' generalized square distance and the maximum likelihood discrimination function were determined.

ジャーナルtransactions of the japan society of mechanical engineers series c
出版ステータスPublished - 1994

ASJC Scopus subject areas

  • 材料力学
  • 機械工学
  • 産業および生産工学


「Machine Lip-Reading of Japanese Vowels Utilizing A Stereoscopic Vision System」の研究トピックを掘り下げます。これらがまとまってユニークなフィンガープリントを構成します。