Machine Lip-Reading of Japanese Vowels Utilizing A Stereoscopic Vision System

Masami Nakano, Tomio Watanabe

研究成果: Article査読

1 被引用数 (Scopus)

抄録

The lip-shape recognition of Japanese vowels utilizing a stereoscopic vision system has been performed to enhance the discrimination of five vowels by machine lip-reading. The opening angle P4 between the upper lip and the lower lip is selected as the typical three-dimensional feature parameter of lip shape, in addition to the usual feature parameters such as the width P1 and the height P2 of the lip shape and distance P3 between the tips of the upper lip and the chin. Compared with the 3 two-dimensional parameters P1, P2 and P3 used in a single-vision system, the 3 and 4 three-dimensional parameters led to the significant increase of discrimination rate, and in particular, the discrimination rate by the 4 three-dimensional parameters was more than 90% for every tested subject. Recognition for an unspecified subject was also examined, based on the selections of the variance-covariance matrix and the mean vector of feature variables, by which the Mahalanobis' generalized square distance and the maximum likelihood discrimination function were determined.

本文言語English
ページ(範囲)525-530
ページ数6
ジャーナルtransactions of the japan society of mechanical engineers series c
60
570
DOI
出版ステータスPublished - 1994
外部発表はい

ASJC Scopus subject areas

  • 材料力学
  • 機械工学
  • 産業および生産工学

フィンガープリント

「Machine Lip-Reading of Japanese Vowels Utilizing A Stereoscopic Vision System」の研究トピックを掘り下げます。これらがまとまってユニークなフィンガープリントを構成します。

引用スタイル