TY - JOUR
T1 - Learning lexicons from spoken utterances based on statistical model selection
AU - Taguchi, Ryo
AU - Iwahashi, Naoto
AU - Nose, Takashi
AU - Funakoshi, Kotaro
AU - Nakano, Mikio
PY - 2009/11/26
Y1 - 2009/11/26
N2 - This paper proposes a method for the unsupervised learning of lexicons from pairs of a spoken utterance and an object as its meaning without any a priori linguistic knowledge other than a phoneme acoustic model. In order to obtain a lexicon, a statistical model of the joint probability of a spoken utterance and an object is learned based on the minimum description length principle. This model consists of a list of word phoneme sequences and three statistical models: the phoneme acoustic model, a word-bigram model, and a word meaning model. Experimental results show that the method can acquire acoustically, grammatically and semantically appropriate words with about 85% phoneme accuracy.
AB - This paper proposes a method for the unsupervised learning of lexicons from pairs of a spoken utterance and an object as its meaning without any a priori linguistic knowledge other than a phoneme acoustic model. In order to obtain a lexicon, a statistical model of the joint probability of a spoken utterance and an object is learned based on the minimum description length principle. This model consists of a list of word phoneme sequences and three statistical models: the phoneme acoustic model, a word-bigram model, and a word meaning model. Experimental results show that the method can acquire acoustically, grammatically and semantically appropriate words with about 85% phoneme accuracy.
KW - Language acquisition
KW - Lexical learning
KW - Model selection
UR - http://www.scopus.com/inward/record.url?scp=70450210026&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=70450210026&partnerID=8YFLogxK
M3 - Conference article
AN - SCOPUS:70450210026
SP - 2731
EP - 2734
JO - Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH
JF - Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH
SN - 2308-457X
T2 - 10th Annual Conference of the International Speech Communication Association, INTERSPEECH 2009
Y2 - 6 September 2009 through 10 September 2009
ER -