TY - GEN
T1 - Bayesian feature enhancement using a mixture of unscented transformations for uncertainty decoding of noisy speech
AU - Shinohara, Yusuke
AU - Akamine, Masami
PY - 2009
Y1 - 2009
N2 - A new parameter estimation method for the Model-Based Feature Enhancement (MBFE) is presented. The conventional MBFE uses the vector Taylor series to calculate the parameters of non-linearly transformed distributions, though the linearization leads to a degraded performance. We use the unscented transformation to estimate the parameters, where a minimal number of samples propagated through the nonlinear transformation are used. By avoiding the linearization, the parameters are estimated more accurately. Experimental results on Aurora2 show that the proposed method reduces the word error rate by 8.48% relatively, while the computational cost is just modestly higher, compared with the conventional MBFE.
AB - A new parameter estimation method for the Model-Based Feature Enhancement (MBFE) is presented. The conventional MBFE uses the vector Taylor series to calculate the parameters of non-linearly transformed distributions, though the linearization leads to a degraded performance. We use the unscented transformation to estimate the parameters, where a minimal number of samples propagated through the nonlinear transformation are used. By avoiding the linearization, the parameters are estimated more accurately. Experimental results on Aurora2 show that the proposed method reduces the word error rate by 8.48% relatively, while the computational cost is just modestly higher, compared with the conventional MBFE.
KW - Feature enhancement
KW - Noisy speech recognition
KW - Uncertainty decoding
KW - Unscented transformation
KW - Vector Taylor series
UR - http://www.scopus.com/inward/record.url?scp=70349206345&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=70349206345&partnerID=8YFLogxK
U2 - 10.1109/ICASSP.2009.4960647
DO - 10.1109/ICASSP.2009.4960647
M3 - Conference contribution
AN - SCOPUS:70349206345
SN - 9781424423545
T3 - ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings
SP - 4569
EP - 4572
BT - 2009 IEEE International Conference on Acoustics, Speech, and Signal Processing - Proceedings, ICASSP 2009
T2 - 2009 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2009
Y2 - 19 April 2009 through 24 April 2009
ER -