Maximum likelihood subband polynomial regression for robust speech recognition期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

Maximum likelihood subband polynomial regression for robust speech recognition

Authors:	Yong Lü Zhenyang Wu

Affiliation:	1. College of Computer and Information Engineering, Hohai University, Nanjing 210098, China;2. School of Information Science and Engineering, Southeast University, Nanjing 210096, China

Abstract:	In this paper, we propose a model adaptation algorithm based on maximum likelihood subband polynomial regression (MLSPR) for robust speech recognition. In this algorithm, the cepstral mean vectors of prior trained hidden Markov models (HMMs) are converted to the log-spectral domain by the inverse discrete cosine transform (DCT) and each log-spectral mean vector is divided into several subband vectors. The relationship between the training and testing subband vectors is approximated by a polynomial function. The polynomial coefficients are estimated from adaptation data using the expectation–maximization (EM) algorithm under the maximum likelihood (ML) criterion. The experimental results show that the proposed MLSPR algorithm is superior to both the maximum likelihood linear regression (MLLR) adaptation and maximum likelihood subband weighting (MLSW) approach. In the MLSPR adaptation, only a very small amount of adaptation data is required and therefore it is more useful for fast model adaptation.

Keywords:
本文献已被 ScienceDirect 等数据库收录！