Maximum likelihood subband polynomial regression for robust speech recognition |
| |
Authors: | Yong Lü Zhenyang Wu |
| |
Affiliation: | 1. College of Computer and Information Engineering, Hohai University, Nanjing 210098, China;2. School of Information Science and Engineering, Southeast University, Nanjing 210096, China |
| |
Abstract: | In this paper, we propose a model adaptation algorithm based on maximum likelihood subband polynomial regression (MLSPR) for robust speech recognition. In this algorithm, the cepstral mean vectors of prior trained hidden Markov models (HMMs) are converted to the log-spectral domain by the inverse discrete cosine transform (DCT) and each log-spectral mean vector is divided into several subband vectors. The relationship between the training and testing subband vectors is approximated by a polynomial function. The polynomial coefficients are estimated from adaptation data using the expectation–maximization (EM) algorithm under the maximum likelihood (ML) criterion. The experimental results show that the proposed MLSPR algorithm is superior to both the maximum likelihood linear regression (MLLR) adaptation and maximum likelihood subband weighting (MLSW) approach. In the MLSPR adaptation, only a very small amount of adaptation data is required and therefore it is more useful for fast model adaptation. |
| |
Keywords: | |
本文献已被 ScienceDirect 等数据库收录! |
|