Intelligibility enhancement for noisy whispered speech using asymmetric cost function |
| |
Authors: | ZHOU Jian ;ZHENG Wenming ;WANG Qingyun ;ZHAO Li |
| |
Institution: | [1]Key Laboratory of Intelligent Computing and Signal Processing of Ministry of Education, Anhui University Hefei 230601; [2]Key Laboratory of Undemvater Acoustic Signal Processing of Ministry of Education, Southeast University Nanjing 210096; [3]Key Laboratory of Child Development and Learning Science of Ministry of Education, Southeast University Nanjing Jiangsu 210096 |
| |
Abstract: | We proposed two whispered speech enhancement methods based on asymmetric cost functions in this paper to deal with the amplification and attenuation distortions of whispered speech distinctively.The modified Itakura-Saito(MIS)distance function provides more penalties to speech amplification distortion,whereas the Kullback-Leibler(KL)divergence function gives more penalties to speech attenuation distortion.The experimental results show that the MIS function based method achieves significant improvement of intelligibility in contrast to the conventional speech enhancement algorithms when the signal-to-noise ratio(SNR)falls below-6 dB,whereas the KL function based one achieves the similar result as the minimum mean square error(MMSE)speech enhancement method.The results show that the effects of the amplification and attenuation distortions on the intelligibility of the enhanced whisper are different,where larger attenuation distortion may result in better intelligibility of speech with low SNR.However,the attenuation distortion has small effects on intelligibility of speech with high SNR. |
| |
Keywords: | asymmetric noisy attenuation estimator divergence amplification distortion minimizing absent Bayesian |
本文献已被 CNKI 维普 等数据库收录! |