Intelligibility enhancement for noisy whispered speech using asymmetric cost function
Authors: ZHOU Jian; ZHENG Wenming; WANG Qingyun; ZHAO Li
Institution: [1] Key Laboratory of Intelligent Computing and Signal Processing of Ministry of Education, Anhui University, Hefei 230601; [2] Key Laboratory of Underwater Acoustic Signal Processing of Ministry of Education, Southeast University, Nanjing 210096; [3] Key Laboratory of Child Development and Learning Science of Ministry of Education, Southeast University, Nanjing, Jiangsu 210096
Abstract: We propose two whispered speech enhancement methods based on asymmetric cost functions, which treat the amplification and attenuation distortions of whispered speech differently. The modified Itakura-Saito (MIS) distance function penalizes speech amplification distortion more heavily, whereas the Kullback-Leibler (KL) divergence function penalizes speech attenuation distortion more heavily. Experimental results show that the MIS-based method significantly improves intelligibility over conventional speech enhancement algorithms when the signal-to-noise ratio (SNR) falls below -6 dB, whereas the KL-based method performs similarly to the minimum mean square error (MMSE) speech enhancement method. The results indicate that amplification and attenuation distortions affect the intelligibility of enhanced whispered speech differently: larger attenuation distortion may yield better intelligibility at low SNR, whereas attenuation distortion has little effect on intelligibility at high SNR.
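To make the notion of an asymmetric cost concrete, the following is a minimal sketch and not the estimators derived in the paper. It assumes two illustrative cost forms: an exponential-linear cost on the error `est - true` (a stand-in for an MIS-style cost that penalizes amplification more) and the generalized KL divergence between the true and estimated magnitudes (which penalizes attenuation more for an error of equal absolute size). The function names and the exact formulas are assumptions chosen for illustration; the abstract does not state the paper's definitions.

```python
import math


def mis_like_cost(true_mag: float, est_mag: float) -> float:
    """Illustrative exponential-linear cost on e = est - true (assumed form).

    exp(e) - e - 1 is zero at e = 0 and grows faster for e > 0
    (overestimation, i.e. amplification) than for e < 0.
    """
    e = est_mag - true_mag
    return math.exp(e) - e - 1.0


def kl_cost(true_mag: float, est_mag: float) -> float:
    """Generalized KL divergence d(true || est) for positive magnitudes.

    For errors of equal absolute size, underestimation (attenuation)
    costs more than overestimation.
    """
    return true_mag * math.log(true_mag / est_mag) - true_mag + est_mag


if __name__ == "__main__":
    x = 1.0          # hypothetical true spectral magnitude
    for est in (0.5, 1.5):   # attenuated vs. amplified estimate
        kind = "attenuated" if est < x else "amplified"
        print(f"{kind:>10}: MIS-like={mis_like_cost(x, est):.4f}  "
              f"KL={kl_cost(x, est):.4f}")
```

Running the script compares an attenuated and an amplified estimate with the same absolute error: under these assumed forms the MIS-like cost is larger for the amplified estimate and the KL cost is larger for the attenuated one, which matches the direction of asymmetry the abstract describes.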
Keywords: asymmetric; noisy; attenuation; estimator; divergence; amplification; distortion; minimizing; absent; Bayesian
This article is indexed in CNKI, VIP, and other databases.