首页 | 本学科首页   官方微博 | 高级检索  
     检索      

Speech enhancement based on multitaper spectrum and psychoacoustical weighting rule
作者姓名:WU  Hongwei  WU  Zhenyang  ZHAO  Li
作者单位:[1]College of Information Science and Engineering, Southeast University Nanjing 210096 [2]School of Electronics and Information, Suzhou University Suzhou 215021
基金项目:国家重点基础研究发展计划(973计划);国家自然科学基金;苏州大学校科研和教改项目
摘    要:Multitaper spectrum has lower variance than the traditional periodogram. The noise spectrum and the noise to noisy signal spectrum ratio (NNSR) were estimated from the multitaper spectrum of the noisy signal; the pre-enhanced speech for calculating the noise masking threshold was obtained by the spectral amplitude subtraction method, whose gain is a function of NNSR; the final enhanced speech was obtained by suppressing the Fourier spectrum of the noisy speech with the psychoacoustical weighting rule incorporating the noise masking threshold. Because of the low variance feature of the multitaper spectrum, a modified offset formula was proposed to calculate the noise masking threshold, thus the reconstructed speech with this modification has an improvement in MBSD (Modified Bark Spectral Distortion). When a maximum limitation less than one to the psychoacoustical weighting rule is further proposed, the higher the input SNR (> 0 dB) is, the more improvement the segmental SNR and the overall SNR have. The informal listening tests show that there is little speech distortion for the enhanced speech processed by the proposed method, the background noise is reduced much and free of musical noise.

关 键 词:多带光谱  心理声学  语音  评价指标
修稿时间:2006-11-04

Speech enhancement based on multitaper spectrum and psychoacoustical weighting rule
Authors:WU Hongwei  WU Zhenyang  ZHAO Li
Abstract:
Keywords:
本文献已被 CNKI 维普 万方数据 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号