首页 | 本学科首页   官方微博 | 高级检索  
     

采用扩展型双线性变换法将耳语音转换为正常语音的研究
引用本文:陶智, 赵鹤鸣, 谈雪丹, 顾济华, 张晓俊, 吴迪. 采用扩展型双线性变换法将耳语音转换为正常语音的研究[J]. 声学学报, 2012, 37(6): 651-658. DOI: 10.15949/j.cnki.0371-0025.2012.06.011
作者姓名:陶智  赵鹤鸣  谈雪丹  顾济华  张晓俊  吴迪
作者单位:1 苏州大学物理科学与技术学院 能源学院 江苏 215006;
基金项目:国家自然科学基金(61271359,61071215)、苏州市科技发展计划(SYG201001)和苏州大学捷美生物医学工程仪器联合重点实验室资助项目。
摘    要:提出了一种采用扩展型双线性变换将耳语音转换为正常语音的方法。根据耳语音在不同频段的共振峰偏移程度不同,将耳语音的频谱进行分段处理,在此基础上建立耳语音转换为正常语音的转换函数。由于耳语音在各频段相对于正常语音非线性偏移,在双线性变换函数中引入扩展因子,使其对频谱的非线性偏移与对共振峰带宽的压缩更加符合耳语音转换为正常语音的实际转换需求,有效减小了转换语音与正常语音的谱失真距离。实验结果表明,本文的转换语音在音质和可懂度上均得到了有效提高。

收稿时间:2011-09-27
修稿时间:2011-11-26

Research of conversion from whispered speech to normal speech by the extended bilinear transformation
TAO Zhi, ZHAO Heming, TAN Xuedan, GU Jihua, ZHANG Xiaojun, WU Di. Research of conversion from whispered speech to normal speech by the extended bilinear transformation[J]. ACTA ACUSTICA, 2012, 37(6): 651-658. DOI: 10.15949/j.cnki.0371-0025.2012.06.011
Authors:TAO Zhi  ZHAO Heming  TAN Xuedan  GU Jihua  ZHANG Xiaojun  WU Di
Affiliation:1 School of Physical Science and Technology & School of Energy, Soochow University Suzhou 215006;2 School of Electronics and Information Engineering, Soochow University Suzhou 215006
Abstract:One method of conversion from whispered speech to formal speech based on the extended bilinear transformation is proposed.On account of the different deviation degrees of the whisper's formants in different frequency bands,the spectrum of the whispered speech will be processed in the separate partitions of this paper.On the basis of this spectrum,we will establish a conversion function able to usefully convert whispered speech to formal speech. Because of the whisper's non-linear offset in relation to normal speech,this paper introduces an expansion factor in the bilinear transform function making it correspond more closely to the actual conversion demands of whispered speech to formal speech.The introduction of this factor takes the non-linear move of the spectrum and the compression of the formant bandwidth into consideration,thus effectively reducing the spectrum distortion distance in the conversion.The experiment results show that the conversion presented in this paper effectively improves both the sound quality and the intelligibility of whispered speech. 
Keywords:
点击此处可从《声学学报》浏览原始摘要信息
点击此处可从《声学学报》下载免费的PDF全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号