Application of improved U-Net network with attention mechanism in end-to-end speech enhancement



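Cite this article:	Wu Ruiqin, Chen Xueqin, Yu Jie, Wang Lirong, Zhao Heming. Application of improved U-Net network with attention mechanism in end-to-end speech enhancement[J]. Acta Acustica (声学学报), 2022, 47(2): 266-275.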

Authors:	Wu Ruiqin (武瑞沁), Chen Xueqin (陈雪勤), Yu Jie (俞杰), Wang Lirong (王丽荣), Zhao Heming (赵鹤鸣)

Affiliation:	School of Electronic and Information Engineering, Soochow University, Suzhou 215006

Funding:	Supported by the National Natural Science Foundation of China (61340004).

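Abstract:	An improved U-Net model, the Attention Dilated Convolution U-Net (ADC-U-Net), is designed for end-to-end speech enhancement. Compared with the baseline U-Net, it adds dilated convolution to reduce the information loss caused by sampling, and it introduces an attention mechanism that draws on more of the context of the noisy speech to extract deeper and richer features. Unlike traditional speech enhancement methods, the proposed model does not need the three steps of feature extraction, feature denoising, and speech reconstruction; it avoids dependence on explicit features and instead lets the network obtain implicit features through multi-level, multi-scale learning. The quality and intelligibility of the enhanced speech are evaluated with several subjective and objective metrics. The experimental data show that the proposed algorithm performs well in both noise suppression and adaptability to noise, and that it achieves good speech quality and intelligibility compared with the baseline U-Net and other models.
Keywords:	speech enhancement; end-to-end; U-Net network; dilated convolution; attention mechanism
Received:	2020-12-03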
Application of improved U-Net network with attention mechanism in end-to-end speech enhancement

Institution:	School of Electronic and Information Engineering, Soochow University, Suzhou 215006

Abstract:	An improved U-Net network model (Attention Dilated Convolution U-Net, ADC-U-Net) for end-to-end speech enhancement is designed. Compared with the baseline U-Net network, on the one hand, the information loss caused by sampling is reduced by adding dilated convolution. On the other hand, an attention mechanism is introduced, which combines more contextual information from the noisy speech to extract deeper and richer feature information. Compared with traditional speech enhancement methods, the proposed model does not need the three steps of feature extraction, feature denoising, and speech reconstruction, and so avoids dependence on explicit features. Instead, the network model obtains implicit features through multi-level and multi-scale learning. The quality and intelligibility of the enhanced speech are evaluated with several subjective and objective metrics. Experimental data show that the proposed algorithm performs well in noise suppression and in adaptability to noise. Compared with the baseline U-Net network and other models, the proposed algorithm demonstrates good speech quality and intelligibility.

Keywords:	speech enhancement; end-to-end; U-Net network; dilated convolution; attention mechanism
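
The abstract credits dilated convolution with limiting the information lost to sampling. As a rough illustration only, and not the authors' ADC-U-Net code, the following minimal sketch shows how a dilated 1-D convolution widens the span of input samples each output sees without any downsampling. The function names `dilated_conv1d` and `receptive_field`, the kernel values, and the dilation rates are all assumptions chosen for the example; the abstract does not specify the layer configuration.

```python
def dilated_conv1d(signal, kernel, dilation=1):
    """Valid-mode 1-D dilated convolution.

    Computed as cross-correlation (no kernel flip), the convention used by
    deep-learning libraries. Kernel taps are spaced `dilation` samples apart.
    """
    # Number of input samples covered by one output sample.
    span = (len(kernel) - 1) * dilation + 1
    out = []
    for start in range(len(signal) - span + 1):
        acc = 0.0
        for k, w in enumerate(kernel):
            acc += w * signal[start + k * dilation]
        out.append(acc)
    return out


def receptive_field(kernel_size, dilations):
    """Receptive field of stacked stride-1 dilated convolution layers."""
    rf = 1
    for d in dilations:
        rf += (kernel_size - 1) * d
    return rf


if __name__ == "__main__":
    # Hypothetical input: a short ramp standing in for waveform samples.
    x = [float(i) for i in range(10)]
    print(dilated_conv1d(x, [1.0, 0.0, -1.0], dilation=2))
    # Three stacked layers with kernel size 3 and illustrative dilations 1, 2, 4.
    print(receptive_field(3, [1, 2, 4]))
```

With kernel size 3, three stride-1 layers at dilations 1, 2, and 4 cover 15 input samples, whereas three undilated layers would cover 7; the wider context comes from spacing the taps apart, not from discarding samples.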
