考虑帧间信息的语音带宽扩展 Speech bandwidth extension supported by temporal information期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

按检索

考虑帧间信息的语音带宽扩展

引用本文：	王迎雪,赵胜辉,匡镜明.考虑帧间信息的语音带宽扩展[J].声学学报,2017,42(3):370-376.

作者姓名：	王迎雪赵胜辉匡镜明

作者单位：	1. 北京理工大学信息与电子学院北京 100081;

摘要：	语音带宽扩展是为了提高语音质量,利用语音低频和高频之间的相关性重构语音高频的一种技术。高斯混合模型法是语音带宽技术中被广泛应用的一种方法,但是,该方法的映射函数是分段线性函数,且没有考虑语音前后帧的相关信息。因此,提出了一种基于条件受限玻尔兹曼机的方法。该方法利用条件受限玻尔兹曼机提取了语音信号的帧间信息,同时将语音低频、高频特征参数映射为高阶统计特性,深层发掘和模拟了语音低频和高频之间的非线性关系。客观和主观对比测试结果都表明,该方法性能优于传统的高斯混合模型方法。
关键词：	语音带宽扩展条件受限玻尔兹曼机帧间信息高斯混合模型
收稿时间：	2015-10-10
Speech bandwidth extension supported by temporal information

Institution:	1. School of Information and Electronic, Beijing Institute of Technology Beijing 100081;2. School of Computer Science, Carnegie Mellon University Pittsburgh 15213 US

Abstract:	Speech Bandwidth Extension (BWE) aims to improve the quality of speech by reconstructing the missing High Frequency (HF) components using the correlation that exists between the Low Frequency (LF) and HF of speech. The Gaussian Mixture Model (GMM) based methods are widely used. However, the derived mapping function by GMM is a piece-wise linear transformation and ignores the temporal information of speech. Thus, a novel BWE method is proposed for estimation of the HF parts of speech by exploiting Conditional Restricted Boltzmann Machines (CRBM). The proposed method introduces CRBM to obtain time information and model deep non-linear relationships between the spectral envelope features of LF and HF by building high-order eigen spaces between the LF and HF of the speech signal. The objective and subjective test results show that the proposed method outperforms the conventional GMM based method.

Keywords:
本文献已被 CNKI 等数据库收录！
	点击此处可从《声学学报》浏览原始摘要信息
	点击此处可从《声学学报》下载免费的PDF全文

设为首页 | 免责声明 | 关于勤云 | 加入收藏