采用L<sub>1/2</sub>稀疏约束的梅尔倒谱系数语音重建方法 Speech reconstruction from Mel-frequency cepstral coefficients via L1/2 sparse constraint期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

按检索

采用L_1/2稀疏约束的梅尔倒谱系数语音重建方法

引用本文：	周健,刘荣敏,窦云峰,路成,陶亮.采用L_1/2稀疏约束的梅尔倒谱系数语音重建方法[J].声学学报,2018,43(6):991-999.

作者姓名：	周健刘荣敏窦云峰路成陶亮

作者单位：	1. 安徽大学计算智能与信号处理教育部重点实验室合肥 230039;

基金项目：	安徽省自然科学基金项目(1708085MF151)安徽省高校自然科学研究项目(KJ2018A0018)资助国家自然科学基金项目(61301295,61371217)

摘要：	提出了一种利用L_1/2稀疏约束从梅尔倒谱系数重建语音时域信号方法。从梅尔倒谱系数估计语音幅度谱是一个欠定问题,现有的方法均采用幅度谱最小均方误差估计或采用L1正则化进行幅度谱的稀疏约束。相比于L₁正则化模型,L_1/2的稀疏约束特性更强,为此,本文在从梅尔倒谱系数估计语音幅度谱时引入L_1/2正则化约束,并利用求解的稀疏幅度谱估计相位谱,最后利用估计的频谱重建时域语音信号。实验结果表明,与幅度谱最小均方误差法相比,本文算法所估计出的语音信号具有更高的语音质量;在噪声环境下进行语音重建实验,与L₁正则化幅度谱估计方法相比,本文算法重建的语音质量更好,表现出更好抗噪性。
关键词：	MFCC 语音重构稀疏约束 L1/2正则化K
收稿时间：	2017-06-26
Speech reconstruction from Mel-frequency cepstral coefficients via L1/2 sparse constraint

Institution:	1. Key Laboratory of Intelligent Computing and Signal Processing of Ministry of Education, Anhui University Hefei 230039;2. Institute of Media Computing, Anhui University Hefei 230601;3. School of Information Science and Engineering, Southeast University Nanjing 21009

Abstract:	Reconstruction the time domain speech signal from Mel-frequency cepstral coefficients (MFCCs) based on L1/2 sparse constraint is proposed. Since estimating the speech amplitude spectrum from MFCCs is an underdetermined problem, existing methods usually adopt either minimum mean square error minimization of the amplitude spectrum or the L₁ regularization based sparse constraint to estimate the amplitude spectrum. Compared to the L₁ regularization, the L_1/2 regularization has stronger ability to obtain the sparse components of a speech signal. Thus, we use L_1/2 regularization constraint when estimating amplitude spectrum from MFCCs in the proposed method. The phase spectrum is estimated from the estimated sparse amplitude spectrum. Finally the time domain speech signal is reconstructed from the estimated spectrum. Experimental results show that the speech signal reconstructed by the proposed method gains higher speech quality than that by the minimum mean square error method. Specifically, the proposed method outperforms the L₁ regularization method in the aspect of speech quality under the noise environment, indicating noise robustness of the proposed method.

Keywords:
本文献已被 CNKI 等数据库收录！
	点击此处可从《声学学报》浏览原始摘要信息
	点击此处可从《声学学报》下载免费的PDF全文

设为首页 | 免责声明 | 关于勤云 | 加入收藏