首页 | 本学科首页   官方微博 | 高级检索  
     检索      

基于离散余弦变换的语音压缩采样和编码算法
引用本文:武朋辉,杨百龙,时磊.基于离散余弦变换的语音压缩采样和编码算法[J].应用声学,2015,34(1):17-23.
作者姓名:武朋辉  杨百龙  时磊
作者单位:第二炮兵工程大学信息工程系;96424部队,第二炮兵工程大学信息工程系,第二炮兵工程大学士官学院
基金项目:军队装备科研基金资助项目
摘    要:针对语音无线通信中带宽资源受限的问题,提出基于压缩采样的低速率语音编码算法。以基尼系数为指标,比较不同稀疏变换域下语音信号的稀疏性,分析常见重构算法对语音信号压缩采样观测信号的重构特性。对标准耳蜗滤波器——伽马啁啾滤波器组的参数进行研究,并以梯度投影稀疏重建(GPSR)算法重构语音信号。利用语音质量感知评估(PESQ)、信噪比和主观听觉测试,对编解码后的合成语音信号进行了质量评估。实验表明,基于压缩感知的语音编码器以4 kbps的低速率对语音进行编码时,PESQ得分可达到3.16,计算复杂度相对较低,可以用于实际的语音编码环境。

关 键 词:低速率编码  压缩采样  基尼系数  离散余弦变换
收稿时间:2014/3/28 0:00:00
修稿时间:2014/12/25 0:00:00

Speech compressive sensing and codec algorithm based on DCT
Wu Penghui,Yang Bailong and Shi Lei.Speech compressive sensing and codec algorithm based on DCT[J].Applied Acoustics,2015,34(1):17-23.
Authors:Wu Penghui  Yang Bailong and Shi Lei
Institution:Department of Information Engineering, The Second Artillery Engineering University; Unit 96424 of PLA,Department of Information Engineering, The Second Artillery Engineering University,College of NCO, The Second Artillery Engineering University
Abstract:Due to restricted bandwidth in wireless speech communication, a new low-bit rate speech codec based on compressive sampling under discrete cosine transform is proposed. Speech sparsity under different transformations is compared. The Gini index is utilized to gage the coefficient sparsity. Before sampling, gammachirp filterbank parameters are selected in the speech frame. During reconstruction, the GPSR is used to recover the signal. Subjective and objective tests show that the proposed technique gets 3.16 PESQ mean score, and the bit-rate reaches to 4kbps. Furthermore, compared to MELP, its computation complexity is relatively low and it can be used under real applications.
Keywords:
本文献已被 CNKI 等数据库收录!
点击此处可从《应用声学》浏览原始摘要信息
点击此处可从《应用声学》下载免费的PDF全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号