首页 | 本学科首页   官方微博 | 高级检索  
     检索      

基于稀疏表示和特征加权的离格双耳声源定位*
引用本文:丁建策,厉 剑,郑成诗,李晓东.基于稀疏表示和特征加权的离格双耳声源定位*[J].应用声学,2019,38(6):917-925.
作者姓名:丁建策  厉 剑  郑成诗  李晓东
作者单位:中国科学院声学研究所噪声与振动重点实验室 北京,中国科学院大学 北京;中国科学院声学研究所噪声与振动重点实验室 北京,中国科学院大学 北京;中国科学院声学研究所噪声与振动重点实验室 北京,中国科学院大学 北京;中国科学院声学研究所噪声与振动重点实验室 北京
基金项目:(61571435; 61801468)
摘    要:基于头相关传递函数数据库的传统双耳声源定位方法的定位角度往往被限定在头相关传递函数数据库的离散测量点上。当头相关传递函数数据库的测量方位角间隔较大时,这类算法的性能会显著下降,这就是典型的离格问题。该文提出了基于加权宽带稀疏贝叶斯学习的离格双耳声源定位算法。首先该算法建立离格双耳信号的稀疏表示模型,然后利用双耳相干与扩散能量比特征对各个频点进行加权以降低噪声和混响的影响,最后通过加权宽带稀疏贝叶斯学习方法估计离格声源的方位角。实验结果表明,该算法在各种复杂的声学环境下都有着较高的定位精度和鲁棒性,特别是提高了离格条件下的声源定位性能。

关 键 词:离格双耳声源定位  稀疏表示  双耳相干与扩散能量比  宽带稀疏贝叶斯学习
收稿时间:2019/3/11 0:00:00
修稿时间:2019/10/27 0:00:00

Off-grid binaural sound source localization using sparse representation and feature weighting
DING Jiance,LI Jian,ZHENG Chengshi and LI Xiaodong.Off-grid binaural sound source localization using sparse representation and feature weighting[J].Applied Acoustics,2019,38(6):917-925.
Authors:DING Jiance  LI Jian  ZHENG Chengshi and LI Xiaodong
Institution:Key Laboratory of Noise and Vibration Research,Institute of Acoustics,Chinese Academy of Sciences,University of Chinese Academy of Sciences;Key Laboratory of Noise and Vibration Research,Institute of Acoustics,Chinese Academy of Sciences,University of Chinese Academy of Sciences;Key Laboratory of Noise and Vibration Research,Institute of Acoustics,Chinese Academy of Sciences,University of Chinese Academy of Sciences;Key Laboratory of Noise and Vibration Research,Institute of Acoustics,Chinese Academy of Sciences
Abstract:Traditional binaural sound source localization(BSSL)techniques using measured head-related transfer function (HRTF) databases often suffer a typical off-grid problem, where their estimated azimuth angles are restricted at the measured azimuth angles of HRTF databases. When the interval of the measured azimuth angles is large, the performance of these techniques will degrade significantly. This paper proposes an off-grid BSSL algorithm based on weighted wideband sparse Bayesian learning. First, the algorithm establishes an off-grid sparse representation model. Then weighted values based on binaural coherent-to-diffuse power ratio (BCDR) for each frequency band are calculated to reduce the impact of noise and reverberation. Finally, a weighted wideband sparse Bayesian learning method is derived to solve the off-grid BSSL problem. Experimental results show that the proposed method can achieve higher localization accuracy and is more robust than the compared BSSL techniques in various acoustic environments, especially under the off-grid situations.
Keywords:Off-grid binaural sound source localization  Sparse representation  Coherent-to-diffuse power ratio  Wideband sparse Bayesian learning
本文献已被 CNKI 等数据库收录!
点击此处可从《应用声学》浏览原始摘要信息
点击此处可从《应用声学》下载免费的PDF全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号