期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

1.

王玫刘小娟宋浠瑜李江和仇洪冰《应用声学》2024,43(2):367-377

针对混响环境中,多径效应、散射、衍射等原因导致声源定位失败或分辨能力不足的现象,提出一种基于主导声源检测MUSIC群时延的邻近多声源定位方法。该方法采用球形传声器阵列,相比平面阵列可以捕获3D声场信息,利用球谐域下信号的频率分量与角度分量解耦的优势,从而可直接利用频率平滑技术处理宽带语声信号而不需要构造聚焦矩阵,并在球谐域下通过设置阈值对一组时频段进行主导声源检测,从而选择出包含直达声的一组时频块来构造MUSIC群时延空间谱。上述举措在提升波达方向估计在高混响环境下定位鲁棒性的同时,也提高了多个邻近声源的分辨能力。仿真实验结果表明,所提出的主导声源检测MUSIC群时延算法,在高混响和低信噪比条件下,仍具有更好的定位精度与更优的邻近多声源分辨效果。相似文献

2.

球形阵列自适应网格分层定位算法

下载免费PDF全文

李睿林陈志菲吴鸣赵静程利军鲍明樊文欣《应用声学》2023,42(2):292-303

刚性球形传声器阵列可以在无空间模糊的条件下进行球谐域数据处理,基于球形阵列的可控波束形成器导向响应功率(SRP)算法定位精度高,但是计算量大,计算效率低。通过将球面致密的全局网格搜索替换为分层搜索策略可以有效减小SRP算法的计算量。提出MRE-SRP算法保持球形阵列SRP定位精度的同时降低计算量,首先通过球谐域MUSIC(SH-MUSIC)算法判断入射声源的数量减小搜索区域;其次将相对熵模型引入球谐域SRP(SH-SRP)定位算法中,提取网格分层前后的信息增益,设计自适应网格选择判据,实现分层多分辨率网格的精准再细分,从而降低计算量。实验验证了所提出算法的性能,结果显示在单双声源定位中,该算法可以实现较高的定位精度,精准选择分层网格,计算量减少75%以上。相似文献

3.

偏度最大化多通道逆滤波语音去混响研究* 总被引：1，自引：1，他引：0

下载免费PDF全文

郭颖彭任华郑成诗李晓东《应用声学》2019,38(1):58-67

房间混响会降低语音质量和语音可懂度。高阶统计量是衡量非高斯性的重要参量,基于语音非高斯特性可实现语音去混响。本文提出一种基于高阶统计量的多通道语音去混响方法,该方法首次用多通道语音信号线性预测残差的三阶统计量偏度（Skewness）构造代价函数,以去混响重建信号线性预测残差的偏度最大化为目标自适应地更新逆滤波器;同时结合语音信号的产生模型,提出基于偏度准则的线性预测与房间脉冲响应逆滤波联合估计方法,进一步提高去混响算法性能。实验结果表明,该方法相较于已有的基于线性预测残差四阶统计量峰度（Kurtosis）的方法具有更好的去混响效果,且对噪声具有更强的鲁棒性。相似文献

4.

基于卡尔曼滤波的低复杂度去混响算法* 总被引：1，自引：1，他引：0

下载免费PDF全文

齐园蕾杨飞然杨军《应用声学》2018,37(4):559-566

在电话会议、智能音箱等应用场景下,传声器往往处在声源的远场。混响信号的存在会掩蔽后续到达的直达声信号,降低传声器接收信号的语音质量,以及语音识别系统的准确识别率。多通道线性预测算法是一种经典的盲去混响算法,但该算法往往具有较高的计算复杂度。本文提出了一种简化的卡尔曼滤波更新算法,通过对角化卡尔曼滤波器状态向量误差协方差矩阵,降低了自适应多通道线性预测去混响算法的复杂度。通过与现有分块对角简化算法对比发现,本文提出的简化算法在保证语音质量的同时,进一步降低了原卡尔曼滤波算法的复杂度。相似文献

5.

基于稀疏表示和特征加权的离格双耳声源定位* 总被引：1，自引：1，他引：0

下载免费PDF全文

丁建策厉剑郑成诗李晓东《应用声学》2019,38(6):917-925

基于头相关传递函数数据库的传统双耳声源定位方法的定位角度往往被限定在头相关传递函数数据库的离散测量点上。当头相关传递函数数据库的测量方位角间隔较大时,这类算法的性能会显著下降,这就是典型的离格问题。该文提出了基于加权宽带稀疏贝叶斯学习的离格双耳声源定位算法。首先该算法建立离格双耳信号的稀疏表示模型,然后利用双耳相干与扩散能量比特征对各个频点进行加权以降低噪声和混响的影响,最后通过加权宽带稀疏贝叶斯学习方法估计离格声源的方位角。实验结果表明,该算法在各种复杂的声学环境下都有着较高的定位精度和鲁棒性,特别是提高了离格条件下的声源定位性能。相似文献

6.

一种利用分布式传声器阵列的声源三维定位方法 总被引：3，自引：0，他引：3

下载免费PDF全文

柯炜张铭张铁成《声学学报》2017,42(3):361-369

为了提高噪声和混响条件下分布式传声器阵列进行声源定位的性能,提出一种利用空间稀疏性和压缩感知原理的声源三维定位方法。该方法首先通过两次离散余弦变换方式提取出声音信号特征,并用该特征来构建稀疏定位模型,以便能够综合利用语音信号的短时和长时特性,同时降低模型维数;然后利用在线字典学习技术动态调整字典,克服稀疏模型与实际信号之间的失配问题,增强稀疏定位模型的鲁棒性;进而提出一种改进的平滑l₀范数稀疏重构算法来进行声源位置解算,以提高低信噪比条件下的重构精度。仿真结果表明该方法不仅可以实现多目标定位,而且具有较强的抗噪声和抗混响能力. 相似文献

7.

仿人耳听觉的助听器双耳声源定位算法

《声学学报：英文版》2015,(3)

为提高复杂场景下的听障患者的语言理解度,本文提出一种仿人耳听觉的助听器双耳声源定位算法。算法首先借鉴耳蜗分频特性和听觉掩蔽特性,将声音信号进行多通道分解,并提取人耳敏感频带的信号进行双耳时间差(Interaural Time Difference,ITD)估计;然后基于人耳哈斯效应,提取有效的ITD信息;最后采用头相关模型,将ITD转化为声源方向信息。同时,为了改善混响和多干扰声场景下的声源定位能力,本文提出一种多通道的加权联合策略。仿真和场景测试实验表明,算法的抗干扰性强,定位精度高。而且,在7名受试者的理解度测试中,同现有的助听器增强算法相比,结合定位算法的语音增强算法达到3~5 dB的性能改善。相似文献

8.

应对说话人位置突变的鲁棒语声去混响*

下载免费PDF全文

吴礼福孙帅恒王雷孙芯年《应用声学》2022,41(6):851-859

多通道线性预测算法是一种有效的语声去混响方法,但若房间内说话人位置发生变化,房间冲激响应也会随之改变,从而导致去混响效果变差。本文研究了基于递归最小二乘算法的自适应多通道线性预测语声去混响方法,提出了一种去混响过程中检测说话人位置突变的方法,它借助已有的多通道传声器信号,并未明显增加系统开销。实验表明该方法可以有效检测说话人位置的突变,并通过重置滤波器改善算法的收敛速度和稳定性。相似文献

9.

采用球谐分解的二范数广义逆波束形成

下载免费PDF全文

高玥卢铃吴鸣杨军曹寅《应用声学》2022,41(1):12-20

基于传声器阵列的声成像技术是解决噪声源识别的有效途径之一.该文提出了一种基于球谐分解的L2范数广义逆波束形成算法,并对此算法在分布式球形阵列布放方案下进行了定位精度及鲁棒性的对比分析研究.仿真结果显示,此算法对低频相干声源具有较高的空间定位精确度,且阵元位置误差对此算法性能的影响有限.通过在半消声室进行实验进一步证明了... 相似文献

10.

基于改进最小方差无失真响应角度谱算法的气体泄漏定位方法

下载免费PDF全文

王天琪杨祥国晏园刘成龙张金磊《应用声学》2024,43(1):110-118

针对噪声环境下微小气体泄漏难以准确定位的问题,提出了一种基于改进最小方差无失真响应角度谱算法的气体泄漏定位方法。该算法通过引入信噪比追踪加权的方式,提取受噪声影响较小且单个声源能量占优的时频支撑域,并通过Softplus激活函数自适应地调整不同频率分量对角度谱函数的贡献,增加泄漏声源占优的时频域权重;此外,引入基于时频稀疏性的分频带处理,使各子频带内存在一个主导声源能量占优,抑制低频段噪声能量的积累同时避免高频混叠现象。通过软件仿真计算以及实验验证算法的性能,结果表明改进最小方差无失真响应角度谱算法可以实现气体泄漏源的精准定位,定位结果的最大误差在3.5°以内。相比传统算法,该方法在低信噪比和低采样点数下有更高的稳定性、抗噪能力及准确率,可为气体泄漏定位的实际应用提供一定的参考价值。相似文献

11.

Speech dereverberation method based on spectral subtraction and spectral line enhancement

Zhe Chen Rui WangFuliang Yin Bingqian WangWenwen Peng 《Applied Acoustics》2016

Speech signals recorded with a distant microphone usually are interfered by the spatial reverberation in the room, which severely degrades the clarity and intelligibility of speech. A speech dereverberation method based on spectral subtraction and spectral line enhancement is proposed in this paper. Following the generalized statistical reverberation model, the power spectrum of late reverberation is estimated and removed from the reverberation speech by the spectral subtraction method. Then, according to the human auditory model, a spectral line enhancement technique based on adaptive post-filtering is adopted to further eliminate the reverberant components between adjacent speech formants. The proposed method can effectively suppress the spatial reverberation and improve the auditory perception of speech. The subjective and objective evaluation results reveal that the perceptual quality of speech is greatly improved by the proposed method. 相似文献

12.

Single-channel dereverberation using a non-causal minimum variance distortionless response filter

MS Song HG Kang 《The Journal of the Acoustical Society of America》2012,132(1):EL29-EL35

This letter presents a single-channel speech dereverberation approach using a non-causal minimum variance distortionless response (MVDR) filter. The non-causal filter is adopted to utilize the additional information of the desired signal that lies in subsequent frames. Note that the desired signal output has minimal distortion due to the introduction of the MVDR criterion. The proposed system further suppresses the late reverberation by employing a statistical reverberant model. Experimental results demonstrate the superiority of the proposed algorithm to conventional approaches. 相似文献

13.

分布式无线声传感网加权预测误差语声去混响方法

下载免费PDF全文

韩哲郑成诗柯雨璇李晓东《应用声学》2022,41(1):21-31

室内混响会严重降低语声质量,因此在室内语声通信中对混响的抑制显得尤为重要.针对无线声传感网,该文提出一种基于加权预测误差的分布式自适应去混响算法.通过调整传统递归最小二乘算法,所提出的分布式加权预测误差算法仅需利用一路相同的参考信号和其他节点的本地输出而非全部信号,便可实现最优输出,从而大幅度降低节点间传输的通道数与各... 相似文献

14.

使用同心多环阵提升声源定位鲁棒性*

下载免费PDF全文

张国昌吴鸣韩欣宇武帅兵殷兰杨军《应用声学》2019,38(6):907-916

房间混响带来的多径失真是影响声源波达方向估计精度的主要因素之一。应用于环形阵列的相干信号子空间方法可以降低相干反射声带来的不利影响。该方法对环形阵进行谐波展开,并利用环谐波域导向矢量的频率无关特性聚焦各频率下的空间相关矩阵。但单环阵列存在展开系数零点,这会导致严重的噪声放大而降低定位的鲁棒性。该文提出了一种环谐波域的最小模同心多环阵,来缓解系数零点处的噪声放大问题。设计了一套麦克风阵列系统,用于评估同心多环阵列定位的鲁棒性。仿真和实验结果均表明:与相同孔径和阵元数的单环阵相比,使用最小模准则设计的同心多环阵列可以显著提升混响场景下的声源定位的稳健性。相似文献

15.

Binaural segregation in multisource reverberant environments

Roman N Srinivasan S Wang D 《The Journal of the Acoustical Society of America》2006,120(6):4040-4051

In a natural environment, speech signals are degraded by both reverberation and concurrent noise sources. While human listening is robust under these conditions using only two ears, current two-microphone algorithms perform poorly. The psychological process of figure-ground segregation suggests that the target signal is perceived as a foreground while the remaining stimuli are perceived as a background. Accordingly, the goal is to estimate an ideal time-frequency (T-F) binary mask, which selects the target if it is stronger than the interference in a local T-F unit. In this paper, a binaural segregation system that extracts the reverberant target signal from multisource reverberant mixtures by utilizing only the location information of target source is proposed. The proposed system combines target cancellation through adaptive filtering and a binary decision rule to estimate the ideal T-F binary mask. The main observation in this work is that the target attenuation in a T-F unit resulting from adaptive filtering is correlated with the relative strength of target to mixture. A comprehensive evaluation shows that the proposed system results in large SNR gains. In addition, comparisons using SNR as well as automatic speech recognition measures show that this system outperforms standard two-microphone beamforming approaches and a recent binaural processor. 相似文献

16.

Evaluation of an adaptive beamforming method for hearing aids. 总被引：3，自引：0，他引：3

J E Greenberg P M Zurek 《The Journal of the Acoustical Society of America》1992,91(3):1662-1676

In this paper evaluations of a two-microphone adaptive beamforming system for hearing aids are presented. The system, based on the constrained adaptive beamformer described by Griffiths and Jim [IEEE Trans. Antennas Propag. AP-30, 27-34 (1982)], adapts to preserve target signals from straight ahead and to minimize jammer signals arriving from other directions. Modifications of the basic Griffiths-Jim algorithm are proposed to alleviate problems of target cancellation and misadjustment that arise in the presence of strong target signals. The evaluations employ both computer simulations and a real-time hardware implementation and are restricted to the case of a single jammer. Performance is measured by the spectrally weighted gain in the target-to-jammer ratio in the steady state. Results show that in environments with relatively little reverberation: (1) the modifications allow good performance even with misaligned arrays and high input target-to-jammer ratios; and (2) performance is better with a broadside array with 7-cm spacing between microphones than with a 26-cm broadside or a 7-cm endfire configuration. Performance degrades in reverberant environments; at the critical distance of a room, improvement with a practical system is limited to a few dB. 相似文献

17.

Pitch-based monaural segregation of reverberant speech

Roman N Wang D 《The Journal of the Acoustical Society of America》2006,120(1):458-469

In everyday listening, both background noise and reverberation degrade the speech signal. Psychoacoustic evidence suggests that human speech perception under reverberant conditions relies mostly on monaural processing. While speech segregation based on periodicity has achieved considerable progress in handling additive noise, little research in monaural segregation has been devoted to reverberant scenarios. Reverberation smears the harmonic structure of speech signals, and our evaluations using a pitch-based segregation algorithm show that an increase in the room reverberation time causes degraded performance due to weakened periodicity in the target signal. We propose a two-stage monaural separation system that combines the inverse filtering of the room impulse response corresponding to target location and a pitch-based speech segregation method. As a result of the first stage, the harmonicity of a signal arriving from target direction is partially restored while signals arriving from other directions are further smeared, and this leads to improved segregation. A systematic evaluation of the system shows that the proposed system results in considerable signal-to-noise ratio gains across different conditions. Potential applications of this system include robust automatic speech recognition and hearing aid design. 相似文献