期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

1.

An improved post-filter of acoustic echo canceller based on subband implementation

Kai Chen Pu-yang Xu Jing Lu Bo-ling Xu 《Applied Acoustics》2009,70(6):886-893

In this paper, an effective post-filter structure for subband-based acoustic echo cancellation (SAEC) is proposed. Compared with the current subband-based post-filters, the proposed one can more consistently suppress the background noise, as well as the residual echoes. To reduce the effects of the distortion of the near-end speech, two alternative modified versions of the post-filter are proposed, which guarantees the quality of speech communications. The proposed post-filters are seamlessly combined with the subband-based AEC system with quite small computational burden. The instrumental evaluation and listening test both demonstrate the superiority of the post-filters. 相似文献

2.

Time-domain noise reduction based on an orthogonal decomposition for desired signal extraction

J Benesty J Chen Y Arden Huang T Gaensler 《The Journal of the Acoustical Society of America》2012,132(1):452-464

This paper addresses the problem of noise reduction in the time domain where the clean speech sample at every time instant is estimated by filtering a vector of the noisy speech signal. Such a clean speech estimate consists of both the filtered speech and residual noise (filtered noise) as the noisy vector is the sum of the clean speech and noise vectors. Traditionally, the filtered speech is treated as the desired signal after noise reduction. This paper proposes to decompose the clean speech vector into two orthogonal components: one is correlated and the other is uncorrelated with the current clean speech sample. While the correlated component helps estimate the clean speech, it is shown that the uncorrelated component interferes with the estimation, just as the additive noise. Based on this orthogonal decomposition, the paper presents a way to define the error signal and cost functions and addresses the issue of how to design different optimal noise reduction filters by optimizing these cost functions. Specifically, it discusses how to design the maximum SNR filter, the Wiener filter, the minimum variance distortionless response (MVDR) filter, the tradeoff filter, and the linearly constrained minimum variance (LCMV) filter. It demonstrates that the maximum SNR, Wiener, MVDR, and tradeoff filters are identical up to a scaling factor. It also shows from the orthogonal decomposition that many performance measures can be defined, which seem to be more appropriate than the traditional ones for the evaluation of the noise reduction filters. 相似文献

3.

基于能量损失率估计的传声器阵列语音增强

廖逢钗李鹏徐波《声学学报》2009,34(3):281-288

在延时相加波束形成和维纳滤波技术的基础上,提出了一种基于能量损失率估计的传声器阵列后滤波语音增强算法。该算法通过检测线性不等间距传声器阵列中各嵌套子阵在波束形成前后的能量变化来估计维纳滤波器的权系数,实现了语音增强的目标。在仿真数据集上的实验评估表明,相比原始语音,该算法增强后的语音在信噪比、对数谱距离和感知质量等指标上平均分别改善了17.1 dB,1.001和0.935,具有很好的应用前景。相似文献

4.

An iterative noise cross-PSD estimation for two-microphone speech enhancement 总被引：2，自引：0，他引：2

Mohsen Rahmani Ahmad Akbari Beghdad Ayad 《Applied Acoustics》2009,70(3):514-521

Among various speech enhancement methods, dual-microphone methods are of a great importance for their low cost implementation and for exploiting spatial-filtering benefits of the microphone arrays. Coherence based methods are well-known as efficient two-microphone noise reduction techniques. These techniques do not work well, when received noise signals are correlated. These can be improved when the cross power spectral density (CPSD) of noise is available. In this paper, we propose an iterative approach for estimation of the noise CPSD to be employed in coherence based methods. We compare the proposed iterative noise CPSD estimation with a noise CPSD estimation technique based on voice activity detector (VAD), both of which are employed in a two-microphone speech enhancement, separately. Evaluation results show that the two-microphone speech enhancement scheme utilizing the proposed noise CPSD estimation technique performs superior than the enhancement system using the VAD-based noise CPSD estimation. 相似文献

5.

On the efficacy of spatial sampling using manual scanning paths to determine the spatial average sound pressure level in rooms

Hopkins C 《The Journal of the Acoustical Society of America》2011,129(5):3027-3034

In architectural acoustics, noise control and environmental noise, there are often steady-state signals for which it is necessary to measure the spatial average, sound pressure level inside rooms. This requires using fixed microphone positions, mechanical scanning devices, or manual scanning. In comparison with mechanical scanning devices, the human body allows manual scanning to trace out complex geometrical paths in three-dimensional space. To determine the efficacy of manual scanning paths in terms of an equivalent number of uncorrelated samples, an analytical approach is solved numerically. The benchmark used to assess these paths is a minimum of five uncorrelated fixed microphone positions at frequencies above 200 Hz. For paths involving an operator walking across the room, potential problems exist with walking noise and non-uniform scanning speeds. Hence, paths are considered based on a fixed standing position or rotation of the body about a fixed point. In empty rooms, it is shown that a circle, helix, or cylindrical-type path satisfy the benchmark requirement with the latter two paths being highly efficient at generating large number of uncorrelated samples. In furnished rooms where there is limited space for the operator to move, an efficient path comprises three semicircles with 45°-60° separations. 相似文献

6.

The effect of multimicrophone noise reduction systems on sound source localization by users of binaural hearing aids

Van den Bogaert T Doclo S Wouters J Moonen M 《The Journal of the Acoustical Society of America》2008,124(1):484-497

This paper evaluates the influence of three multimicrophone noise reduction algorithms on the ability to localize sound sources. Two recently developed noise reduction techniques for binaural hearing aids were evaluated, namely, the binaural multichannel Wiener filter (MWF) and the binaural multichannel Wiener filter with partial noise estimate (MWF-N), together with a dual-monaural adaptive directional microphone (ADM), which is a widely used noise reduction approach in commercial hearing aids. The influence of the different algorithms on perceived sound source localization and their noise reduction performance was evaluated. It is shown that noise reduction algorithms can have a large influence on localization and that (a) the ADM only preserves localization in the forward direction over azimuths where limited or no noise reduction is obtained; (b) the MWF preserves localization of the target speech component but may distort localization of the noise component. The latter is dependent on signal-to-noise ratio and masking effects; (c) the MWF-N enables correct localization of both the speech and the noise components; (d) the statistical Wiener filter approach introduces a better combination of sound source localization and noise reduction performance than the ADM approach. 相似文献

7.

基于双传声器的蓝牙耳机降噪算法

下载免费PDF全文

严馨叶邱小军卢晶《应用声学》2014,33(4):313-323

用于免提通信设备的语音增强算法一直是研究的热点问题,而算法处理结果的音质问题近年来也备受关注。针对基于双传声器降噪的蓝牙耳机系统,将常用多通道传声器降噪算法归纳为基于相干函数法和基于空间预分离法这两大类进行分析和比较。基于相干函数法利用两个通道间信号的相干函数对含噪信号滤波达到降噪目的,而基于空间预分离法利用空间特性从含噪信号中分离出噪声参考信号来消除噪声。分析基于降噪量、语音音质和综合性能三个指标,从约束语音损伤的角度分析最优解的形式,并对比两类算法的实际性能。结果表明选择合适的算法可权衡降噪量与语音损伤,达到较好的综合性能。相似文献

8.

门控卷积神经网络多通路声学回声消除算法*

下载免费PDF全文

李国腾郑成诗柯雨璇李晓东《应用声学》2024,43(3):478-478

多通路声重放系统能够增强听者的现实感与空间感,但在免提通信条件下,其不可避免会受到噪声和回声干扰,严重影响通信质量。针对上述问题,本文提出了一种基于门控卷积循环神经网络的多通路声学回声消除和噪声抑制方法。该方法以传声器接收信号和重放声道的压缩复数谱为网络输入,以近端语音的压缩复数谱为网络的输出目标,直接从传声器拾取信号中恢复近端纯净语音,无需对声重放信号进行去相关处理,解决了传统自适应滤波方法中存在的非唯一解问题,同时保证了多通路声重放质量。仿真和真实声学环境实验均表明本文所提出的方法可显著消除多通路声重放系统的噪声和回声,在语音质量和回声返回衰减增益方面均优于传统算法。相似文献

9.

Time-delay estimation of reverberated speech exploiting harmonic structure.

M S Brandstein 《The Journal of the Acoustical Society of America》1999,105(5):2914-2919

The relative time delay associated with a speech signal received at a pair of spatially separated microphones is a key component in talker localization and microphone array beamforming procedures. The traditional method for estimating this parameter utilizes the generalized cross correlation (GCC), the performance of which is compromised by the presence of room reverberations and background noise. Typically, the GCC filtering criteria used are either focused on the signal degradations due to additive noise or those due exclusively to multipath channel effects. There has been relatively little success at applying GCC weighting schemes which are robust to both of these conditions. This paper details an alternative approach which attempts to employ a signal-dependent criterion, namely, the estimated periodicity of the speech signal, to design a GCC filter appropriate for the combination of noise and multipath distortions. Simulations are performed across a range of room conditions to illustrate the utility of the proposed time-delay estimation method relative to conventional GCC filtering approaches. 相似文献

10.

Single-channel speech enhancement method using reconstructive NMF with spectrotemporal speech presence probabilities

Seongjae Lee David K. Han Hanseok Ko 《Applied Acoustics》2017

In this paper, a novel single microphone channel-based speech enhancement technique is presented. While most of the conventional nonnegative matrix factorization-based approaches focus on generating a basis matrix of speech and noise for enhancement, the proposed algorithm performs an additional process to reconstruct speech from noisy speech when these two elements are highly overlapped in selected spectral bands. This process involves a log-spectral amplitude based estimator, which provides the spectrotemporal speech presence probability to obtain a more accurate reconstruction. Moreover, the proposed algorithm applies an unsupervised learning method to the input noise, so it is adaptable to any type of environmental noise without a pre-trained dictionary. The experimental results demonstrate that the proposed algorithm obtains improved speech enhancement performance compared with conventional single channel-based approaches. 相似文献

11.

Binaural processing of modulated interaural level differences

Thompson ER Dau T 《The Journal of the Acoustical Society of America》2008,123(2):1017-1029

Two experiments are presented that measure the acuity of binaural processing of modulated interaural level differences (ILDs) using psychoacoustic methods. In both experiments, dynamic ILDs were created by imposing an interaurally antiphasic sinusoidal amplitude modulation (AM) signal on high-frequency carriers, which were presented over headphones. In the first experiment, the sensitivity to dynamic ILDs was measured as a function of the modulation frequency using puretone, and interaurally correlated and uncorrelated narrow-band noise carriers. The intrinsic interaural level fluctuations of the uncorrelated noise carriers raised the ILD modulation detection thresholds with respect to the pure-tone carriers. The diotic fluctuations of the correlated noise carriers also caused a small increase in the thresholds over the pure-tone carriers, particularly with low ILD modulation frequencies. The second experiment investigated the modulation frequency selectivity in dynamic ILD processing by imposing an interaurally uncorrelated bandpass noise AM masker in series with the interaurally antiphasic AM signal on a pure-tone carrier. By varying the masker center frequencies relative to the signal modulation frequency, broadly tuned, bandpass-shaped patterns were obtained. Simulations with an existing binaural model show that a low-pass filter to limit the binaural temporal resolution is not sufficient to predict the results of the experiments. 相似文献

12.

Effects of directional microphone and adaptive multichannel noise reduction algorithm on cochlear implant performance

Chung K Zeng FG Acker KN 《The Journal of the Acoustical Society of America》2006,120(4):2216-2227

Although cochlear implant (CI) users have enjoyed good speech recognition in quiet, they still have difficulties understanding speech in noise. We conducted three experiments to determine whether a directional microphone and an adaptive multichannel noise reduction algorithm could enhance CI performance in noise and whether Speech Transmission Index (STI) can be used to predict CI performance in various acoustic and signal processing conditions. In Experiment I, CI users listened to speech in noise processed by 4 hearing aid settings: omni-directional microphone, omni-directional microphone plus noise reduction, directional microphone, and directional microphone plus noise reduction. The directional microphone significantly improved speech recognition in noise. Both directional microphone and noise reduction algorithm improved overall preference. In Experiment II, normal hearing individuals listened to the recorded speech produced by 4- or 8-channel CI simulations. The 8-channel simulation yielded similar speech recognition results as in Experiment I, whereas the 4-channel simulation produced no significant difference among the 4 settings. In Experiment III, we examined the relationship between STIs and speech recognition. The results suggested that STI could predict actual and simulated CI speech intelligibility with acoustic degradation and the directional microphone, but not the noise reduction algorithm. Implications for intelligibility enhancement are discussed. 相似文献

13.

Model-based speech enhancement using a bone-conducted signal

Kechichian P Srinivasan S 《The Journal of the Acoustical Society of America》2012,131(3):EL262-EL267

Codebook-based single-microphone noise suppressors, which exploit prior knowledge about speech and noise statistics, provide better performance in nonstationary noise. However, as the enhancement involves a joint optimization over speech and noise codebooks, this results in high computational complexity. A codebook-based method is proposed that uses a reference signal observed by a bone-conduction microphone, and a mapping between air- and bone-conduction codebook entries generated during an offline training phase. A smaller subset of air-conducted speech codebook entries that accurately models the clean speech signal is selected using this reference signal. Experiments support the expected improvement in performance at low computational complexity. 相似文献

14.

一种抑制方向性噪声的双耳语音增强算法

方义冯海泓陈友元胡晓城《声学学报》2016,41(6):897-904

为了给双耳听力设备佩戴者带来更好的语音可懂度,提出了一种利用双耳时间差与声级差的近场语音增强算法,该方法首先利用这两种差异来估计语音的功率谱和语音的相干函数,然后计算干扰噪声在左右耳间的头相关传输函数的比值,最后构造两个维纳滤波器。客观评价的参数显示该算法去噪效果优于对比算法而目标语音的时间差误差和声级差误差低于对比算法。主观的言语接受阈测试表明该方法能有效提高语音可懂度。结果表明,该算法在能够有效去除干扰噪声的同时,保留了目标语音的空间信息。相似文献

15.

A noise cross PSD estimator based on improved minimum statistics method for two-microphone speech enhancement dedicated to a bilateral cochlear implant

Fathi Kallel Mohamed Ghorbel Mondher Frikha Christian Berger-Vachon Ahmed Ben Hamida 《Applied Acoustics》2012,73(3):256-264

Coherence based methods have been successfully applied to dual-microphone noise reduction systems. These techniques showed good results when noise signals on two microphones were uncorrelated, but their performance decreased with correlated noises. It could be improved when the cross power spectral density (CPSD) of received noises is available.In this paper, an improved minimum tracking (IMT) technique for noise CPSD estimation was proposed. The performance of this technique was compared to two other noise CPSD estimators based on voice activity detection (VAD) and minimum tracking (MT) approaches. Evaluation was performed at four signal-to-noise ratios (SNR) and two interfering noise source configurations.Results showed a superiority of the IMT approach in terms of low computing time and quality indicated by the perceptual evaluation of speech quality (PESQ) scores. Then, subjective listening tests were carried out with 50 normal hearing listeners using a specific bilateral cochlear implant (BCI) simulator and utilizing the French Lafon database corrupted by additional babble noise. Results obtained with the proposed technique were better than the two previously mentioned noise CPSD estimators. 相似文献

16.

扩散噪声下协方差矩阵重构的语音分离与降噪

下载免费PDF全文

曾庆宁王师琦《声学学报》2021,46(5):775-784

针对传统多通道语音分离算法在扩散噪声下性能下降的问题,提出了一种用于语音分离及降噪的空间协方差模型及参数估计方法.该方法将扩散噪声视为独立声源,利用由导向矢量重构的空间协方差矩阵建模目标声源的空间特性,并通过空间协方差分析方法估计用于语音分离的多通道维纳滤波器.同时,还提出了一种联合该方法的后置滤波器参数框架,为输出信... 相似文献

17.

Speech enhancement by array crosstalk resistant ANC and spectrum subtraction

References：《声学学报：英文版》2008,27(1):85-96

Microphone array-based speech enhancement has great importance for speech communications and speech recognition. To reduce the aperture of the microphone array and to increase the effect of the speech enhancement will greatly broaden the application areas of the microphone array. An array crosstalk resistant adaptive noise cancellation method is therefore presented. And then an improved spectral subtraction algorithm is further cascaded to obtain better enhancement results. Theoretic analysis and experiments indicate that the proposed scheme needs only a very small microphone array while it simultaneously achieves a higher SNR improvement. Besides, the proposed scheme can be used in many noisy environments and is easy for real-time implementation. 相似文献

18.

吸顶式传声器阵列阵元坐标标定方法

下载免费PDF全文

肖颖杨亦春韩宝坤滕鹏晓《声学学报》2015,40(6):827-835

提出了一种吸顶式传声器阵列阵元坐标的标定方法。针对在混响声场中,时延估计算法性能严重下降从而导致在标定传声器阵元坐标时产生较大误差的问题,提出了利用脉冲声源作为标定声源,并且截取脉冲源直达声的方法来抑制混响声场的影响,提高传声器阵元坐标标定的精度。建立了阵元坐标标定的误差分析模型,并以白噪声和脉冲声源作为标定声源进行数据仿真和对比分析。仿真结果表明,使用脉冲声源作为标定声源能有效地抑制混响声场的影响,获得传声器阵列阵元的准确坐标。同时,在封闭的房间内建立起孔径为3.5 m、64阵元的螺旋状吸顶传声器阵列进行了实验研究,实验结果验证了本文提出方法的有效性。相似文献

19.

A cross-spectrum weighting algorithm for speech enhancement and array processing: combining phase-shift information and stationary signal properties

Schwetz I Gruhler G Obermayer K 《The Journal of the Acoustical Society of America》2006,119(2):952-964

In this paper, a gain function for noise cancellation with a two-channel microphone array is presented. This gain function combines ideas from one- and multichannel algorithms. It is developed using a minimum mean square error estimator for the amplitude of the speech signal from the cross spectrum between two microphone signals. To consider speech pauses and the absence of spectral components of the speech, an extension of this gain function is presented. The performance of the overall gain function is shown in terms of the cancellation of (diffuse) driving noise as well as the cancellation of an interfering speech signal, both recorded in a car. 相似文献

20.

一阶指向可调差分阵列的前后向比分析与优化设计

下载免费PDF全文

石佳韵陈华伟《声学学报》2020,45(5):683-695

一阶指向可调差分传声器阵列具有尺寸小、主瓣指向灵活可调、以及阵列响应不随频率变化等优点,因而在音频处理领域得到了重要关注。但差分传声器阵列对阵元失配误差较为敏感,在实际设计中需要予以考虑。前后向比直接反映了传声器阵列对后向噪声干扰的抑制程度,是差分传声器阵列设计中的重要指标。本文从理论上深入剖析了失配误差对一阶指向可调差分传声器阵列前后向比性能的影响,揭示了其中蕴含的规律。在此基础上,针对实际中同时存在随机失配误差情况,提出了一种最差性能优化设计方法。仿真实验和实测实验验证了本文理论分析的正确性和所提优化设计方法的有效性。相似文献