首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 140 毫秒
1.
Bilateral cochlear implant (BI-CI) recipients achieve high word recognition scores in quiet listening conditions. Still, there is a substantial drop in speech recognition performance when there is reverberation and more than one interferers. BI-CI users utilize information from just two directional microphones placed on opposite sides of the head in a so-called independent stimulation mode. To enhance the ability of BI-CI users to communicate in noise, the use of two computationally inexpensive multi-microphone adaptive noise reduction strategies exploiting information simultaneously collected by the microphones associated with two behind-the-ear (BTE) processors (one per ear) is proposed. To this end, as many as four microphones are employed (two omni-directional and two directional) in each of the two BTE processors (one per ear). In the proposed two-microphone binaural strategies, all four microphones (two behind each ear) are being used in a coordinated stimulation mode. The hypothesis is that such strategies combine spatial information from all microphones to form a better representation of the target than that made available with only a single input. Speech intelligibility is assessed in BI-CI listeners using IEEE sentences corrupted by up to three steady speech-shaped noise sources. Results indicate that multi-microphone strategies improve speech understanding in single- and multi-noise source scenarios.  相似文献   

2.
快速收敛最小方差无畸变响应算法研究及应用   总被引:4,自引:0,他引:4  
周胜增  杜选民 《声学学报》2009,34(6):515-520
常规最小方差无畸变响应(MVDR)自适应波束形成是一种高分辨窄带波束形成器,它是利用实际声场的窄带互谱密度矩阵(CSDM)估计出自适应波束形成权向量。在实际应用中,MVDR算法需要较长的观测时间估计协方差矩阵,不利于对高速运动目标进行定位;对于宽带目标信号,MVDR算法需要对每一个CSDM进行求逆运算,计算量较大;在相干源条件下,目标信号之间会发生"对消"现象,MVDR算法性能急剧恶化。本文提出了基于子带子阵处理的快速收敛MVDR自适应波束形成方法。首先将全频带划分成一组子带,将接收线阵划分成一组子阵,然后对每一子带计算降维的驾驶协方差矩阵(STCM),从而得到快速收敛MVDR自适应波束形成的权值和空间谱估计结果。同时采用双向空间平滑方法对相干源进行MVDR空间谱估计。仿真和海试数据处理结果表明该算法在保证高分辨力的同时,具有瞬时收敛的性能,双向空间平滑技术具有良好的解相干性能。   相似文献   

3.
As advanced signal processing algorithms have been proposed to enhance hearing protective device (HPD) performance, it is important to determine how directional microphones might affect the localization ability of users and whether they might cause safety hazards. The effect of in-the-ear microphone directivity was assessed by measuring sound source identification of speech in the horizontal plane. Recordings of speech in quiet and in noise were made with Knowles Electronic Manikin for Acoustic Research wearing bilateral in-the-ear hearing aids with microphones having adjustable directivity (omnidirectional, cardioid, hypercardioid, supercardioid). Signals were generated from 16 locations in a circular array. Sound direction identification performance of eight normal hearing listeners and eight hearing-impaired listeners revealed that directional microphones did not degrade localization performance and actually reduced the front-back and lateral localization errors made when listening through omnidirectional microphones. The summed rms speech level for the signals entering the two ears appear to serve as a cue for making front-back discriminations when using directional microphones in the experimental setting. The results of this study show that the use of matched directional microphones when worn bilaterally do not have a negative effect on the ability to localize speech in the horizontal plane and may thus be useful in HPD design.  相似文献   

4.
最小方差无失真响应波束形成算法在应用于语音等宽带信号时,依赖窄带假设可以在频域各个子带分别进行滤波。窄带假设下语音信号协方差矩阵是秩-1矩阵,而实际中窄带信号模型只是实际信号模型的一种近似,同时由于存在统计量估计误差,估计的语音信号协方差矩阵的秩一般大于1。提出利用语音协方差矩阵和噪声协方差矩阵的广义主特征向量来估计相对传递函数,用于重构语音信号协方差矩阵为秩-1矩阵。在REVERB数据集以及CHiME-4数据集上进行实验验证,最小方差无失真响应波束形成算法经过语音协方差矩阵低秩近似后,对估计误差的鲁棒性提高,输出信噪比分别提升平均0.8 dB和1.4 dB,同时提升了语音识别准确率。   相似文献   

5.
Several array-processing algorithms were implemented and evaluated with experienced hearing-aid users. The array consisted of four directional microphones mounted broadside on a headband worn on the top of the listener's head. The algorithms included two adaptive array-processing algorithms, one fixed array-processing algorithm, and a reference condition consisting of binaural directional microphones. The algorithms were evaluated under conditions with both one and three independent noise sources. Performance metrics included quantitative speech reception thresholds and qualitative subject preference ratings for ease-of-listening measured using a paired-comparison procedure. On average, the fixed algorithm improved speech reception thresholds by 2 dB, while the adaptive algorithms provided 7-9-dB improvement over the reference condition. Subjects judging ease-of-listening generally preferred all array-processing algorithms over the reference condition. The results suggest that these adaptive algorithms should be evaluated further in more realistic acoustic environments.  相似文献   

6.
本文提出基于Khatri-Rao子空间和传播算子的宽带声源波达方向估计算法。该算法将声源不同频率处的协方差矩阵变换重排为一个高维矩阵,然后利用传播算子方法估计宽带声源波达方向。该算法计算复杂度介于聚焦Khatri-Rao子空间和相干子空间算法之间。仿真和实验结果表明,该算法在降低计算量的同时,估计误差与聚焦Khatri-Rao子空间算法相近,远小于相干子空间算法。  相似文献   

7.
石倩  陈航艇  张鹏远 《声学学报》2022,47(1):139-150
提出了波达方向初始化空间混合概率模型的语音增强算法.通过声源定位估计出声源波达方向,再根据此计算相对传递函数,进而构造空间协方差矩阵来初始化空间混合概率模型.论证了相对传递函数在作为模型参数中语音协方差矩阵的主特征向量时,空间混合概率模型对应的概率分布可达到最大值,进而使期望最大化算法在迭代时更易收敛,以得到期望的掩蔽...  相似文献   

8.
The present study is concerned with the convolutive Blind Source Separation (BSS) of sound sources that leads to a significant speech intelligibility enhancement. Two experiments were conducted. In the first experiment two different algorithms of convolutive BSS were compared. Both methods are based on second order statistics since such approach is simple and gives satisfactory performance. The data resulted from this experiment suggested that with different approaches, different speech intelligibility improvement could be obtained. In the second experiment the influence of the spatial configuration of the cardioid microphones on the BSS performance was measured. It was revealed that the best separation for a considered spatial configuration can be obtained when microphones are directed alternately.  相似文献   

9.
A weighting orthogonal method for constant beamwidth beamforming matrices is proposed. This method multiplies weighting factors to each orthogonal beamforming matrix corresponding to different frequency bins. The method proposed doesn't cause waveform aberration, and doesn't cause additional loss of array signal-to-noise ratio when the sources have uniform spectrum. The waveform aberration and additional loss of array signal-to-noise ratio can not be avoided simultaneously by ordinary orthogonal method. So we can get good detection and estimation performances at the same time by the weightmg method. Simulation results and water tank experiments are presented to confirm the conclusion above.  相似文献   

10.
The conventional MVDR adaptive beamformer is a high-resolution narrowband beamformer which estimates the optimal beamforming weights using narrowband CSDM of real acoustic field.In practical applications,MVDR algorithm needs long observation time to estimate the covariance matrix.This inherent property makes it difficult to localize fast-moving targets.For wideband signals,MVDR algorithm needs inverting every CSDM which increases the computational demands.For correlated sources,the performance of MVDR will degrade dramatically because the signals will cancel each other.A fast-convergent MVDR algorithm based on subband subarray processing is proposed.The full frequency band is divided into sets of subbands and the line array is divided into sets of subarrays.For every subband the STCM of reduced dimensions is calculated.Then adaptive beamforming weight of fast-convergent MVDR algorithm and spatial spectrum estimation are obtained.At the same time,spatial spectrum estimation can be made for correlated sources using the two-sided spatial smoothing method. Results of simulation and trial data show that the proposed method has high-resolution and near-instantaneous convergence property,two-sided spatial smoothing has satisfactory validity of decorrelation.  相似文献   

11.
Speech can remain intelligible for listeners with normal hearing when processed by narrow bandpass filters that transmit only a small fraction of the audible spectrum. Two experiments investigated the basis for the high intelligibility of narrowband speech. Experiment 1 confirmed reports that everyday English sentences can be recognized accurately (82%-98% words correct) when filtered at center frequencies of 1500, 2100, and 3000 Hz. However, narrowband low predictability (LP) sentences were less accurately recognized than high predictability (HP) sentences (20% lower scores), and excised narrowband words were even less intelligible than LP sentences (a further 23% drop). While experiment 1 revealed similar levels of performance for narrowband and broadband sentences at conversational speech levels, experiment 2 showed that speech reception thresholds were substantially (>30 dB) poorer for narrowband sentences. One explanation for this increased disparity between narrowband and broadband speech at threshold (compared to conversational speech levels) is that spectral components in the sloping transition bands of the filters provide important cues for the recognition of narrowband speech, but these components become inaudible as the signal level is reduced. Experiment 2 also showed that performance was degraded by the introduction of a speech masker (a single competing talker). The elevation in threshold was similar for narrowband and broadband speech (11 dB, on average), but because the narrowband sentences required considerably higher sound levels to reach their thresholds in quiet compared to broadband sentences, their target-to-masker ratios were very different (+23 dB for narrowband sentences and -12 dB for broadband sentences). As in experiment 1, performance was better for HP than LP sentences. The LP-HP difference was larger for narrowband than broadband sentences, suggesting that context provides greater benefits when speech is distorted by narrow bandpass filtering.  相似文献   

12.
This paper introduces a miniaturized microphone array using the Directionally Constrained Minimization of Power (DCMP) method, which utilizes the transfer functions of microphones located at the same place, namely aggregated microphones. The phased microphone array realizes a noise reduction and direction of arrival (DOA) estimation system according to differences in the arrival time, phase shift, and/or the level of the sound wave for each microphone. Hence it is difficult to miniaturize the microphone array. The objective of our research is to miniaturize the system size using aggregated microphones. In this paper, we first show that the phased microphone array system and the proposed aggregated microphone system can be described within the same framework. We then apply a microphone array under directional constraint to the aggregated microphones and compare the proposed method with the microphone array. We show the directional pattern of the aggregated microphones. We also show the experimental results regarding DOA estimation.  相似文献   

13.
曾庆宁  王师琦 《声学学报》2021,46(5):775-784
针对传统多通道语音分离算法在扩散噪声下性能下降的问题,提出了一种用于语音分离及降噪的空间协方差模型及参数估计方法。该方法将扩散噪声视为独立声源,利用由导向矢量重构的空间协方差矩阵建模目标声源的空间特性,并通过空间协方差分析方法估计用于语音分离的多通道维纳滤波器。同时,还提出了一种联合该方法的后置滤波器参数框架,为输出信号降噪和失真的折中提供了更多选择。在扩散噪声下的单目标和多目标实验中,所提方法的语音提取和分离性能都优于对比算法,联合参数的后置滤波器可提供更为符合人们要求的降噪语音,验证了所提模型与参数估计方法的有效性。   相似文献   

14.
王泽林  陈锴  卢晶 《声学学报》2020,45(5):696-706
在车载分布式传声器阵列场景中,结合盲源分离TRINICON (Triple-N ICA for convolutive mixtures)算法与多说话人状态判决实现期望语音抽取。根据分布式传声器阵列与声源的相对位置关系,设计特定的盲源分离初始化条件以保证输出通道与声源的映射关系;根据分布式传声器阵列的频响特点,设计特征矢量来进行多说话人判决,并将判决结果引入TRINICON算法参数迭代过程。在使用实际车载录音数据的仿真评测中,所提方法在不同信噪比下有较高的鲁棒性,可有效提升TRINICON算法的收敛速度和语音信号的信扰比,且可以确保准确的通道映射。评测结果表明该方法可以在车载场景中有效抽取出期望语音,为车载复杂场景下的声信息提取提供了一种可靠且收敛快速的解决方法。   相似文献   

15.
16.
Estimating the direction of a sound source is an important technique used in various engineering fields, including intelligent robots and surveillance systems. In a household where a user’s voice and noises emitted from electric appliances originate from arbitrary directions in 3-D space, robots need to recognize the directions of multiple sound sources in order to effectively interact with the user.This paper proposes an ear-based estimation (localization) system using two artificial robot ears, each consisting of a spiral-shaped pinna and two microphones, for application in humanoid robots. Four microphones are asymmetrically placed on the left and right sides of the head. The proposed localization algorithm is based on a spatially mapped generalized cross-correlation function which is transformed from the time domain to the space domain by using a measured inter-channel time difference map. For validation of the proposed localization method, two experiments (single- and multiple-source cases) were conducted using male speech. In the case of a single source, with the exception of laterally biased sources, the localization was achieved with an error of less than 10°. In a multiple-source environment, one source was fixed at the front side and the other source changed its direction; from the experimental results, the error rates on the localization of the fixed and varying sources are 0% and 36.9% respectively within an error bound of 15°.  相似文献   

17.
In this contribution, a novel dual-channel speech enhancement technique is introduced. The proposed approach uses the dissimilarity between the power of received signals in the two channels as a criterion for speech enhancement and noise reduction. We claim that in near field conditions, where the distances between microphones and sound source are short, the difference in the received power levels at the two microphones is an estimate of the clean speech signal power. Then, apply this theory to present an optimum method for speech enhancement. Fortunately, the method has the ability to cope with problems such as transient noise and nearby microphones which are two of the main problems of the proposed dual-microphone speech enhancement techniques. Using objective speech quality measures and spectrogram analysis, we show that the proposed method results in improved speech quality.  相似文献   

18.
Algorithms for estimation of the spatial spectrum of ocean noise using a single hydroacoustic combined receiving module (CRM), which records the field of acoustic pressure and its gradient projections onto three mutually orthogonal spatial directions for small wave sizes, are discussed. It is shown that in spite of a rather “obtuse” cosine directional response pattern of vector channels of this receiving module it is possible to obtain good determination of the direction toward the local source due to recording the vector characteristic of the acoustic field (acoustic power flux), and in the absence of powerful local sources in the water area the spatial spectrum of noise close to a realistic one can be obtained.  相似文献   

19.
The performance of broadband sonar array processing can degrade significantly in shallow-water environments when interference becomes angularly spread due to multipath propagation. Particularly for towed line arrays near endfire, elevation angle spreading of multipath interference often results in masking of weaker sources of interest. While adaptive beamforming in a series of narrow frequency bands can suppress coherent multipath interference, this approach requires long observation times to estimate the required narrowband covariance matrices. To form wideband covariance matrices which can be estimated with less observation time, plane-wave focusing methods have been used to avoid interference covariance matrix rank inflation. This paper extends wideband focusing to the case of coherent multipath interference. The approach presented here, called waveguide invariant focusing (WIF), exploits a robust relationship for the frequency dependence of horizontal wave number differences. Unlike matched-field methods, WIF does not model multipath wave fronts but rather makes the interference appear to occupy the same rank-one subspace across frequency. This permits formation of wideband covariance matrices without interference rank inflation. Simulation experiments in a realistic ocean environment indicate that adaptive beamforming using WIF covariance matrices can provide a significant array gain improvement over conventional adaptive methods with limited observation time.  相似文献   

20.
A signal-processing algorithm that modifies the interaural time delays associated with directional sources is described. Signals received at two microphones are processed by four linear filters arranged in a lattice configuration to produce two outputs, one for each ear. Since the processing is linear, the method is equally applicable to single or multiple directional sources. The filters are designed to minimize the average squared error between a user specified desired space warping function and the actual warping function that they implement. Two classes of filters are considered: filters whose frequency response is unconstrained and filters constrained to be causal with finite impulse response. In both cases the solution of the least-squares problem is given and properties of the actual space warping function are examined. Perceptual experiments and analysis of acoustic waveforms are utilized to demonstrate the effectiveness of the algorithm. Extension of this method for utilizing more than two microphones is described.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号