Similar literature
20 similar documents found (search time: 0 ms)
1.
Several studies have described a release from speech-on-speech masking associated with separation of target and masker sources in the median sagittal plane. Some have excluded the possibility that small differences between target and masker interaural time disparities can fully account for this release. This study explored the mechanisms underlying the spatial release from speech-on-speech masking that can be obtained in the absence of such differences. In one condition, interaural time disparities were removed from the nominal median-sagittal-plane, head-related impulse responses used to generate the virtual auditory space within which competing sentences were presented. In other conditions, interaural level and spectral disparities were also manipulated by presenting competing sentences monaurally or diotically after convolution with one ear's head-related impulse responses. It was found that substantial spatial release from masking can be obtained in the absence of any interaural disparities and that such disparities probably make a relatively minor contribution to spatial release from speech-on-speech masking in the median sagittal plane. It is argued that this release from masking is driven primarily by a reduction in informational masking that occurs when monaural information at one, or both, of the listener's ears facilitates differentiation of competing sentences that emanate from spatially separated sources.

2.
For a mixture of target speech and noise in anechoic conditions, the ideal binary mask is defined as follows: It selects the time-frequency units where target energy exceeds noise energy by a certain local threshold and cancels the other units. In this study, the definition of the ideal binary mask is extended to reverberant conditions. Given the division between early and late reflections in terms of speech intelligibility, three ideal binary masks can be defined: an ideal binary mask that uses the direct path of the target as the desired signal, an ideal binary mask that uses the direct path and early reflections of the target as the desired signal, and an ideal binary mask that uses the reverberant target as the desired signal. The effects of these ideal binary mask definitions on speech intelligibility are compared across two types of interference: speech-shaped noise and concurrent female speech. As suggested by psychoacoustical studies, the ideal binary mask based on the direct path and early reflections of target speech outperforms the other masks as reverberation time increases and produces substantial reductions in terms of speech reception threshold for normal-hearing listeners.
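
The anechoic definition in the abstract above can be sketched in a few lines. This is only an illustration under stated assumptions, not the authors' implementation: the function names, the nested-list layout (frequency channel by time frame), and the `lc_db` parameter name for the local threshold are all hypothetical. For the reverberant variants, the `target_energy` argument would presumably be computed from whichever desired signal the definition names (direct path, direct path plus early reflections, or the full reverberant target); how those energies are obtained is not specified in the abstract.

```python
import math


def ideal_binary_mask(target_energy, noise_energy, lc_db=0.0):
    """Return a 0/1 mask over time-frequency units.

    target_energy, noise_energy: equally sized 2-D lists of non-negative
    energies, indexed [frequency channel][time frame] (assumed layout).
    lc_db: local threshold in dB (hypothetical parameter name).
    A unit is kept (1) when the local target-to-noise ratio exceeds lc_db.
    """
    mask = []
    for t_row, n_row in zip(target_energy, noise_energy):
        row = []
        for t, n in zip(t_row, n_row):
            if t <= 0.0:
                keep = False   # no target energy: cancel the unit
            elif n <= 0.0:
                keep = True    # target present, no noise: keep the unit
            else:
                keep = 10.0 * math.log10(t / n) > lc_db
            row.append(1 if keep else 0)
        mask.append(row)
    return mask


def apply_mask(mixture, mask):
    """Multiply each time-frequency unit of the mixture by its mask value."""
    return [[m * k for m, k in zip(m_row, k_row)]
            for m_row, k_row in zip(mixture, mask)]


# Toy 2x2 example with made-up energies.
target = [[4.0, 1.0], [0.0, 9.0]]
noise = [[1.0, 2.0], [1.0, 0.0]]
print(ideal_binary_mask(target, noise))  # [[1, 0], [0, 1]]
```

Raising `lc_db` makes the mask more selective: with `lc_db=10.0`, the first unit in the toy example (about 6 dB local ratio) would also be cancelled.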

3.
Although most recent multitalker research has emphasized the importance of binaural cues, monaural cues can play an equally important role in the perception of multiple simultaneous speech signals. In this experiment, the intelligibility of a target phrase masked by a single competing masker phrase was measured as a function of signal-to-noise ratio (SNR) with same-talker, same-sex, and different-sex target and masker voices. The results indicate that informational masking, rather than energetic masking, dominated performance in this experiment. The amount of masking was highly dependent on the similarity of the target and masker voices: performance was best when different-sex talkers were used and worst when the same talker was used for target and masker. Performance did not, however, improve monotonically with increasing SNR. Intelligibility generally plateaued at SNRs below 0 dB and, in some cases, intensity differences between the target and masking voices produced substantial improvements in performance with decreasing SNR. The results indicate that informational and energetic masking play substantially different roles in the perception of competing speech messages.

4.
5.
Sound source segregation refers to the ability to hear as separate entities two or more sound sources comprising a mixture. Masking refers to the ability of one sound to make another sound difficult to hear. Often in studies, masking is assumed to result from a failure of segregation, but this assumption may not always be correct. Here a method is offered to identify the relation between masking and sound source segregation in studies and an example is given of its application.

6.
Two experiments compared the effect of supplying visual speech information (e.g., lipreading cues) on the ability to hear one female talker's voice in the presence of steady-state noise or a masking complex consisting of two other female voices. In the first experiment intelligibility of sentences was measured in the presence of the two types of maskers with and without perceived spatial separation of target and masker. The second study tested detection of sentences in the same experimental conditions. Results showed that visual cues provided more benefit for both recognition and detection of speech when the masker consisted of other voices (versus steady-state noise). Moreover, visual cues provided greater benefit when the target speech and masker were spatially coincident versus when they appeared to arise from different spatial locations. The data obtained here are consistent with the hypothesis that lipreading cues help to segregate a target voice from competing voices, in addition to the established benefit of supplementing masked phonetic information.

7.
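To facilitate Lamb wave signal analysis and mode characterization, a multimode ultrasonic Lamb wave identification method is proposed that combines the short-time Fourier transform (STFT) with independent component analysis (ICA). The Lamb wave time-domain signal is first projected into the time-frequency domain by the STFT; because the individual modes are relatively independent in the time-frequency domain, ICA is then used to separate the overlapping modes. The group velocity curve of each mode is extracted from the time-frequency energy ridge of the separated mode, and the plate thickness is then estimated. The method was applied to finite-difference time-domain (FDTD) simulations and to an experiment on a steel plate, and three modes, A0, A1, and S0, were separated. The mean group velocity estimation errors in the simulation and the experiment were approximately 1.5% and 2.0%, respectively, and the plate thickness estimation errors were approximately 0.3% and 2.0%. The simulation results show that, for signal-to-noise ratios (SNRs) of no less than 0 dB, the time-frequency-domain ICA method achieves Lamb wave multimode separation, group velocity curve extraction, and plate thickness estimation.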

8.
Phase and component segregation in lipid membranes was studied by means of mathematical modeling. The time dependence of phase segregation on the lateral diffusion coefficient was calculated. The phase and component segregations were also simulated by multiparticle Monte-Carlo methods, and the phase diagrams of the system were obtained.

10.
Although many studies have shown that intelligibility improves when a speech signal and an interfering sound source are spatially separated in azimuth, little is known about the effect that spatial separation in distance has on the perception of competing sound sources near the head. In this experiment, head-related transfer functions (HRTFs) were used to process stimuli in order to simulate a target talker and a masking sound located at different distances along the listener's interaural axis. One of the signals was always presented at a distance of 1 m, and the other signal was presented 1 m, 25 cm, or 12 cm from the center of the listener's head. The results show that distance separation has very different effects on speech segregation for different types of maskers. When speech-shaped noise was used as the masker, most of the intelligibility advantages of spatial separation could be accounted for by spectral differences in the target and masking signals at the ear with the higher signal-to-noise ratio (SNR). When a same-sex talker was used as the masker, the intelligibility advantages of spatial separation in distance were dominated by binaural effects that produced the same performance improvements as a 4-5-dB increase in the SNR of a diotic stimulus. These results suggest that distance-dependent changes in the interaural difference cues of nearby sources play a much larger role in the reduction of the informational masking produced by an interfering speech signal than in the reduction of the energetic masking produced by an interfering noise source.

11.
Positron emission tomography (PET) was used to investigate the neural basis of the comprehension of speech in unmodulated noise ("energetic" masking, dominated by effects at the auditory periphery), and when presented with another speaker ("informational" masking, dominated by more central effects). Each type of signal was presented at four different signal-to-noise ratios (SNRs) (+3, 0, -3, -6 dB for the speech-in-speech, +6, +3, 0, -3 dB for the speech-in-noise), with listeners instructed to listen for meaning to the target speaker. Consistent with behavioral studies, there was SNR-dependent activation associated with the comprehension of speech in noise, with no SNR-dependent activity for the comprehension of speech-in-speech (at low or negative SNRs). There was, in addition, activation in bilateral superior temporal gyri which was associated with the informational masking condition. The extent to which this activation of classical "speech" areas of the temporal lobes might delineate the neural basis of the informational masking is considered, as is the relationship of these findings to the interfering effects of unattended speech and sound on more explicit working memory tasks. This study is a novel demonstration of candidate neural systems involved in the perception of speech in noisy environments, and of the processing of multiple speakers in the dorso-lateral temporal lobes.

12.
The present study sought to clarify the role of non-simultaneous masking in the binaural masking level difference for maskers that fluctuate in level. In the first experiment the signal was a brief 500-Hz tone, and the masker was a bandpass noise (100-2000 Hz), with the initial and final 200-ms bursts presented at 40-dB spectrum level and the inter-burst gap presented at 20-dB spectrum level. Temporal windows were fitted to thresholds measured for a range of gap durations and signal positions within the gap. In the second experiment, individual differences in out-of-phase (NoSπ) thresholds were compared for a brief signal in a gapped bandpass masker, a brief signal in a steady bandpass masker, and a long signal in a narrowband (50-Hz-wide) noise masker. The third experiment measured brief tone detection thresholds in forward, simultaneous, and backward masking conditions for a 50-Hz-wide and for a 1900-Hz-wide noise masker centered on the 500-Hz signal frequency. Results are consistent with comparable temporal resolution in the in-phase (NoSo) and NoSπ conditions and no effect of temporal resolution on individual observers' ability to utilize binaural cues in narrowband noise. The large masking release observed for a narrowband noise masker may be due to binaural masking release from non-simultaneous, informational masking.

13.
Previous studies have shown that threshold for a signal in tone-on-tone simultaneous masking is sometimes lower when the masker is continuous than when it is gated. Threshold may also decline as signal onset is delayed relative to the onset of a longer duration masker, though it may increase again near masker offset. In the present study, the level of a 1250-Hz sinusoidal masker was found which would just mask a 20-ms, 1000-Hz sinusoid presented at 10-dB sensation level (SL). Masker duration was 20 or 400 ms; in the latter case, the signal was presented in one of three temporal positions within the masker. The level of the 1250-Hz masker necessary to mask the signal was reduced, sometimes by as much as 20-25 dB, by a 20-ms, 500-Hz sinusoid (transient masker) presented at the times when the signal might occur, but at a level 30 dB below that at which it would mask the 10-dB SL signal. This suggests that, in the earlier studies, at least some of the elevation in threshold in the presence of a short-duration masker or at the beginning (or end) of a longer duration masker may have been due to the transient responses to the masker affecting detection of the signal, but not necessarily masking the signal in terms of excitation in the signal "channel."

14.
The influence of the degree of envelope modulation and periodicity on the loudness and effectiveness of sounds as forward maskers was investigated. In the first experiment, listeners matched the loudness of complex tones and noise. The tones had a fundamental frequency (F0) of 62.5 or 250 Hz and were filtered into a frequency range from the 10th harmonic to 5000 Hz. The Gaussian noise was filtered in the same way. The components of the complex tones were added either in cosine phase (CPH), giving a large crest factor, or in random phase (RPH), giving a smaller crest factor. For each F0, subjects matched the loudness between all possible stimulus pairs. Six different levels of the fixed stimulus were used, ranging from about 30 dB SPL to about 80 dB SPL in 10-dB steps. Results showed that, at a given overall level, the CPH and the RPH tones were louder than the noise, and that the CPH tone was louder than the RPH tone. The difference in loudness was larger at medium than at low levels and was only slightly reduced by the addition of a noise intended to mask combination tones. The differences in loudness were slightly smaller for the higher than for the lower F0. In the second experiment, the stimuli with the lower F0s were used as forward maskers of a 20-ms sinusoid, presented at various frequencies within the spectral range of the maskers. Results showed that the CPH tone was the least effective forward masker, even though it was the loudest. The differences in effectiveness as forward maskers depended on masker level and signal frequency; in order to produce equal masking, the level of the CPH tone had to be up to 35 dB above that of the RPH tone and the noise. The implications of these results for models of loudness are discussed and a model is presented based on neural activity patterns in the auditory nerve; this predicts the general pattern of loudness matches. 
It is suggested that the effects observed in the experiments may have been influenced by two factors: cochlear compression and suppression.
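
The difference in crest factor between the cosine-phase (CPH) and random-phase (RPH) complex tones described above can be checked numerically. The sketch below is an illustration only: it assumes equal-amplitude harmonics from the 10th harmonic up to 5000 Hz, sampled over one period of F0, and all function names, the sample count, and the random seed are hypothetical choices rather than details taken from the study.

```python
import math
import random


def complex_tone_period(f0, lowest_harmonic, f_max, phases=None, samples=2048):
    """One period of an equal-amplitude harmonic complex.

    Harmonics run from lowest_harmonic up to the highest one at or below
    f_max. phases maps harmonic number -> starting phase in radians;
    None means cosine phase (all starting phases zero).
    """
    highest = int(f_max // f0)
    harmonics = range(lowest_harmonic, highest + 1)
    if phases is None:
        phases = {h: 0.0 for h in harmonics}
    return [
        sum(math.cos(2.0 * math.pi * h * n / samples + phases[h])
            for h in harmonics)
        for n in range(samples)
    ]


def crest_factor(x):
    """Ratio of peak absolute value to root-mean-square value."""
    peak = max(abs(v) for v in x)
    rms = math.sqrt(sum(v * v for v in x) / len(x))
    return peak / rms


def random_phases(f0, lowest_harmonic, f_max, seed=0):
    """Independent uniform starting phases in [0, 2*pi) for each harmonic."""
    rng = random.Random(seed)
    highest = int(f_max // f0)
    return {h: rng.uniform(0.0, 2.0 * math.pi)
            for h in range(lowest_harmonic, highest + 1)}


# F0 = 62.5 Hz: harmonics 10 to 80 fall at or below 5000 Hz (71 components).
cph = crest_factor(complex_tone_period(62.5, 10, 5000.0))
rph = crest_factor(
    complex_tone_period(62.5, 10, 5000.0, random_phases(62.5, 10, 5000.0)))
print(cph, rph)
```

For K equal-amplitude components in cosine phase, the peak is K and the RMS value is sqrt(K/2), so the crest factor is sqrt(2K); with 71 components that is about 11.9. The random-phase value depends on the particular phase draw but is lower, in line with the "smaller crest factor" mentioned in the abstract.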

15.
Masking asymmetry was investigated over a wide range of stimulus intensities for two signal frequencies, fo = 1.0 and 4.0 kHz, using both fixed-masker and fixed-signal paradigms. The masker was a notched noise with the upper and lower edges of the notch, fu and fl, respectively, placed asymmetrically about fo. For various notch widths, the asymmetry of masking was measured as the difference between the masked threshold obtained when fl was nearer fo and that obtained when fu was nearer fo. For maskers with wide notches, (fu - fl)/fo greater than 0.15, masking asymmetry changed with stimulus level; at the highest level, masked threshold was greatest when fl was nearer fo, and, at the lowest level, the asymmetry reversed slightly for fo = 1.0 kHz so that masked threshold was actually greater when fu was nearer fo. Nonparallel growth-of-masking functions reveal changes in masking asymmetry with signal level as well as with masker level. It is concluded that the nonlinear growth of masking with level is due primarily to changes in the auditory filter, rather than changes in the detector following the filter.

16.
17.
Models of the additivity of masking (total citations: 1; self-citations: 0; citations by others: 1)

18.
Upward shifts in the masking pattern with increasing masker intensity (total citations: 1; self-citations: 0; citations by others: 1)
Masking patterns obtained with forward-masking paradigms and relatively intense maskers sometimes have their peaks at the masker frequency and sometimes at a frequency well above it. Here it is shown that which outcome is obtained depends upon certain temporal parameters of the procedure. Specifically, the masking pattern for a 2000-Hz tone showed a gradual shift toward higher frequencies as masker intensity was increased from 65 to 95 dB SPL when long signals (about 50 ms) and long masker-to-signal intervals (about 50 ms) were used, but the effect was absent or smaller when the signals and intervals were short. This shift did not occur with a 750-Hz masker. Upward shifts in the masking pattern with increasing masker intensity are in accord with the view that the peak of displacement of the traveling-wave envelope migrates basally with increasing intensity, an idea that has frequently been suggested as an explanation of the so-called half-octave shift so routinely seen in auditory fatigue experiments.

19.
A procedure for calculating the potential and field of multipole lenses in which the central part of the poles has the form of an ideal curve is presented. The procedure is based on the method of conformal mapping. The properties of these lenses are compared with those whose poles have a polygonal profile. It is shown that sufficiently wide ideal-center poles provide a better field quality than polygonal poles.

20.
In this work we present a comparative study of the thermodynamic properties of two two-dimensional systems. One of them corresponds to a two-dimensional system of coupled harmonic oscillators, while the second is a nonlinear anharmonic oscillator. We present the spectrum for the harmonic oscillator and, for the first time, a calculation of the WKB spectrum and the wave functions for the nonlinear oscillator. We show that the Ω-potential has significantly different behavior in the high temperature region. This may be noted, for example, in a study of the temperature dependence of the specific heat in planar anisotropic media.
State Institute of Radio Technology, Electronics, and Automation, Technical University, Moscow. Translated from Izvestiya Vysshikh Uchebnykh Zavedenii, Fizika, Vol. 42, No. 1, pp. 8–17, January, 1999.
