首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 15 毫秒
1.
Speech reception thresholds (SRTs) were measured for target speech presented concurrently with interfering speech (spoken by a different speaker). In experiment 1, the target and interferer were divided spectrally into high- and low-frequency bands and presented over headphones in three conditions: monaural, dichotic (target and interferer to different ears), and swapped (the low-frequency target band and the high-frequency interferer band were presented to one ear, while the high-frequency target band and the low-frequency interferer band were presented to the other ear). SRTs were highest in the monaural condition and lowest in the dichotic condition; SRTs in the swapped condition were intermediate. In experiment 2, two new conditions were devised such that one target band was presented in isolation to one ear while the other band was presented at the other ear with the interferer. The pattern of SRTs observed in experiment 2 suggests that performance in the swapped condition reflects the intelligibility of the target frequency bands at just one ear; the auditory system appears unable to exploit advantageous target-to-interferer ratios at different ears when segregating target speech from a competing speech interferer.  相似文献   

2.
3.
The equalization stage in the equalization-cancellation model of binaural unmasking compensates for the interaural time delay (ITD) of a masking noise by introducing an opposite, internal delay [N. I. Durlach, in Foundations of Modern Auditory Theory, Vol. II., edited by J. V. Tobias (Academic, New York, 1972)]. Culling and Summerfield [J. Acoust. Soc. Am. 98, 785-797 (1995)] developed a multi-channel version of this model in which equalization was "free" to use the optimal delay in each channel. Two experiments were conducted to test if equalization was indeed free or if it was "restricted" to the same delay in all channels. One experiment measured binaural detection thresholds, using an adaptive procedure, for 1-, 5-, or 17-component tones against a broadband masking noise, in three binaural configurations (N0S180, N180S0, and N90S270). The thresholds for the 1-component stimuli were used to normalize the levels of each of the 5- and 17-component stimuli so that they were equally detectable. If equalization was restricted, then, for the 5- and 17-component stimuli, the N90S270 and N180S0 configurations would yield a greater threshold than the N0S180 configurations. No such difference was found. A subsequent experiment measured binaural detection thresholds, via psychometric functions, for a 2-component complex tone in the same three binaural configurations. Again, no differential effect of configuration was observed. An analytic model of the detection of a complex tone showed that the results were more consistent with free equalization than restricted equalization, although the size of the differences was found to depend on the shape of the psychometric function for detection.  相似文献   

4.
The role of perceived spatial separation in the unmasking of speech   总被引:12,自引:0,他引:12  
Spatial separation of speech and noise in an anechoic space creates a release from masking that often improves speech intelligibility. However, the masking release is severely reduced in reverberant spaces. This study investigated whether the distinct and separate localization of speech and interference provides any perceptual advantage that, due to the precedence effect, is not degraded by reflections. Listeners' identification of nonsense sentences spoken by a female talker was measured in the presence of either speech-spectrum noise or other sentences spoken by a second female talker. Target and interference stimuli were presented in an anechoic chamber from loudspeakers directly in front and 60 degrees to the right in single-source and precedence-effect (lead-lag) conditions. For speech-spectrum noise, the spatial separation advantage for speech recognition (8 dB) was predictable from articulation index computations based on measured release from masking for narrow-band stimuli. The spatial separation advantage was only 1 dB in the lead-lag condition, despite the fact that a large perceptual separation was produced by the precedence effect. For the female talker interference, a much larger advantage occurred, apparently because informational masking was reduced by differences in perceived locations of target and interference.  相似文献   

5.
6.
A study was made of the effect of interaural time delay (ITD) and acoustic headshadow on binaural speech intelligibility in noise. A free-field condition was simulated by presenting recordings, made with a KEMAR manikin in an anechoic room, through earphones. Recordings were made of speech, reproduced in front of the manikin, and of noise, emanating from seven angles in the azimuthal plane, ranging from 0 degree (frontal) to 180 degrees in steps of 30 degrees. From this noise, two signals were derived, one containing only ITD, the other containing only interaural level differences (ILD) due to headshadow. Using this material, speech reception thresholds (SRT) for sentences in noise were determined for a group of normal-hearing subjects. Results show that (1) for noise azimuths between 30 degrees and 150 degrees, the gain due to ITD lies between 3.9 and 5.1 dB, while the gain due to ILD ranges from 3.5 to 7.8 dB, and (2) ILD decreases the effectiveness of binaural unmasking due to ITD (on the average, the threshold shift drops from 4.6 to 2.6 dB). In a second experiment, also conducted with normal-hearing subjects, similar stimuli were used, but now presented monaurally or with an overall 20-dB attenuation in one channel, in order to simulate hearing loss. In addition, SRTs were determined for noise with fixed ITDs, for comparison with the results obtained with head-induced (frequency dependent) ITDs.(ABSTRACT TRUNCATED AT 250 WORDS)  相似文献   

7.
Three experiments were carried out that employed low-frequency tone complexes with interaural delays that varied across the frequency domain. In the first experiment, threshold interaural delays were measured for three-tone complexes for which one, two, or all three components were delayed. The center frequency was 750 Hz and the frequency spacing (delta f) between components was 20, 50, 100, 250, or 450 Hz. For all delta f's, the presence of two diotic components elevated the threshold interaural delays obtained for the third component relative to that obtained for a pure tone of the same frequency. In the second experiment, observers made left-right judgments regarding the direction of movement of signals for which two components were delayed by 25 microseconds to the left ear during one interval and to the right ear during the other interval, while a third component of a variable time difference was delayed to the opposite side as the tone pair. Subjects reported single intracranial images during each interval, and the data showed that interaural delays of one component to one ear could be offset by interaural delays of the other two components to the other ear. In the final experiment, threshold interaural delays were measured for five-tone complexes in which one, two, three, four, or five components were delayed. The center frequency was 750 Hz and delta f was fixed at 100 Hz. Thresholds decreased in a linear fashion as the number of delayed components increased, falling by about a factor of 5 as the number of delayed components went from one to five. These results are consistent with spectrally synthetic binaural processing, with the lateral position of intracranial images determined by a combination of interaural information across the spectrum. These effects could be brought about by a linear combination of the outputs of frequency-specific cross-correlation networks or by a wideband cross correlation of the signals at the two ears.  相似文献   

8.
Three experiments investigated the roles of interaural time differences (ITDs) and level differences (ILDs) in spatial unmasking in multi-source environments. In experiment 1, speech reception thresholds (SRTs) were measured in virtual-acoustic simulations of an anechoic environment with three interfering sound sources of either speech or noise. The target source lay directly ahead, while three interfering sources were (1) all at the target's location (0 degrees,0 degrees,0 degrees), (2) at locations distributed across both hemifields (-30 degrees,60 degrees,90 degrees), (3) at locations in the same hemifield (30 degrees,60 degrees,90 degrees), or (4) co-located in one hemifield (90 degrees,90 degrees,90 degrees). Sounds were convolved with head-related impulse responses (HRIRs) that were manipulated to remove individual binaural cues. Three conditions used HRIRs with (1) both ILDs and ITDs, (2) only ILDs, and (3) only ITDs. The ITD-only condition produced the same pattern of results across spatial configurations as the combined cues, but with smaller differences between spatial configurations. The ILD-only condition yielded similar SRTs for the (-30 degrees,60 degrees,90 degrees) and (0 degrees,0 degrees,0 degrees) configurations, as expected for best-ear listening. In experiment 2, pure-tone BMLDs were measured at third-octave frequencies against the ITD-only, speech-shaped noise interferers of experiment 1. These BMLDs were 4-8 dB at low frequencies for all spatial configurations. In experiment 3, SRTs were measured for speech in diotic, speech-shaped noise. Noises were filtered to reduce the spectrum level at each frequency according to the BMLDs measured in experiment 2. SRTs were as low or lower than those of the corresponding ITD-only conditions from experiment 1. Thus, an explanation of speech understanding in complex listening environments based on the combination of best-ear listening and binaural unmasking (without involving sound-localization) cannot be excluded.  相似文献   

9.
The purpose of this study was to measure listeners' abilities to detect brief changes in interaural temporal disparities (ITDs) or interaural intensitive disparities (IIDs) conveyed by bursts of noise (probes) temporally and symmetrically flanked by segments of diotic or uncorrelated noise. Thresholds were measured using a four-interval, two-alternative, forced-choice adaptive task and the total duration of the bursts of noise was either 20, 40, or 100 ms. Probes were temporally centered within each burst and the durations of the probes ranged from 2 to 100 ms, depending upon the duration of the (longer) total burst of noise within which they were embedded. The results indicate that, for a given total duration of noise, there is a rapid decrease in threshold ITD or threshold IID as the duration of the probe is increased so that it occupies a larger portion of the total burst of noise. Mathematical analyses revealed that both threshold ITDs and threshold IIDs could be well accounted for by assuming that the listener processes both types of binaural cues via a single, symmetric, double-exponential temporal window. Interestingly, the shapes of the temporal windows that fit the data obtained from the human listeners resemble the shapes of the temporal windows derived by Wagner [H. Wagner, J. Comp. Physiol. A 169, 281-289 (1991)], who studied the barn owl. The time constants and relative weightings yielded temporal window functions that heavily emphasize information occurring within the very temporal center of the window. This temporal window function was found to be generalizable in the sense that it also accounts for classic data reported by Grantham and Wightman [D.W. Gratham and F.L. Wightman, J. Acoust. Soc. Am. 63, 511-523 (1978)] concerning sensitivity to dynamically changing interaural disparities.  相似文献   

10.
A commonly accepted physiological model for lateralization of low-frequency sounds by interaural time delay (ITD) stipulates that binaural comparison neurons receive input from frequency-matched channels from each ear. Here, the effects of hypothetical interaural frequency mismatches on this model are reported. For this study, the cat's auditory system peripheral to the binaural comparison neurons was represented by a neurophysiologically derived model, and binaural comparison neurons were represented by cross-correlators. The results of the study indicate that, for binaural comparison neurons receiving input from one cochlear channel from each ear, interaural CF mismatches may serve to either augment or diminish the effective difference in ipsilateral and contralateral axonal time delays from the periphery to the binaural comparison neuron. The magnitude of this increase or decrease in the effective time delay difference can be up to 400 microseconds for CF mismatches of 0.2 octaves or less for binaural neurons with CFs between 250 Hz and 2.5 kHz. For binaural comparison neurons with nominal CFs near 500 Hz, the 25-microsecond effective time delay difference caused by a 0.012-octave CF mismatch is equal to the ITD previously shown to be behaviorally sufficient for the cat to lateralize a low-frequency sound source.  相似文献   

11.
12.
Two experiments are presented that measure the acuity of binaural processing of modulated interaural level differences (ILDs) using psychoacoustic methods. In both experiments, dynamic ILDs were created by imposing an interaurally antiphasic sinusoidal amplitude modulation (AM) signal on high-frequency carriers, which were presented over headphones. In the first experiment, the sensitivity to dynamic ILDs was measured as a function of the modulation frequency using puretone, and interaurally correlated and uncorrelated narrow-band noise carriers. The intrinsic interaural level fluctuations of the uncorrelated noise carriers raised the ILD modulation detection thresholds with respect to the pure-tone carriers. The diotic fluctuations of the correlated noise carriers also caused a small increase in the thresholds over the pure-tone carriers, particularly with low ILD modulation frequencies. The second experiment investigated the modulation frequency selectivity in dynamic ILD processing by imposing an interaurally uncorrelated bandpass noise AM masker in series with the interaurally antiphasic AM signal on a pure-tone carrier. By varying the masker center frequencies relative to the signal modulation frequency, broadly tuned, bandpass-shaped patterns were obtained. Simulations with an existing binaural model show that a low-pass filter to limit the binaural temporal resolution is not sufficient to predict the results of the experiments.  相似文献   

13.
Listeners' sensitivity to interaural correlation of the envelope of high-frequency waveforms and whether such sensitivity might account for detectability in a masking-level difference paradigm were assessed. Thresholds of interaural envelope decorrelation (from a reference correlation of 1.0) were measured for bands of noise centered at 4 kHz and bandwidths ranging from 50-1600 Hz. Decorrelation of the envelope was achieved by "mixing" two independent narrow-band noises. Separately, with the same listeners, NoSo and NoS pi detection thresholds were measured for maskers of the same center frequency and bandwidths. For bandwidths of noise up to about 400 Hz, listeners were similarly sensitive to interaural decorrelation in both types of task. However, for bandwidths greater than 400 Hz or so, while sensitivity in the discrimination task was unaffected, sensitivity was reduced in the NoS pi conditions. Additional data suggested that listeners were able to maintain their sensitivity independent of bandwidth in the discrimination task by focusing on binaural information within select spectral regions of the stimuli.  相似文献   

14.
Spatial release from masking was studied in a three-talker soundfield listening experiment. The target talker was presented at 0 degrees azimuth and the maskers were either colocated or symmetrically positioned around the target, with a different masker talker on each side. The symmetric placement greatly reduced any "better ear" listening advantage. When the maskers were separated from the target by +/-15 degrees , the average spatial release from masking was 8 dB. Wider separations increased the release to more than 12 dB. This large effect was eliminated when binaural cues and perceived spatial separation were degraded by covering one ear with an earplug and earmuff. Increasing reverberation in the room increased the target-to-masker ratio (TM) for the separated, but not colocated, conditions reducing the release from masking, although a significant advantage of spatial separation remained. Time reversing the masker speech improved performance in both the colocated and spatially separated cases but lowered TM the most for the colocated condition, also resulting in a reduction in the spatial release from masking. Overall, the spatial tuning observed appears to depend on the presence of interaural differences that improve the perceptual segregation of sources and facilitate the focus of attention at a point in space.  相似文献   

15.
利用相空间重构时间延迟特性进行语音音质分析   总被引:1,自引:0,他引:1       下载免费PDF全文
在简要分析相空间重构时间延迟计算方法的基础上,利用复自相关时间延迟计算方法进行了语音信号相空间重构时间延迟的计算分析,通过对比不同信噪比下复自相关-时间延迟曲线,提出了利用时间延迟参数进行语音音质评估的方法,利用BP网络对大量试验数据进行建模,通过统计分析认为该方法具有一定的可行性。  相似文献   

16.
17.
18.
This study introduces a new test (CRISP-Jr.) for measuring speech intelligibility and spatial release from masking (SRM) in young children ages 2.5-4 years. Study 1 examined whether thresholds, masking, and SRM obtained with a test designed for older children (CRISP) and CRISP-Jr. are comparable in 4 to 5-year-old children. Thresholds were measured for target speech in front, in quiet, and with a different-sex masker either in front or on the right. CRISP-Jr. yielded higher speech reception thresholds (SRTs) than CRISP, but the amount of masking and SRM did not differ across the tests. In study 2, CRISP-Jr. was extended to a group of 3-year-old children. Results showed that while SRTs were higher in the younger group, there were no age differences in masking and SRM. These findings indicate that children as young as 3 years old are able to use spatial cues in sound source segregation, which suggests that some of the auditory mechanisms that mediate this ability develop early in life. In addition, the findings suggest that measures of SRM in young children are not limited to a particular set of stimuli. These tests have potentially useful applications in clinical settings, where bilateral fittings of amplification devices are evaluated.  相似文献   

19.
20.
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号