期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

1.

Spectral and threshold effects on recognition of speech at higher-than-normal levels

Dubno JR Horwitz AR Ahlstrom JB 《The Journal of the Acoustical Society of America》2006,120(1):310-320

To examine spectral and threshold effects for speech and noise at high levels, recognition of nonsense syllables was assessed for low-pass-filtered speech and speech-shaped maskers and high-pass-filtered speech and speech-shaped maskers at three speech levels, with signal-to-noise ratio held constant. Subjects were younger adults with normal hearing and older adults with normal hearing but significantly higher average quiet thresholds. A broadband masker was always present to minimize audibility differences between subject groups and across presentation levels. For subjects with lower thresholds, the declines in recognition of low-frequency syllables in low-frequency maskers were attributed to nonlinear growth of masking which reduced "effective" signal-to-noise ratio at high levels, whereas the decline for subjects with higher thresholds was not fully explained by nonlinear masking growth. For all subjects, masking growth did not entirely account for declines in recognition of high-frequency syllables in high-frequency maskers at high levels. Relative to younger subjects with normal hearing and lower quiet thresholds, older subjects with normal hearing and higher quiet thresholds had poorer consonant recognition in noise, especially for high-frequency speech in high-frequency maskers. Age-related effects on thresholds and task proficiency may be determining factors in the recognition of speech in noise at high levels. 相似文献

2.

Recognition of filtered words in noise at higher-than-normal levels: decreases in scores with and without increases in masking

Dubno JR Horwitz AR Ahlstrom JB 《The Journal of the Acoustical Society of America》2005,118(2):923-933

To examine spectral effects on declines in speech recognition in noise at high levels, word recognition for 18 young adults with normal hearing was assessed for low-pass-filtered speech and speech-shaped maskers or high-pass-filtered speech and speech-shaped maskers at three speech levels (70, 77, and 84 dB SPL) for each of three signal-to-noise ratios (+8, +3, and -2 dB). An additional low-level noise produced equivalent masked thresholds for all subjects. Pure-tone thresholds were measured in quiet and in all maskers. If word recognition was determined entirely by signal-to-noise ratio, and was independent of signal levels and the spectral content of speech and maskers, scores should remain constant with increasing level for both low- and high-frequency speech and maskers. Recognition of low-frequency speech in low-frequency maskers and high-frequency speech in high-frequency maskers decreased significantly with increasing speech level when signal-to-noise ratio was held constant. For low-frequency speech and speech-shaped maskers, the decline was attributed to nonlinear growth of masking which reduced the "effective" signal-to-noise ratio at high levels, similar to previous results for broadband speech and speech-shaped maskers. Masking growth and reduced "effective" signal-to-noise ratio accounted for some but not all the decline in recognition of high-frequency speech in high-frequency maskers. 相似文献

3.

Benefit of modulated maskers for speech recognition by younger and older adults with normal hearing

Dubno JR Horwitz AR Ahlstrom JB 《The Journal of the Acoustical Society of America》2002,111(6):2897-2907

To assess age-related differences in benefit from masker modulation, younger and older adults with normal hearing but not identical audiograms listened to nonsense syllables in each of two maskers: (1) a steady-state noise shaped to match the long-term spectrum of the speech, and (2) this same noise modulated by a 10-Hz square wave, resulting in an interrupted noise. An additional low-level broadband noise was always present which was shaped to produce equivalent masked thresholds for all subjects. This minimized differences in speech audibility due to differences in quiet thresholds among subjects. An additional goal was to determine if age-related differences in benefit from modulation could be explained by differences in thresholds measured in simultaneous and forward maskers. Accordingly, thresholds for 350-ms pure tones were measured in quiet and in each masker; thresholds for 20-ms signals in forward and simultaneous masking were also measured at selected signal frequencies. To determine if benefit from modulated maskers varied with masker spectrum and to provide a comparison with previous studies, a subgroup of younger subjects also listened in steady-state and interrupted noise that was not spectrally shaped. Articulation index (AI) values were computed and speech-recognition scores were predicted for steady-state and interrupted noise; predicted benefit from modulation was also determined. Masked thresholds of older subjects were slightly higher than those of younger subjects; larger age-related threshold differences were observed for short-duration than for long-duration signals. In steady-state noise, speech recognition for older subjects was poorer than for younger subjects, which was partially attributable to older subjects' slightly higher thresholds in these maskers. In interrupted noise, although predicted benefit was larger for older than younger subjects, scores improved more for younger than for older subjects, particularly at the higher noise level. This may be related to age-related increases in thresholds in steady-state noise and in forward masking, especially at higher frequencies. Benefit of interrupted maskers was larger for unshaped than for speech-shaped noise, consistent with AI predictions. 相似文献

4.

Use of context by young and aged adults with normal hearing

Dubno JR Ahlstrom JB Horwitz AR 《The Journal of the Acoustical Society of America》2000,107(1):538-546

Word recognition in sentences with and without context was measured in young and aged subjects with normal but not identical audiograms. Benefit derived from context by older adults has been obscured, in part, by the confounding effect of even mildly elevated thresholds, especially as listening conditions vary in difficulty. This problem was addressed here by precisely controlling signal-to-noise ratio across conditions and by accounting for individual differences in signal-to-noise ratio. Pure-tone thresholds and word recognition were measured in quiet and threshold-shaped maskers that shifted quiet thresholds by 20 and 40 dB. Word recognition was measured at several speech levels in each condition. Threshold was defined as the speech level (or signal-to-noise ratio) corresponding to the 50 rau point on the psychometric function. As expected, thresholds and slopes of psychometric functions were different for sentences with context compared to those for sentences without context. These differences were equivalent for young and aged subjects. Individual differences in word recognition among all subjects, young and aged, were accounted for by individual differences in signal-to-noise ratio. With signal-to-noise ratio held constant, word recognition for all subjects remained constant or decreased only slightly as speech and noise levels increased. These results suggest that, given equivalent speech audibility, older and younger listeners derive equivalent benefit from context. 相似文献

5.

Relative contribution of target and masker temporal fine structure to the unmasking of consonants in noise

Apoux F Healy EW 《The Journal of the Acoustical Society of America》2011,130(6):4044-4052

The present study assessed the relative contribution of the "target" and "masker" temporal fine structure (TFS) when identifying consonants. Accordingly, the TFS of the target and that of the masker were manipulated simultaneously or independently. A 30 band vocoder was used to replace the original TFS of the stimuli with tones. Four masker types were used. They included a speech-shaped noise, a speech-shaped noise modulated by a speech envelope, a sentence, or a sentence played backward. When the TFS of the target and that of the masker were disrupted simultaneously, consonant recognition dropped significantly compared to the unprocessed condition for all masker types, except the speech-shaped noise. Disruption of only the target TFS led to a significant drop in performance with all masker types. In contrast, disruption of only the masker TFS had no effect on recognition. Overall, the present data are consistent with previous work showing that TFS information plays a significant role in speech recognition in noise, especially when the noise fluctuates over time. However, the present study indicates that listeners rely primarily on TFS information in the target and that the nature of the masker TFS has a very limited influence on the outcome of the unmasking process. 相似文献

6.

Forward- and simultaneous-masked thresholds in bandlimited maskers in subjects with normal hearing and cochlear hearing loss

Dubno JR Ahlstrom JB 《The Journal of the Acoustical Society of America》2001,110(2):1049-1057

Forward- and simultaneous-masked thresholds were measured at 0.5 and 2.0 kHz in bandpass maskers as a function of masker bandwidth and in a broadband masker with the goal of estimating psychophysical suppression. Suppression was operationally defined in two ways: (1) as a change in forward-masked threshold as a function of masker bandwidth, and (2) as a change in effective masker level with increased masker bandwidth, taking into account the nonlinear growth of forward masking. Subjects were younger adults with normal hearing and older adults with cochlear hearing loss. Thresholds decreased as a function of masker bandwidth in forward masking, which was attributed to effects of suppression; thresholds remained constant or increased slightly with increasing masker bandwidth in simultaneous masking. For subjects with normal hearing, slightly larger estimates of suppression were obtained at 2.0 kHz rather than at 0.5 kHz. For hearing-impaired subjects, suppression was reduced in regions of hearing loss. The magnitude of suppression was strongly correlated with the absolute threshold at the signal frequency, but did not vary with thresholds at frequencies remote from the signal. The results suggest that measuring forward-masked thresholds in bandlimited and broadband maskers may be an efficient psychophysical method for estimating suppression. 相似文献

7.

A release from masking by continuous, random, notched noise

R P Carlyon 《The Journal of the Acoustical Society of America》1987,81(2):418-426

Thresholds for 10-ms sinusoids simultaneously masked by bursts of bandpass noise centered on the signal frequency were measured for a wide range of signal frequencies and noise levels. Thresholds were defined as the signal power relative to the masker power at the output of an auditory filter centered on the signal frequency. It was found that the presentation of a continuous random noise, with a spectral notch centered on the signal frequency, produced a reduction in signal thresholds of up to 11 dB. A notched noise spectrum level of 0-5 dB above that of the masker proved most effective in producing a masking release, as measured by a reduction in masked threshold. A release from masking of up to 7 dB could be obtained with a continuous bandpass noise. The most effective spectrum level of this noise was 5 dB below that of the masker. The effect of the continuous notched noise was to reduce signal-to-masker ratios at threshold to about 0 dB, regardless of the threshold in the absence of continuous noise. Thus the greatest release from masking occurred when "unreleased" thresholds were highest. The release from masking is almost complete within 320 ms of notched noise onset, and persists for about 160 ms after notched noise offset, regardless of notched noise level. The phenomenon is similar in many ways to the "overshoot" effect reported by Zwicker [J. Acoust. Soc. Am. 37, 653-663 (1965)]. It is argued that both effects can be largely attributed to peripheral short-term adaptation, a mechanism which is also believed to be involved in forward masking. 相似文献

8.

The effects of compression ratio, signal-to-noise ratio, and level on speech recognition in normal-hearing listeners. 总被引：2，自引：0，他引：2

B W Hornsby T A Ricketts 《The Journal of the Acoustical Society of America》2001,109(6):2964-2973

Previous research has demonstrated reduced speech recognition when speech is presented at higher-than-normal levels (e.g., above conversational speech levels), particularly in the presence of speech-shaped background noise. Persons with hearing loss frequently listen to speech-in-noise at these levels through hearing aids, which incorporate multiple-channel, wide dynamic range compression. This study examined the interactive effects of signal-to-noise ratio (SNR), speech presentation level, and compression ratio on consonant recognition in noise. Nine subjects with normal hearing identified CV and VC nonsense syllables in a speech-shaped noise at two SNRs (0 and +6 dB), three presentation levels (65, 80, and 95 dB SPL) and four compression ratios (1:1, 2:1, 4:1, and 6:1). Stimuli were processed through a simulated three-channel, fast-acting, wide dynamic range compression hearing aid. Consonant recognition performance decreased as compression ratio increased and presentation level increased. Interaction effects were noted between SNR and compression ratio, as well as between presentation level and compression ratio. Performance decrements due to increases in compression ratio were larger at the better (+6 dB) SNR and at the lowest (65 dB SPL) presentation level. At higher levels (95 dB SPL), such as those experienced by persons with hearing loss, increasing compression ratio did not significantly affect speech intelligibility. 相似文献

9.

Psychophysical suppression effects for tonal and speech signals.

J R Dubno J B Ahlstrom 《The Journal of the Acoustical Society of America》2001,110(4):2108-2119

This experiment assessed the benefits of suppression and the impact of reduced or absent suppression on speech recognition in noise. Psychophysical suppression was measured in forward masking using tonal maskers and suppressors and band limited noise maskers and suppressors. Subjects were 10 younger and 10 older adults with normal hearing, and 10 older adults with cochlear hearing loss. For younger subjects with normal hearing, suppression measured with noise maskers increased with masker level and was larger at 2.0 kHz than at 0.8 kHz. Less suppression was observed for older than younger subjects with normal hearing. There was little evidence of suppression for older subjects with cochlear hearing loss. Suppression measured with noise maskers and suppressors was larger in magnitude and more prevalent than suppression measured with tonal maskers and suppressors. The benefit of suppression to speech recognition in noise was assessed by obtaining scores for filtered consonant-vowel syllables as a function of the bandwidth of a forward masker. Speech-recognition scores in forward maskers should be higher than those in simultaneous maskers given that forward maskers are less effective than simultaneous maskers. If suppression also mitigated the effects of the forward masker and resulted in an improved signal-to-noise ratio, scores should decrease less in forward masking as forward-masker bandwidth increased, and differences between scores in forward and simultaneous maskers should increase, as was observed for younger subjects with normal hearing. Less or no benefit of suppression to speech recognition in noise was observed for older subjects with normal hearing or hearing loss. In general, as suppression measured with tonal signals increased, the combined benefit of forward masking and suppression to speech recognition in noise also increased. 相似文献

10.

Effects of simulated cochlear-implant processing on speech reception in fluctuating maskers 总被引：7，自引：0，他引：7

Qin MK Oxenham AJ 《The Journal of the Acoustical Society of America》2003,114(1):446-454

This study investigated the effects of simulated cochlear-implant processing on speech reception in a variety of complex masking situations. Speech recognition was measured as a function of target-to-masker ratio, processing condition (4, 8, 24 channels, and unprocessed) and masker type (speech-shaped noise, amplitude-modulated speech-shaped noise, single male talker, and single female talker). The results showed that simulated implant processing was more detrimental to speech reception in fluctuating interference than in steady-state noise. Performance in the 24-channel processing condition was substantially poorer than in the unprocessed condition, despite the comparable representation of the spectral envelope. The detrimental effects of simulated implant processing in fluctuating maskers, even with large numbers of channels, may be due to the reduction in the pitch cues used in sound source segregation, which are normally carried by the peripherally resolved low-frequency harmonics and the temporal fine structure. The results suggest that using steady-state noise to test speech intelligibility may underestimate the difficulties experienced by cochlear-implant users in fluctuating acoustic backgrounds. 相似文献

11.

The effects of hearing loss on the contribution of high- and low-frequency speech information to speech understanding

Hornsby BW Ricketts TA 《The Journal of the Acoustical Society of America》2003,113(3):1706-1717

The speech understanding of persons with "flat" hearing loss (HI) was compared to a normal-hearing (NH) control group to examine how hearing loss affects the contribution of speech information in various frequency regions. Speech understanding in noise was assessed at multiple low- and high-pass filter cutoff frequencies. Noise levels were chosen to ensure that the noise, rather than quiet thresholds, determined audibility. The performance of HI subjects was compared to a NH group listening at the same signal-to-noise ratio and a comparable presentation level. Although absolute speech scores for the HI group were reduced, performance improvements as the speech and noise bandwidth increased were comparable between groups. These data suggest that the presence of hearing loss results in a uniform, rather than frequency-specific, deficit in the contribution of speech information. Measures of auditory thresholds in noise and speech intelligibility index (SII) calculations were also performed. These data suggest that differences in performance between the HI and NH groups are due primarily to audibility differences between groups. Measures of auditory thresholds in noise showed the "effective masking spectrum" of the noise was greater for the HI than the NH subjects. 相似文献

12.

Formant discrimination in noise for isolated vowels

Liu C Kewley-Port D 《The Journal of the Acoustical Society of America》2004,116(5):3119-3129

Formant discrimination for isolated vowels presented in noise was investigated for normal-hearing listeners. Discrimination thresholds for F1 and F2, for the seven American English vowels /i, I, epsilon, ae, [symbol see text], a, u/, were measured under two types of noise, long-term speech-shaped noise (LTSS) and multitalker babble, and also under quiet listening conditions. Signal-to-noise ratios (SNR) varied from -4 to +4 dB in steps of 2 dB. All three factors, formant frequency, signal-to-noise ratio, and noise type, had significant effects on vowel formant discrimination. Significant interactions among the three factors showed that threshold-frequency functions depended on SNR and noise type. The thresholds at the lowest levels of SNR were highly elevated by a factor of about 3 compared to those in quiet. The masking functions (threshold vs SNR) were well described by a negative exponential over F1 and F2 for both LTSS and babble noise. Speech-shaped noise was a slightly more effective masker than multitalker babble, presumably reflecting small benefits (1.5 dB) due to the temporal variation of the babble. 相似文献

13.

Effects of non-simultaneous masking on the binaural masking level difference

Buss E Hall Iii JW 《The Journal of the Acoustical Society of America》2011,129(2):907-919

The present study sought to clarify the role of non-simultaneous masking in the binaural masking level difference for maskers that fluctuate in level. In the first experiment the signal was a brief 500-Hz tone, and the masker was a bandpass noise (100-2000 Hz), with the initial and final 200-ms bursts presented at 40-dB spectrum level and the inter-burst gap presented at 20-dB spectrum level. Temporal windows were fitted to thresholds measured for a range of gap durations and signal positions within the gap. In the second experiment, individual differences in out of phase (NoSπ) thresholds were compared for a brief signal in a gapped bandpass masker, a brief signal in a steady bandpass masker, and a long signal in a narrowband (50-Hz-wide) noise masker. The third experiment measured brief tone detection thresholds in forward, simultaneous, and backward masking conditions for a 50- and for a 1900-Hz-wide noise masker centered on the 500-Hz signal frequency. Results are consistent with comparable temporal resolution in the in phase (NoSo) and NoSπ conditions and no effect of temporal resolution on individual observers' ability to utilize binaural cues in narrowband noise. The large masking release observed for a narrowband noise masker may be due to binaural masking release from non-simultaneous, informational masking. 相似文献

14.

The effects of spatial separation in distance on the informational and energetic masking of a nearby speech signal

Brungart DS Simpson BD 《The Journal of the Acoustical Society of America》2002,112(2):664-676

Although many studies have shown that intelligibility improves when a speech signal and an interfering sound source are spatially separated in azimuth, little is known about the effect that spatial separation in distance has on the perception of competing sound sources near the head. In this experiment, head-related transfer functions (HRTFs) were used to process stimuli in order to simulate a target talker and a masking sound located at different distances along the listener's interaural axis. One of the signals was always presented at a distance of 1 m, and the other signal was presented 1 m, 25 cm, or 12 cm from the center of the listener's head. The results show that distance separation has very different effects on speech segregation for different types of maskers. When speech-shaped noise was used as the masker, most of the intelligibility advantages of spatial separation could be accounted for by spectral differences in the target and masking signals at the ear with the higher signal-to-noise ratio (SNR). When a same-sex talker was used as the masker, the intelligibility advantages of spatial separation in distance were dominated by binaural effects that produced the same performance improvements as a 4-5-dB increase in the SNR of a diotic stimulus. These results suggest that distance-dependent changes in the interaural difference cues of nearby sources play a much larger role in the reduction of the informational masking produced by an interfering speech signal than in the reduction of the energetic masking produced by an interfering noise source. 相似文献

15.

Detection of temporal gaps in bandlimited noise: effects of variations in bandwidth and signal-to-masker ratio

M J Shailer B C Moore 《The Journal of the Acoustical Society of America》1985,77(2):635-639

Thresholds were measured for the detection of a temporal gap in a bandlimited noise signal presented in a continuous wideband masker, using an adaptive forced-choice procedure. In experiment I the ratio of signal spectrum level to masker spectrum level (the SMR) was fixed at 10 dB and gap thresholds were measured as a function of signal bandwidth at three center frequencies: 0.4, 1.0, and 6.5 kHz. Performance improved with increasing bandwidth and increasing center frequency. For a subset of conditions, gap threshold was also measured as bandwidth was varied keeping the upper cutoff frequency of the signal constant. In this case the variation of gap threshold with bandwidth was more gradual, suggesting that subjects detect the gap using primarily the highest frequency region available in the signal. At low center frequencies, however, subjects may have a limited ability to combine information in different frequency regions. In experiment II gap thresholds were measured as a function of SMR for several signal bandwidths at each of three center frequencies: 0.5, 1.0, and 6.5 kHz. Gap thresholds improved with increasing SMR, but the improvement was minimal for SMRs greater than 12-15 dB. The results are used to evaluate the relative importance of factors influencing gap threshold. 相似文献

16.

Masking of short stimuli by noises with spiked spectra: II. Time summation and frequency selectivity of hearing in a narrow frequency range

L. K. Rimskaya-Korsakova M. R. Lalayants A. Ya. Supin G. A. Tavartkiladze 《Acoustical Physics》2011,57(2):216-223

The thresholds of masking of short high-frequency pulses with either different durations (1.25–25 ms) and similar central frequency or different central frequencies (3.6–4.4 kHz) but similar durations were measured to reveal manifestations of the properties of peripheral encoding in auditory perception. Noises with a spiked amplitude spectrum structure were used as maskers. The central frequency and the frequency band of a masker were 4 and 1 kHz, respectively. The central frequencies of a stimulus and a masker being equal, the noise the central frequency of which coincided with the frequency corresponding to a dip of an indented spectrum was called an off_(rip)-frequency masker. Owing to the off_(rip)-masker, stimuli-induced masking thresholds were formed taking into account excitation in a narrow region of a basila membrane and auditory nerve fibers with characteristic frequencies from a narrow range. High-frequency pulses with an envelope in the form of the Gaussian function and sinusoidal filling were used as stimuli. At masker levels of 30 dB above the auditory threshold, frequencies of off_(rip)-masker spectra spikes of 500–2000 Hz, and a central stimulus frequency of 4 kHz, the thresholds of tonal stimuli (25 ms in duration) masking in two out of three probationers were higher than the thresholds of masking of compact stimuli (1.25 ms in duration). In the third probationer, on the contrary, the thresholds of tonal stimuli masking were lower than the thresholds of compact stimuli masking. At masker levels of 50 dB, individual threshold differences disappeared. The obtained results were interpreted in the context of implementation of different methods of auditory encoding of the intensity. The methods were based on either the average frequency of auditory nerve pulsations or the number of fibers participating in the response. The interpretation was also carried out in the context of revealing manifestations of nonlinear properties of basila membrane displacements in auditory thresholds. The fact that the dependence of detection thresholds of compact stimuli on their central frequency in one of the two probationers did not reveal the minimum in case of coincidence of off_(rip)-masker and stimulus frequencies pointed to the presence of an auditory “problem zone” that was likely to be localized at the periphery of the auditory system. 相似文献

17.

Spatial unmasking of nearby speech sources in a simulated anechoic environment

Shinn-Cunningham BG Schickler J Kopco N Litovsky R 《The Journal of the Acoustical Society of America》2001,110(2):1118-1129

Spatial unmasking of speech has traditionally been studied with target and masker at the same, relatively large distance. The present study investigated spatial unmasking for configurations in which the simulated sources varied in azimuth and could be either near or far from the head. Target sentences and speech-shaped noise maskers were simulated over headphones using head-related transfer functions derived from a spherical-head model. Speech reception thresholds were measured adaptively, varying target level while keeping the masker level constant at the "better" ear. Results demonstrate that small positional changes can result in very large changes in speech intelligibility when sources are near the listener as a result of large changes in the overall level of the stimuli reaching the ears. In addition, the difference in the target-to-masker ratios at the two ears can be substantially larger for nearby sources than for relatively distant sources. Predictions from an existing model of binaural speech intelligibility are in good agreement with results from all conditions comparable to those that have been tested previously. However, small but important deviations between the measured and predicted results are observed for other spatial configurations, suggesting that current theories do not accurately account for speech intelligibility for some of the novel spatial configurations tested. 相似文献

18.

Additivity of forward masking

D L Neff W Jesteadt 《The Journal of the Acoustical Society of America》1983,74(6):1695-1701

Masked thresholds for a 1000-Hz sinusoidal signal were measured as a function of masker level in both forward and simultaneous masking for two types of maskers: a 1000-Hz sinusoid and a narrowband noise, 60-Hz wide, centered at 1000 Hz. In forward masking, the noise masker produced much steeper growth-of-masking functions than the sinusoid. Presenting a contralateral broadband noise "cue" with the forward masker dramatically reduced the slope of masking for the noise masker but did not influence results for the sinusoidal masker. The noise remained the more effective masker. The amount of masking produced by combinations of equally effective narrowband-noise and sinusoidal maskers was compared to that produced by each masker individually with and without the contralateral cue. No additional masking beyond that predicted by energy summation was measured for forward masking. Additional masking beyond energy-sum predictions was measured for analogous conditions in simultaneous masking. Comparisons of results obtained with and without the contralateral cue suggest that signal thresholds in the presence of narrowband-noise forward maskers can reflect nonperipheral auditory processes. 相似文献

19.

Variability and uncertainty in masking by competing speech

Freyman RL Helfer KS Balakrishnan U 《The Journal of the Acoustical Society of America》2007,121(2):1040-1046

This study investigated the role of uncertainty in masking of speech by interfering speech. Target stimuli were nonsense sentences recorded by a female talker. Masking sentences were recorded from ten female talkers and combined into pairs. Listeners' recognition performance was measured with both target and masker presented from a front loudspeaker (nonspatial condition) or with a masker presented from two loudspeakers, with the right leading the front by 4 ms (spatial condition). In Experiment 1, the sentences were presented in blocks in which the masking talkers, spatial configuration, and signal-to-noise (S-N) ratio were fixed. Listeners' recognition performance varied widely among the masking talkers in the nonspatial condition, much less so in the spatial condition. This result was attributed to variation in effectiveness of informational masking in the nonspatial condition. The second experiment increased uncertainty by randomizing masking talkers and S-N ratios across trials in some conditions, and reduced uncertainty by presenting the same token of masker across trials in other conditions. These variations in masker uncertainty had relatively small effects on speech recognition. 相似文献

20.

Physiological mechanisms of psychophysical masking: observations from auditory-nerve fibers 总被引：8，自引：0，他引：8

B Delgutte 《The Journal of the Acoustical Society of America》1990,87(2):791-809

Masking might be due either to the spread of the excitation produced by the masker to the place of the tone signal along the cochlea or to the suppression of the response to the signal by the masker. In order to identify the contributions of these two mechanisms to tone-on-tone masking, masked thresholds of auditory-nerve fibers were measured in anesthetized cats using the same stimulus paradigms and detection criteria as in psychophysics. Suppressive masking was identified by comparing thresholds for simultaneous masking with those for a nonsimultaneous masking technique resembling pulsation thresholds. These nonsimultaneous thresholds do not include the contribution of suppression to masking because suppression only occurs for stimuli that overlap in time. For each masker and signal frequency, the fibers with the lowest (or "best") masked thresholds had characteristic frequencies (CF) slightly on the opposite side of the masker frequency with respect to the signal frequency, consistent with the psychophysical phenomenon of off-frequency listening. Patterns of best masked thresholds against signal frequency resembled psychophysical masking patterns in that they showed a maximum for signal frequencies close to the masker, and a skew toward high frequencies. Masking was found to be both excitatory and suppressive, with the relative contribution of the two mechanisms depending on the frequency separation between signal and masker. Suppressive masking was large for signal frequencies well above the masker. For these conditions, simultaneous thresholds grew more rapidly with masker level than did nonsimultaneous thresholds, suggesting that the upward spread of masking is largely due to the growth of suppression rather than to that of excitation. 相似文献