Similar articles
1.
The use of across-frequency timing cues and the effect of disrupting these cues were examined across the frequency spectrum by introducing between-band asynchronies to pairs of narrowband temporal speech patterns. Sentence intelligibility by normal-hearing listeners fell when as little as 12.5 ms of asynchrony was introduced and was reduced to floor values by 100 ms. Disruptions to across-frequency timing had similar effects in the low-, mid-, and high-frequency regions, but band pairs having wider frequency separation were less disrupted by small amounts of asynchrony. In experiment 2, it was found that the disruptive influence of asynchrony on adjacent band pairs did not result from disruptions to the complex patterns present in overlapping excitation. The results of experiment 3 suggest that the processing of speech patterns may take place using mechanisms having different sensitivities to exact timing, similar to the dual mechanisms proposed for within- and across-channel gap detection. Preservation of relative timing can be critical to intelligibility. While the use of across-frequency timing cues appears similar across the spectrum, it may differ based on frequency separation. This difference appears to involve a greater reliance on exact timing during the processing of speech energy at proximate frequencies.

2.
The masking level difference (MLD) for a narrowband noise masker is associated with marked individual differences. This pair of studies examines factors that might account for these individual differences. Experiment 1 estimated the MLD for a 50 Hz wide band of masking noise centered at 500 or 2000 Hz, gated on for 400 ms. Tonal signals were either brief (15 ms) or long (200 ms), and brief signals were coincident with either a dip or peak in the masker envelope. Experiment 2 estimated the MLD for both signal and masker consisting of a 50 Hz wide bandpass noise centered on 500 Hz. Signals were generated to provide only interaural phase cues, only interaural level cues, or both. The pattern of individual differences was dominated by variability in NoSπ thresholds, and NoSπ thresholds were highly correlated across all conditions. Results suggest that the individual differences observed in Experiment 1 were not primarily driven by differences in the use of binaural fine structure cues or in binaural temporal resolution. The range of thresholds obtained for a brief NoSπ tonal signal at 500 Hz was consistent with a model based on normalized interaural correlation. This model was not consistent with results for analogous conditions at 2000 Hz.

3.
The present study sought to clarify the role of non-simultaneous masking in the binaural masking level difference for maskers that fluctuate in level. In the first experiment the signal was a brief 500-Hz tone, and the masker was a bandpass noise (100-2000 Hz), with the initial and final 200-ms bursts presented at 40-dB spectrum level and the inter-burst gap presented at 20-dB spectrum level. Temporal windows were fitted to thresholds measured for a range of gap durations and signal positions within the gap. In the second experiment, individual differences in out of phase (NoSπ) thresholds were compared for a brief signal in a gapped bandpass masker, a brief signal in a steady bandpass masker, and a long signal in a narrowband (50-Hz-wide) noise masker. The third experiment measured brief tone detection thresholds in forward, simultaneous, and backward masking conditions for a 50- and for a 1900-Hz-wide noise masker centered on the 500-Hz signal frequency. Results are consistent with comparable temporal resolution in the in phase (NoSo) and NoSπ conditions and no effect of temporal resolution on individual observers' ability to utilize binaural cues in narrowband noise. The large masking release observed for a narrowband noise masker may be due to binaural masking release from non-simultaneous, informational masking.

4.
Listeners have a remarkable ability to localize and identify sound sources in reverberant environments. The term "precedence effect" (PE; also known as the "Haas effect," "law of the first wavefront," and "echo suppression") refers to a group of auditory phenomena that is thought to be related to this ability. Traditionally, three measures have been used to quantify the PE: (1) Fusion: at short delays (1-5 ms for clicks) the lead and lag perceptually fuse into one auditory event; (2) Localization dominance: the perceived location of the leading source dominates that of the lagging source; and (3) Discrimination suppression: at short delays, changes in the location or interaural parameters of the lag are difficult to discriminate compared with changes in characteristics of the lead. Little is known about the relation among these aspects of the PE, since they are rarely studied in the same listeners. In the present study, extensive measurements of these phenomena were made for six normal-hearing listeners using 1-ms noise bursts. The results suggest that, for clicks, fusion lasts 1-5 ms; by 5 ms most listeners hear two sounds on a majority of trials. However, localization dominance and discrimination suppression remain potent for delays of 10 ms or longer. Results are consistent with a simple model in which information from the lead and lag interacts perceptually and in which the strength of this interaction decreases with spatiotemporal separation of the lead and lag. At short delays, lead and lag both contribute to spatial perception, but the lead dominates (to the extent that only one position is ever heard). At the longest delays tested, two distinct sounds are perceived (as measured in a fusion task), but they are not always heard at independent spatial locations (as measured in a localization dominance task). These results suggest that directional cues from the lag are not necessarily salient for all conditions in which the lag is subjectively heard as a separate event.

5.
The effect of spatial separation on the ability of human listeners to resolve a pair of concurrent broadband sounds was examined. Stimuli were presented in a virtual auditory environment using individualized outer ear filter functions. Subjects were presented with two simultaneous noise bursts that were either spatially coincident or separated (horizontally or vertically), and responded as to whether they perceived one or two source locations. Testing was carried out at five reference locations on the audiovisual horizon (0 degrees, 22.5 degrees, 45 degrees, 67.5 degrees, and 90 degrees azimuth). Results from experiment 1 showed that at more lateral locations, a larger horizontal separation was required for the perception of two sounds. The reverse was true for vertical separation. Furthermore, it was observed that subjects were unable to separate stimulus pairs if they delivered the same interaural differences in time (ITD) and level (ILD). These findings suggested that the auditory system exploited differences in one or both of the binaural cues to resolve the sources, and could not use monaural spectral cues effectively for the task. In experiments 2 and 3, separation of concurrent noise sources was examined upon removal of low-frequency content (and ITDs), onset/offset ITDs, both of these in conjunction, and all ITD information. While onset and offset ITDs did not appear to play a major role, differences in ongoing ITDs were robust cues for separation under these conditions, including those in the envelopes of high-frequency channels.

6.
Cochlear filtering results in earlier responses to high than to low frequencies. This study examined potential perceptual correlates of cochlear delays by measuring the perception of relative timing between tones of different frequencies. A brief 250-Hz tone was combined with a brief 1-, 2-, 4-, or 6-kHz tone. Two experiments were performed, one involving subjective judgments of perceived synchrony, the other involving asynchrony detection and discrimination. The functions relating the proportion of "synchronous" responses to the delay between the tones were similar for all tone pairs. Perceived synchrony was maximal when the tones in a pair were gated synchronously. The perceived-synchrony function slopes were asymmetric, being steeper on the low-frequency-leading side. In the second experiment, asynchrony-detection thresholds were lower for low-frequency rather than for high-frequency leading pairs. In contrast with previous studies, but consistent with the first experiment, thresholds did not depend on frequency separation between the tones, perhaps because of the elimination of within-channel cues. The results of the two experiments were related quantitatively using a decision-theoretic model, and were found to be highly correlated. Overall the results suggest that frequency-dependent cochlear group delays are compensated for at higher processing stages, resulting in veridical perception of timing relationships across frequency.

7.
Narrow-band sound localization related to external ear acoustics.
Human subjects localized brief 1/6-oct bandpassed noise bursts that were centered at 6, 8, 10, and 12 kHz. All testing was done under binaural conditions. The horizontal component of subjects' responses was accurate, comparable to that for broadband localization, but the vertical and front/back components exhibited systematic errors. Specifically, responses tended to cluster within restricted ranges that were specific for each center frequency. The directional transfer functions of the subjects' external ears were measured for 360 horizontal and vertical locations. The spectra of the sounds that were present in the subjects' ear canals, the "proximal stimulus" spectra, were computed by combining the spectra of the narrow-band sound sources with the directional transfer functions for particular stimulus locations. Subjects consistently localized sounds to regions within which the associated directional transfer function correlated most closely with the proximal stimulus spectrum. A quantitative model was constructed that successfully predicted subjects' responses based on interaural level difference and spectral cues. A test of the model, using techniques adapted from signal detection theory, indicated that subjects tend to use interaural level difference and spectral shape cues independently, limited only by a slight spatial correlation of the two cues. A testing procedure is described that provides a quantitative comparison of various predictive models of sound localization.

8.
Spatial unmasking describes the improvement in the detection or identification of a target sound afforded by separating it spatially from simultaneous masking sounds. This effect has been studied extensively for speech intelligibility in the presence of interfering sounds. In the current study, listeners identified zebra finch song, which shares many acoustic properties with speech but lacks semantic and linguistic content. Three maskers with the same long-term spectral content but different short-term statistics were used: (1) chorus (combinations of unfamiliar zebra finch songs), (2) song-shaped noise (broadband noise with the average spectrum of chorus), and (3) chorus-modulated noise (song-shaped noise multiplied by the broadband envelope from a chorus masker). The amount of masking and spatial unmasking depended on the masker and there was evidence of release from both energetic and informational masking. Spatial unmasking was greatest for the statistically similar chorus masker. For the two noise maskers, there was less spatial unmasking and it was wholly accounted for by the relative target and masker levels at the acoustically better ear. The results share many features with analogous results using speech targets, suggesting that spatial separation aids in the segregation of complex natural sounds through mechanisms that are not specific to speech.

9.
The fidelity of reproducing free-field sounds using a virtual auditory display was investigated in two experiments. In the first experiment, listeners directly compared stimuli from an actual loudspeaker in the free field with those from small headphones placed in front of the ears. Headphone stimuli were filtered using head-related transfer functions (HRTFs), recorded while listeners were wearing the headphones, in order to reproduce the pressure signatures of the free-field sounds at the eardrum. Discriminability was investigated for six sound-source positions using broadband noise as a stimulus. The results show that the acoustic percepts of real and virtual sounds were identical. In the second experiment, discrimination between virtual sounds generated with measured and interpolated HRTFs was investigated. Interpolation was performed using HRTFs measured for loudspeaker positions with different spatial resolutions. Broadband noise bursts with flat and scrambled spectra were used as stimuli. The results indicate that, for a spatial resolution of about 6 degrees, the interpolation does not introduce audible cues. For resolutions of 20 degrees or more, the interpolation introduces audible cues related to timbre and position. For intermediate resolutions (10-15 degrees) the data suggest that only timbre cues were used.

10.
Vibrotactile thresholds for the detection of a 50-ms vibratory stimulus on the thenar eminence of the hand were measured in the presence of and in the absence of a 700-ms suprathreshold vibratory masking stimulus. When thresholds were measured in the presence of the masking stimulus, stimulus onset asynchrony (SOA) was varied so that backward, simultaneous, and forward masking could be measured. The amount of masking, expressed as threshold shift, was greatest when the test stimulus was presented near the onset or offset of the masking stimulus. For both backward and forward masking, the amount of masking decreased as a function of increasing stimulus onset asynchrony. Comparisons were made of the amounts of masking measured when the test and masking stimuli were both sinusoids, and when the test stimulus was a sinusoid and the masking stimulus was noise. In all conditions, the masked threshold decreased approximately 4.0 dB when SOA was increased from 100 to 650 ms with reference to the onset of the 700-ms masking stimulus. More simultaneous masking was observed when sinusoidal test stimuli were detected in the presence of noise than when they were detected in the presence of sinusoidal maskers of the same frequency. The functions were essentially identical for detection of a low-frequency (20 Hz) test stimulus mediated by a non-Pacinian channel and detection of a high-frequency (250 Hz) test stimulus mediated by the Pacinian channel.

11.
The threshold for a signal masked by a narrow band of noise centered at the signal frequency (the on-frequency band) may be reduced by adding to the masker a second band of noise (the flanking band) whose envelope is correlated with that of the first band. This effect is called comodulation masking release (CMR). These experiments examine two questions. (1) How does the CMR vary with the number and ear of presentation of the flanking band(s)? (2) Is it possible to obtain a CMR when a binaural masking level difference (BMLD) is already present, and vice versa? Thresholds were measured for a 400-ms signal in a continuous 25-Hz-wide noise centered at signal frequencies (fs) of 250, 1000, and 4000 Hz. This masker was presented either alone or with one or more continuous flanking bands whose envelopes were either correlated or uncorrelated with that of the on-frequency band; their frequencies ranged from 0.5fs to 1.5fs. CMRs were measured for six conditions in which the signal, the on-frequency band, and the flanking band(s) were presented in various monaural and binaural combinations. When a single flanking band was used, the CMR was typically around 2-3 dB. The CMR increased to 5-6 dB if an additional flanking band was added. The effect of the additional band was similar whether it was in the same ear as the original band or in the opposite ear. At the lowest signal frequency, a large CMR was observed in addition to a BMLD and vice versa. At the highest signal frequency, the extra release from masking was small. The results are interpreted in terms of the cues producing the CMR and the BMLD.

12.
Thresholds for 10-ms sinusoids simultaneously masked by bursts of bandpass noise centered on the signal frequency were measured for a wide range of signal frequencies and noise levels. Thresholds were defined as the signal power relative to the masker power at the output of an auditory filter centered on the signal frequency. It was found that the presentation of a continuous random noise, with a spectral notch centered on the signal frequency, produced a reduction in signal thresholds of up to 11 dB. A notched noise spectrum level of 0-5 dB above that of the masker proved most effective in producing a masking release, as measured by a reduction in masked threshold. A release from masking of up to 7 dB could be obtained with a continuous bandpass noise. The most effective spectrum level of this noise was 5 dB below that of the masker. The effect of the continuous notched noise was to reduce signal-to-masker ratios at threshold to about 0 dB, regardless of the threshold in the absence of continuous noise. Thus the greatest release from masking occurred when "unreleased" thresholds were highest. The release from masking is almost complete within 320 ms of notched noise onset, and persists for about 160 ms after notched noise offset, regardless of notched noise level. The phenomenon is similar in many ways to the "overshoot" effect reported by Zwicker [J. Acoust. Soc. Am. 37, 653-663 (1965)]. It is argued that both effects can be largely attributed to peripheral short-term adaptation, a mechanism which is also believed to be involved in forward masking.

13.
The present investigation assessed the simultaneous and temporal masking produced by computer-generated synthetic vowels. Two durations (100 and 200 ms) of each of four vowel-like maskers were employed. The masker was presented at 70 dB SPL. The probe signals were three filtered noise bursts whose spectral distributions corresponded to regions of high spectral energy in three English stop consonants. Quiet and masked thresholds were determined using the method of adjustment. Data are reported for two experienced listeners who participated in all the listening conditions. The results were generally in accord with the results of masking experiments using nonspeech signals in that both the frequency specificity of masking and temporal masking effects were demonstrated.

14.
Two experiments compared the effect of supplying visual speech information (e.g., lipreading cues) on the ability to hear one female talker's voice in the presence of steady-state noise or a masking complex consisting of two other female voices. In the first experiment intelligibility of sentences was measured in the presence of the two types of maskers with and without perceived spatial separation of target and masker. The second study tested detection of sentences in the same experimental conditions. Results showed that visual cues provided more benefit for both recognition and detection of speech when the masker consisted of other voices (versus steady-state noise). Moreover, visual cues provided greater benefit when the target speech and masker were spatially coincident versus when they appeared to arise from different spatial locations. The data obtained here are consistent with the hypothesis that lipreading cues help to segregate a target voice from competing voices, in addition to the established benefit of supplementing masked phonetic information.

15.
Experiment 1 examined detection and discrimination of monaural four-tone sequences composed of 400-, 500-, and 625-Hz sinusoids. In the baseline conditions, the masker was monaural and composed of 25-Hz-wide bands of random noise centered on 320, 400, 500, 625, and 781 Hz. In the binaural masking release conditions, the noise was presented diotically. In the monaural masking release conditions, the noise was presented to the same ear as the signal, but it was comodulated. Tones had half-amplitude durations of 30, 60, or 150 ms. There was no delay between successive tones, so the rate of frequency change depended on tone duration. Listeners discriminated between sequences composed of 500-400-625-500 Hz and 500-625-400-500 Hz. Discrimination results were poor for rapid sequences in both monaural and binaural masking release conditions relative to baseline conditions. Results from experiment 2 indicated that poor discrimination for rapid sequences could also occur in the baseline conditions, provided that the frequency separation among tonal components was small. Sluggish processing in the present paradigm was not restricted to conditions relying on binaural cues. It is argued that sluggishness may reflect a long temporal window in monaural and binaural masking release conditions or an interaction between poor cue quality and task difficulty.

16.
Zurek [P. M. Zurek, J. Acoust. Soc. Am. Suppl. 1 78, S18 (1985)] noted what he termed "spectral dominance" in sensitivity to interaural delay for broadband stimuli. He found that interaural delays presented solely within high-frequency spectral regions were difficult, if not impossible, to detect in the presence of spectrally flanking, gated, diotic noise. In order to see if spectral dominance is a general result of the processing of interaural delays in broadband stimuli, similar experiments were conducted utilizing both gated and continuous flanking noises that were interaurally identical (diotic) or completely uncorrelated. Beyond replicating Zurek's basic findings, the data strongly suggest that the processing of interaural delays was largely unaffected when the flanking sounds were continuous and diotic. When the flanking sounds were interaurally uncorrelated, sensitivity was affected, but not drastically, for both gated and continuous conditions. Consequently, it appears that any inability to cope with conflicting interaural cues across spectral regions may be observed only under restricted conditions.

17.
This study examined spatial release from masking (SRM) when a target talker was masked by competing talkers or by other types of sounds. The focus was on the role of interaural time differences (ITDs) and time-varying interaural level differences (ILDs) under conditions varying in the strength of informational masking (IM). In the first experiment, a target talker was masked by two other talkers that were either colocated with the target or were symmetrically spatially separated from the target with the stimuli presented through loudspeakers. The sounds were filtered into different frequency regions to restrict the available interaural cues. The largest SRM occurred for the broadband condition followed by a low-pass condition. However, even the highest frequency bandpass-filtered condition (3-6 kHz) yielded a significant SRM. In the second experiment the stimuli were presented via earphones. The listeners identified the speech of a target talker masked by one or two other talkers or noises when the maskers were colocated with the target or were perceptually separated by ITDs. The results revealed a complex pattern of masking in which the factors affecting performance in colocated and spatially separated conditions are to a large degree independent.

18.
This study explored the ability of blind and sighted listeners to detect reflections, "echoes", of burst trains or continuous noise. Echo detection was compared by presenting 5 ms bursts, at rates from 1 to 64 bursts, and a continuous white noise, all during 500 ms. Sounds were recorded in an ordinary room through an artificial binaural head, the loudspeaker 1 m behind it. The reflecting object was an aluminum disk, diameter 0.5 m, placed at 1 m. The sounds were presented to 12 blind and 26 sighted participants in a laboratory using a 2-Alternative-Forced-Choice methodology. The task was to detect which of two sounds contained an echo. In Experiment 2, sounds for a 1.5 m distance were presented to the blind participants only. At 1 m, detection for the blind increased up to 64 bursts/500 ms, but for the sighted up to 32 bursts. At 1.5 m, the peak performance for the blind was at 32 bursts. At the 1 m distance, but not at the 1.5 m distance, the blind performed best with continuous white noise. The overlap in time of signal and echo at 1 m for 64 bursts was 60%, but at 1.5 m 82%. Avoiding an overlap between emitted bursts and returning echoes seems important for echolocation, indicating that an acoustic gaze, analogous to that in echolocating animals, may also exist in humans.
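The overlap percentages quoted in this abstract can be roughly reproduced with a small timing calculation. The sketch below is only an illustration under stated assumptions, not the authors' method: it assumes a speed of sound of 343 m/s, evenly spaced burst onsets across the 500 ms window, and an echo that trails the direct sound by the round trip between head and disk (2 x distance). The function name and parameters are hypothetical.

```python
SPEED_OF_SOUND_M_S = 343.0  # assumed value; not stated in the abstract


def echo_overlap_fraction(distance_m, n_bursts, burst_ms=5.0, window_ms=500.0,
                          c=SPEED_OF_SOUND_M_S):
    """Fraction of one echoed burst that coincides in time with emitted bursts.

    Assumes burst onsets are evenly spaced (window_ms / n_bursts apart) and
    that the echo lags the direct sound by the extra path 2 * distance_m.
    """
    period_ms = window_ms / n_bursts
    delay_ms = 2.0 * distance_m / c * 1000.0
    echo_start, echo_end = delay_ms, delay_ms + burst_ms

    overlap_ms = 0.0
    k = 0
    # Walk over emitted bursts that could still intersect the echo interval.
    while k * period_ms < echo_end:
        start = max(echo_start, k * period_ms)
        end = min(echo_end, k * period_ms + burst_ms)
        overlap_ms += max(0.0, end - start)
        k += 1
    return overlap_ms / burst_ms


if __name__ == "__main__":
    for d in (1.0, 1.5):
        print(d, round(echo_overlap_fraction(d, 64), 2))
```

Under these assumptions the 64-burst condition gives an overlap of about 0.60 at 1 m and about 0.81 at 1.5 m, close to the 60% and 82% reported; the small difference at 1.5 m presumably reflects a slightly different speed of sound or geometry in the original recordings.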

19.
Illusory continuity of tonal and infratonal periodic sounds
Temporal induction can restore masked or obliterated portions of signals so that tones may seem continuous when alternated with sounds having appropriate spectral composition and intensity. The upper intensity limits for the induction of tones (pulsation thresholds) are related to masking functions and have been used to define the characteristics of frequency domain (place) analysis of tones. The present study has found that induction also occurs for infratonal periodic sounds that require a time domain analysis for perception of acoustic repetition. Limits for temporal induction were determined for iterated frozen noise segments from 10-2000 Hz alternated with a louder on-line noise. Masked thresholds were also obtained for the pulsed signals presented along with continuous noise, and it was found that the relation between induction limits and masking changed with frequency. The results obtained for induction and masking are discussed in terms of general principles governing restoration of obliterated sounds.

20.
A series of experiments evaluated the effects of broadband noise (ipsilateral) on wave V of the brainstem auditory evoked response (BAER) elicited by tone bursts or clicks in the presence of high-pass masking noise. Experiment 1 used 1000- and 4000-Hz, 60-dB nHL tone bursts in the presence of broadband noise. With increasing noise level, wave V latency shift was greater for the 1000-Hz tone bursts, while amplitude decrements were similar for both tone-burst frequencies. Experiment 2 varied high-pass masker cutoff frequency and the level of subtotal masking in the presence of 50-dB nHL clicks. The effects of subtotal masking on wave V (increase in latency and decrease in amplitude) increased with increasing derived-band frequency. Experiment 3 covaried high-pass masker cutoff frequency and subtotal masking level for 1000- and 4000-Hz tone-burst stimuli. The effect of subtotal masking on wave V latency was reduced for both tone-burst frequencies when the response-generating region of the cochlear partition was limited by high-pass maskers. The results of these three experiments suggest that most of the wave V latency shift associated with increasing levels of broadband noise is mediated by a place mechanism when the stimulus is a moderate intensity (60 dB nHL), low-frequency (1000 Hz) tone burst. However, the interpretation of the latency shifts produced by broadband noise for 4000-Hz tone-burst stimuli is made more complex by multiple technical factors discussed herein.
