首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 22 毫秒
1.
Auditory processing appears to include a series of domain-specific filtering operations that include tuning in the audio-frequency domain, followed by tuning in the temporal modulation domain, and perhaps tuning in the spectral modulation domain. To explore the possibility of tuning in the spectral modulation domain, a masking experiment was designed to measure masking patterns in the spectral modulation domain. Spectral modulation transfer functions (SMTFs) were measured for modulation frequencies from 0.25 to 14 cycles/octave superimposed on noise carriers either one octave (800-1600 Hz, 6400-12,800 Hz) or six octaves wide (200-12,800 Hz). The resulting SMTFs showed maximum sensitivity to modulation between 1 and 3 cycles/octave with reduced sensitivity above and below this region. Masked spectral modulation detection thresholds were measured for masker modulation frequencies of 1, 3, and 5 cycles/octave with a fixed modulation depth of 15 dB. The masking patterns obtained for each masker frequency and carrier band revealed tuning (maximum masking) near the masker frequency, which is consistent with the theory that spectral envelope perception is governed by a series of spectral modulation channels tuned to different spectral modulation frequencies.  相似文献   

2.
Iterated rippled noise (IRN) is generated by a cascade of delay and add (the gain after the delay is 1.0) or delay and subtract (the gain is -1.0) operations. The delay and add/subtract operations impart a spectral ripple and a temporal regularity to the noise. The waveform fine structure is different in these two conditions, but the envelope can be extremely similar. Four experiments were used to determine conditions in which the processing of IRN stimuli might be mediated by the waveform fine structure or by the envelope. In experiments 1 and 3 listeners discriminated among three stimuli in a single-interval task: IRN stimuli generated with the delay and add operations (g = 1.0), IRN stimuli generated using the delay and subtract operations (g = -1.0), and a flat-spectrum noise stimulus. In experiment 2 the listeners were presented two IRN stimuli that differed in delay (4 vs 6 ms) and a flat-spectrum noise stimulus that was not an IRN stimulus. In experiments 1 and 2 both the envelope and waveform fine structure contained the spectral ripple and temporal regularity. In experiment 3 only the envelope had this spectral and temporal structure. In all experiments discrimination was determined as a function of high-pass filtering the stimuli, and listeners could discriminate between the two IRN stimuli up to frequency regions as high as 4000-6000 Hz. Listeners could discriminate the IRN stimuli from the flat-spectrum noise stimulus at even higher frequencies (as high as 8000 Hz), but these discriminations did not appear to depend on the pitch of the IRN stimuli. A control experiment (fourth experiment) suggests that IRN discriminations in high-frequency regions are probably not due entirely to low-frequency nonlinear distortion products. The results of the paper imply that pitch processing of IRN stimuli is based on the waveform fine structure.  相似文献   

3.
Fastl and Bechly [J. Acoust. Soc. Am. 74, 754-757 (1983)] reported that the threshold of a brief 900-Hz signal simultaneously masked by a band of noise. 100 Hz wide, centered at 1000 Hz, was reduced by approximately 8 dB by the addition of an 1150-Hz tone having a level of 20 dB above that of the narrow-band masker. They concluded that this decrease in threshold was a demonstration of suppression in simultaneous masking. Here it is argued that Fastl and Bechly's results simply reflect the poorer detectability of signals masked by higher-frequency fluctuating maskers (their narrow-band masker) than by relatively flat-envelope maskers (their composite narrow-band plus tonal masker). The results of three experiments support the masker-envelope explanation. In the first experiment, as in the report of Fastl and Bechly, the masker centered at 1000 Hz (M1) was a narrow-band noise and the masker centered at 1150 Hz (M2) was a tone. Fastl and Bechly's result was replicated. However, thresholds obtained when M1 was presented alone (the M1-only condition) were more affected by the starting level of the signal within each adaptive track than were thresholds obtained when M1 and M2 were presented together (the M1+M2 condition). This result paralleled a previous report that starting level influenced performance more with fluctuating than with flat-envelope maskers. For the four of seven subjects wh showed learning, there was also more improvement in the M1-only than in the M1 + M2 condition. In the second experiment, M1 was a tone and M2 was a narrow-band noise.(ABSTRACT TRUNCATED AT 250 WORDS)  相似文献   

4.
Masked thresholds are measured and simulated for bandpass-noise signals ranging in bandwidth from 4 to 256 Hz in the presence of a masking bandpass noise also ranging in bandwidth from 4 to 256 Hz. Signal and masker are centered at 2 kHz. To investigate the role of temporal processing in simultaneous masking, simulations were performed with the modulation-filterbank model by Dau et al. [J. Acoust. Soc Am. 102, 2906-2919 (1997)]. For a fixed masker bandwidth, thresholds are independent of the signal bandwidth as long as the signal bandwidth does not exceed the masker bandwidth and thresholds decrease with increasing masker bandwidth in those conditions. A simple modulation-low-pass filter (energy integrator) would be sufficient to describe the experimental results in those conditions. In contrast, the processing by a modulation filterbank is necessary to account for the conditions of "asymmetry of masking," where thresholds for signals with bandwidths larger than the masker bandwidth are much lower than those for the reversed condition. In those conditions, the modulation-filterbank model is able to use the inherent higher modulation frequencies of the signal as an additional cue.  相似文献   

5.
This paper reports on measurements of fluorescence excitation spectra of single impurity molecules of terrylene in n-decane at T=1.7 K. Spectra measured within the same spectral interval but at consecutive instants of time exhibit zero-phonon (ZP) lines of single impurity molecules of several species, differing in the behavior of the line shape and frequency in time. On the one hand, one observes stable ZP lines, well approximated with a Lorentzian. On the other, one sees spectral features with a profile varying from one spectrum to another, with only individual fragments of such a profile allowing the Lorentzian approximation; such features are interpreted assuming the presence of unstable impurity molecules, the ZP lines of which display small (a few tens of MHz) spectral jumps with a time interval of about 10 s. Such molecules exhibit a substantial decrease in the spectral jump frequency within a measurement period of the order of 5000 s, which is attributed to a decrease in the contribution due to spectral diffusion resulting from sample structure relaxation.  相似文献   

6.
On the role of spectral transition for speech perception   总被引:2,自引:0,他引:2  
This paper examines the relationship between dynamic spectral features and the identification of Japanese syllables modified by initial and/or final truncation. The experiments confirm several main points. "Perceptual critical points," where the percent correct identification of the truncated syllable as a function of the truncation position changes abruptly, are related to maximum spectral transition positions. A speech wave of approximately 10 ms in duration that includes the maximum spectral transition position bears the most important information for consonant and syllable perception. Consonant and vowel identification scores simultaneously change as a function of the truncation position in the short period, including the 10-ms period for final truncation. This suggests that crucial information for both vowel and consonant identification is contained across the same initial part of each syllable. The spectral transition is more crucial than unvoiced and buzz bar periods for consonant (syllable) perception, although the latter features are of some perceptual importance. Also, vowel nuclei are not necessary for either vowel or syllable perception.  相似文献   

7.
This paper represents a systematic investigation of detection shot noise in carrier‐envelope phase (CEP) stabilization. Numerical simulations are conducted to calculate the influence of shot noise in laser oscillators. These results are compared with experimental results for Ti:sapphire lasers. It is found that shot noise imposes a limitation for obtaining sub‐100 mrad CEP jitters. Careful interferometer design is necessary to push this limit toward 10 mrad. In contrast to oscillator stabilization, shot noise appears to play a much more restrictive role in amplifier stabilization. Using spectral interferometry together with spectral broadening in sapphire, it already appears practically challenging to reach sub‐100 mrad jitters. Adaption of the optical nonlinearity in the broadening step appears key to further improvements of the CEP jitter of amplified systems. We believe that these improvements open a perspective for currently unfeasible applications of CEP stabilized pulses. Moreover, our considerations can be easily adapted to CEP stabilization of other laser types beyond Ti:sapphire.  相似文献   

8.
"Overshoot" is a simultaneous masking phenomenon: Thresholds for short high-frequency tone bursts presented shortly after the onset of a broadband masker are raised compared to thresholds in the presence of a continuous masker. Overshoot for 2-ms bursts of a 5000-Hz test tone is described for four subjects as a function of the spectral composition and level of the masker. First, it was verified that overshoot is largely independent of masker duration. Second, overshoot was determined for a variety of 10-ms masker bursts composed of differently filtered uniform masking noise with an overall level of 60 dB SPL: unfiltered, high-pass (cutoff at 3700 Hz), low-pass (cutoff at 5700 Hz), and third-octave-band-(centered at 5000 Hz) filtered uniform masking noises presented separately or combined with different bandpass maskers (5700-16000 Hz, 5700-9500 Hz, 8400-16000 Hz) were used. Third, masked thresholds were measured for maskers composed of an upper or lower octave band adjacent to the third-octave-band masker as a function of the level of the octave band. All maskers containing components above the critical band of the test tone led to overshoot; no additional overshoot was produced by masker components below it. Typical values of overshoot were on the order of 12 dB. Overshoot saturated when masker levels were above 60 dB SPL for the upper octave-band masker. The standard neurophysiological explanation of overshoot accounts only partially for these data. Details that must be accommodated by any full explanation of overshoot are discussed.  相似文献   

9.
Previous data on the masking level difference (MLD) have suggested that NoSpi detection for a long-duration signal is dominated by signal energy occurring in masker envelope minima. This finding was expanded upon using a brief 500-Hz tonal signal that coincided with either the envelope maximum or minimum of a narrow-band Gaussian noise masker centered at 500 Hz, and data were collected at a range of masker levels. Experiment 1 employed a typical MLD stimulus, consisting of a 30-ms signal and a 50-Hz-wide masker with abrupt spectral edges, and experiment 2 used stimuli generated to eliminate possible spectral cues. Results were quite similar for the two types of stimuli. At the highest masker level the MLD for signals coinciding with masker envelope minima was substantially larger than that for signals coinciding with envelope maxima, a result that was primarily due to decreased NoSpi thresholds in masker minima. For most observers this effect was greatly reduced or eliminated at the lowest masker level. These level effects are broadly consistent with the presence of physiological background noise and with a level-dependent binaural temporal window. Comparison of these results with predictions of a published model suggest that basilar-membrane compression alone does not account for this level effect.  相似文献   

10.
The form of fluctuating hydrodynamic forces has been derived recently [R.B. Jones, Physica 105A (1981) 395] from the stochastic fluid equations of Landau and Lifshitz. We show by general physical arguments as well as by a direct calculation that such forces do not exist in the classical limit.  相似文献   

11.
A common metric of auditory temporal processing is the difference in the threshold for a pure-tone signal masked by either unmodulated or amplitude-modulated noise. This technique may be viewed as a modification of the masking period pattern technique. Such measurements have been proposed as an efficient means of estimating auditory temporal resolution in a clinical setting, although in many cases threshold differences may reflect additional spectro-temporal processes. The primary purpose of the present experiment was to examine interactions among signal frequency and masker bandwidth and the effects of modulation frequency on modified masking period patterns. The results revealed unmodulated-modulated threshold differences that increased with increasing masker bandwidth and decreased with increasing modulation frequency. There was little effect of signal frequency for narrow-band noise maskers that were equal in absolute bandwidth across frequency. However, unmodulated-modulated threshold differences increased substantially with increasing signal frequency for bandwidths proportional to the signal frequency and for wideband maskers. Although the results are interpreted in terms of a combination of both within-channel and across-channel cues, the specific contributions of these cues in particular conditions are difficult to ascertain. Because modified masking period patterns depend strongly upon a number of specific stimulus parameters, and because it is difficult to determine with any precision the underlying perceptual processes, this technique is not recommended for use as a clinical measure of auditory temporal processing.  相似文献   

12.
The present paper describes the results from two experiments which explored the spectral boundaries for the nonlinear additivity of simultaneous masking. The first experiment measured the threshold for detection of a 2-kHz tone in the presence of two 800-Hz-wide bands of noise that had varying degrees of spectral overlap with each other and the 2-kHz signal. Results revealed an abrupt transition from linear to nonlinear additivity of masking as the spectral separation between the two maskers varied from some overlap to none. The second experiment examined alternative explanations for the data. Explanations based on restricted-listening or distortion-product-detection hypotheses were not supported by the results of this experiment. These data indicate that nonlinear additivity of simultaneous masking holds for maskers that do not overlap within the critical band centered on the signal frequency. This interpretation is also consistent with a large body of data on the monaural and binaural summation (additivity) of loudness.  相似文献   

13.
Speech recognition with altered spectral distribution of envelope cues.   总被引:8,自引:0,他引:8  
Recognition of consonants, vowels, and sentences was measured in conditions of reduced spectral resolution and distorted spectral distribution of temporal envelope cues. Speech materials were processed through four bandpass filters (analysis bands), half-wave rectified, and low-pass filtered to extract the temporal envelope from each band. The envelope from each speech band modulated a band-limited noise (carrier bands). Analysis and carrier bands were manipulated independently to alter the spectral distribution of envelope cues. Experiment I demonstrated that the location of the cutoff frequencies defining the bands was not a critical parameter for speech recognition, as long as the analysis and carrier bands were matched in frequency extent. Experiment II demonstrated a dramatic decrease in performance when the analysis and carrier bands did not match in frequency extent, which resulted in a warping of the spectral distribution of envelope cues. Experiment III demonstrated a large decrease in performance when the carrier bands were shifted in frequency, mimicking the basal position of electrodes in a cochlear implant. And experiment IV showed a relatively minor effect of the overlap in the noise carrier bands, simulating the overlap in neural populations responding to adjacent electrodes in a cochlear implant. Overall, these results show that, for four bands, the frequency alignment of the analysis bands and carrier bands is critical for good performance, while the exact frequency divisions and overlap in carrier bands are not as critical.  相似文献   

14.
15.
Masking asymmetry was investigated over a wide range of stimulus intensities for two signal frequencies, fo = 1.0 and 4.0 kHz, using both fixed-masker and fixed-signal paradigms. The masker was a notched noise with the upper and lower edges of the notch, fu and fl, respectively, placed asymmetrically about fo. For various notch widths, the asymmetry of masking was measured as the difference between the masked threshold obtained when fl was nearer fo and that obtained when fu was nearer fo. For maskers with wide notches, (fu - fl)/fo greater than 0.15, masking asymmetry changed with stimulus level; at the highest level, masked threshold was greatest when fl was nearer fo, and, at the lowest level the asymmetry reversed slightly for fo = 1.0 kHz so that masked threshold was actually greater when fu was nearer fo. Nonparallel growth of masking functions reveal changes in masking asymmetry with signal level as well as with masker level. It is concluded that the nonlinear growth of masking with level is due primarily to changes in the auditory filter, rather than changes in the detector following the filter.  相似文献   

16.
This study investigated the role of the amplitude envelope in the vicinity of consonantal release in the perception of the stop-glide contrast. Three sets of acoustic [b-w] continua, each in the vowel environments [a] and [i], were synthesized using parameters derived from natural speech. In the first set, amplitude, formant frequency, and duration characteristics were interpolated between exemplar stop and glide endpoints. In the second set, formant frequency and duration characteristics were interpolated, but all stimuli were given a stop amplitude envelope. The third set was like the second, except that all stimuli were given a glide amplitude envelope. Subjects were given both forced-choice and free-identification tasks. The results of the forced-choice task indicated that amplitude cues were able to override transition slope, duration, and formant frequency cues in the perception of the stop-glide contrast. However, results from the free-identification task showed that, although presence of a stop amplitude envelope turned all stimuli otherwise labeled as glides to stops, the presence of a glide amplitude envelope changed stimuli labeled otherwise as stops to fricatives rather than to glides. These results support the view that the amplitude envelope in the vicinity of the consonantal release is a critical acoustic property for the continuant / noncontinuant contrast. The results are discussed in relation to a theory of acoustic invariance.  相似文献   

17.
18.
Effect of spectral envelope smearing on speech reception. I.   总被引:2,自引:0,他引:2  
The effect of reduced spectral contrast on the speech-reception threshold (SRT) for sentences in noise and on phoneme identification, was investigated with 16 normal-hearing subjects. Signal processing was performed by smoothing the envelope of the squared short-time fast Fourier transform (FFT) by convolving it with a Gaussian-shaped filter, and overlapping additions to reconstruct a continuous signal. Spectral energy in the frequency region from 100 to 8000 Hz was smeared over bandwidths of 1/8, 1/4, 1/3, 1/2, 1, 2, and 4 oct for the SRT experiment. Vowel and consonant identification was studied for smearing bandwidths of 1/8, 1/2, and 2 oct. Results showed the SRT in noise to increase as the spectral energy was smeared over bandwidths exceeding the ear's critical bandwidth. Vowel identification suffered more from this type of processing than consonant identification. Vowels were primarily confused with the back vowels /c,u/, and consonants were confused where place of articulation is concerned.  相似文献   

19.
The intensity fluctuation of surface‐enhanced Raman scattering (SERS) has seriously hampered its practical applications. The previous studies that involved SERS intensity fluctuation mostly focus on the assembled substrates or single molecule and single nanoparticle but have few attention on colloidal solutions containing numerous molecules and/or nanoparticles. Here, we studied the time‐elapsed SERS spectra of three different analytes in colloidal solution. Their peak intensities fluctuated simultaneously and uniformly throughout the acquisition process and had no correlation with their wavenumber wandering. Gravity‐induced inhomogeneous distributions of Ag nanocubes in solution led to the spatial and temporal variations of peak intensities, which could be reduced by altering the integration time, particle concentration, and solvent viscosity and density. Furthermore, the dissimilar interactions between analytes and Ag nanocubes showed non‐detectable influence over their intensity fluctuations. This investigation gives out the significant guidance for SERS detection in colloidal solution. Copyright © 2015 John Wiley & Sons, Ltd.  相似文献   

20.
We present a new approach (demonstrated experimentally and through modeling) to characterize the spectral envelope of a terahertz (THz) pulse in a single shot. The coherent THz pulse is produced by a femtosecond electron bunch and contains information on the bunch duration. The technique, involving a single low-power laser probe pulse, is an extension of the conventional spectral encoding method (limited in time resolution to hundreds of femtoseconds) into a regime only limited in resolution by the laser pulse length (tens of femtoseconds). While only the bunch duration is retrieved (and not the exact charge profile), such a measurement provides a useful and critical parameter for optimization of the electron accelerator.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号