首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 31 毫秒
1.
This study describes the masking asymmetry between noise and iterated rippled noise (IRN) as a function of spectral region and the IRN delay. Masking asymmetry refers to the fact that noise masks IRN much more effectively than IRN masks noise, even when the stimuli occupy the same spectral region. Detection thresholds for IRN masked by noise and for noise masked by IRN were measured with an adaptive two-alternative, forced choice (2AFC) procedure with signal level as the adaptive parameter. Masker level was randomly varied within a 10-dB range in order to reduce the salience of loudness as a cue for detection. The stimuli were filtered into frequency bands, 2.2-kHz wide, with lower cutoff frequencies ranging from 0.8 to 6.4 kHz. IRN was generated with 16 iterations and with varying delays. The reciprocal of the delay was 16, 32, 64, or 128 Hz. When the reciprocal of the IRN delay was within the pitch range, i.e., above 30 Hz, there was a substantial masking asymmetry between IRN and noise for all filter cutoff frequencies; threshold for IRN masked by noise was about 10 dB larger than threshold for noise masked by IRN. For the 16-Hz IRN, the masking asymmetry decreased progressively with increasing filter cutoff frequency, from about 9 dB for the lowest cutoff frequency to less than 1 dB for the highest cutoff frequency. This suggests that masking asymmetry may be determined by different cues for delays within and below the pitch range. The fact that masking asymmetry exists for conditions that combine very long IRN delays with very high filter cutoff frequencies means that it is unlikely that models based on the excitation patterns of the stimuli would be successful in explaining the threshold data. A range of time-domain models of auditory processing that focus on the time intervals in phase-locked neural activity patterns is reviewed. Most of these models were successful in accounting for the basic masking asymmetry between IRN and noise for conditions within the pitch range, and one of the models produced an exceptionally good fit to the data.  相似文献   

2.
Virtual pitch in a computational physiological model   总被引:2,自引:0,他引:2  
A computational model of nervous activity in the auditory nerve, cochlear nucleus, and inferior colliculus is presented and evaluated in terms of its ability to simulate psychophysically-measured pitch perception. The model has a similar architecture to previous autocorrelation models except that the mathematical operations of autocorrelation are replaced by the combined action of thousands of physiologically plausible neuronal components. The evaluation employs pitch stimuli including complex tones with a missing fundamental frequency, tones with alternating phase, inharmonic tones with equally spaced frequencies and iterated rippled noise. Particular attention is paid to differences in response to resolved and unresolved component harmonics. The results indicate that the model is able to simulate qualitatively the related pitch-perceptions. This physiological model is similar in many respects to autocorrelation models of pitch and the success of the evaluations suggests that autocorrelation models may, after all, be physiologically plausible.  相似文献   

3.
Better place-coding of the fundamental frequency in cochlear implants   总被引:1,自引:0,他引:1  
In current cochlear implant systems, the fundamental frequency F0 of a complex sound is encoded by temporal fluctuations in the envelope of the electrical signals presented on the electrodes. In normal hearing, the lower harmonics of a complex sound are resolved, in contrast with a cochlear implant system. In the present study, it is investigated whether "place-coding" of the first harmonic improves the ability of an implantee to discriminate complex sounds with different fundamental frequencies. Therefore, a new filter bank was constructed, for which the first harmonic is always resolved in two adjacent filters, and the balance between both filter outputs is directly related to the frequency of the first harmonic. The new filter bank was compared with a filter bank that is typically used in clinical processors, both with and without the presence of temporal cues in the stimuli. Four users of the LAURA cochlear implant participated in a pitch discrimination task to determine detection thresholds for F0 differences. The results show that these thresholds decrease noticeably for the new filter bank, if no temporal cues are present in the stimuli. If temporal cues are included, the differences between the results for both filter banks become smaller, but a clear advantage is still observed for the new filter bank. This demonstrates the feasibility of using place-coding for the fundamental frequency.  相似文献   

4.
This paper describes a neurocognitive model of pitch segregation in which it is proposed that recognition mechanisms initiate early in auditory processing pathways so that long-term memory templates may be employed to segregate and integrate auditory features. In this model neural representations of pitch height are primed by the location and pattern of excitation across auditory filter channels in relation to long-term memory templates for common stimuli. Since waveform driven pitch mechanisms may produce information at multiple frequencies for tonal stimuli, pitch priming was assumed to include competitive inhibition that would allow only one pitch estimation at any time. Consequently concurrent pitch information must be relayed to short-term memory via a parallel mechanism that employs pitch information contained in the long-term memory template of the chord. Pure tones, harmonic complexes and two pitch chords of harmonic complexes were correctly classified by the correlation of templates comprising auditory nerve excitation and off-frequency inhibition with the excitation patterns of stimuli. The model then replicated behavioral data for pitch matching of concurrent vowels. Comparison of model outputs to the behavioral data suggests that inability to recognize a stimulus was associated with poor pitch segregation due to the use of inappropriate pitch priming strategies.  相似文献   

5.
In transmission-line models of cochlear mechanics, predictions about otoacoustic-emission delays depend on the place- or wave-fixed nature of the emission generation mechanism. In this work, transient evoked otoacoustic emissions (TEOAEs), recorded at different stimulus levels in 10 young subjects, were analyzed using wavelet-based time-frequency analysis to determine the latency of each frequency component of the response. The same wave forms were Fourier analyzed to evaluate the phase-gradient delay as a function of frequency. Interpreting the relation between these two characteristic delays using cochlear models shows that most of the TEOAE response can be attributed to place-fixed reflection mechanisms. The causality principle explains observed correlations between fluctuations of the TEOAE amplitude and phase-gradient delay.  相似文献   

6.
Melodic contour identification was measured in cochlear implant (CI) and normal-hearing (NH) subjects for piano samples processed by four bandpass filters: low (310-620 Hz), middle (620-2480 Hz), high (2480-4960 Hz), and full (310-4960 Hz). NH performance was near-perfect for all filter ranges and much higher than CI performance. The best mean CI performance was with the middle frequency range; performance was much better for some CI subjects with the middle rather than the full filter. These results suggest that acoustic filtering may reduce potential mismatches between fundamental frequencies and harmonic components thereby improving CI users' melodic pitch perception.  相似文献   

7.
光谱位相干涉仪参数的优化选取   总被引:1,自引:0,他引:1       下载免费PDF全文
模拟研究了光谱位相相干直接电场重构法的三个主要参数:时间延迟τ、频率剪切量Ω和展宽器色散φ之间的相互关系.研究结果表明:不同的待测脉冲,具有不同的最佳时间延迟τ;而τ的改变所引起的Ω的改变又限制了τ的选取范围;在此情况下只有适当调整φ才能在τ的较宽的范围内保证测量精度. 关键词: 光谱位相相干直接电场重构法 飞秒脉冲测量 光谱相干  相似文献   

8.
Multichannel, auditory models have been repeatedly used to explain many aspects of human pitch perception. Among the most successful ones are models where pitch is estimated based on an analysis of periodicity in the simulated auditory-nerve firing. This periodicity analysis is typically implemented as a running autocorrelation, i.e., the autocorrelation is calculated within a temporal window which is shifted along the time axis. The window was suggested to have an exponential decay with time-constant estimates between 1.5 and 100 ms. The window length determines the minimal integration time of pitch extraction. The present experiments are designed to quantify the temporal window of pitch extraction using regular-interval noises (RINs). RINs were generated by concatenating equal-duration noise samples which produce a pitch corresponding to the reciprocal of the sample duration when the samples are identical (periodic noise). When the samples are independent, the stimulus is Gaussian noise and produces no pitch. Using RIN stimuli where periodic portions interchange with aperiodic portions, it is shown that the temporal window of pitch extraction cannot be modeled using a single time constant but that the size of the temporal window depends on the pitch itself.  相似文献   

9.
改进谐波组织规则的单通道浊语音分离系统   总被引:1,自引:0,他引:1  
针对以往单通道噪声和浊语音分离算法的不足,改进了谐波组织算法。算法利用载波包络能量比将时频单元分为确定和非确定。提取基频作为组织线索。组织阶段分别使用谐波原理和最小幅度原理对确定时频单元组织;使用改进包络自相关函数度量幅度调制率对非确定时频单元组织。对比以往算法的处理结果,改进算法平均信噪比(SNR)提高0.96 dB。通过对谐波组织规则的改进,提高了分离性能。  相似文献   

10.
Hearing a mistuned harmonic in an otherwise periodic complex tone   总被引:1,自引:0,他引:1  
The ability of a listener to detect a mistuned harmonic in an otherwise periodic tone is representative of the capacity to segregate auditory entities on the basis of steady-state signal cues. By use of a task in which listeners matched the pitch of a mistuned harmonic, this ability has been studied, in order to find dependences on mistuned harmonic number, fundamental frequency, signal level, and signal duration. The results considerably augment the data previously obtained from discrimination experiments and from experiments in which listeners counted apparent sources. Although previous work has emphasized the role of spectral resolution in the segregation process, the present work suggests that neural synchrony is an important consideration; our data show that listeners lose the ability to segregate mistuned harmonics at high frequencies where synchronous neural firing vanishes. The functional form of this loss is insensitive to the spacing of the harmonics. The matching experiment also permits the measurement of the pitches of mistuned harmonics. The data exhibit shifts of a form that argues against models of pitch shifts that are based entirely upon partial masking.  相似文献   

11.
Recent temporal models of pitch and amplitude modulation perception converge on a relatively realistic implementation of cochlear processing followed by a temporal analysis of periodicity. However, for modulation perception, a modulation filterbank is applied whereas for pitch perception, autocorrelation is applied. Considering the large overlap in pitch and modulation perception, this is not parsimonious. Two experiments are presented to investigate the interaction between carrier periodicity, which produces strong pitch sensations, and envelope periodicity using broadband stimuli. Results show that in the presence of carrier periodicity, detection of amplitude modulation is impaired throughout the tested range (8-1000 Hz). On the contrary, detection of carrier periodicity in the presence of an additional amplitude modulation is impaired only for very low frequencies below the pitch range (<33 Hz). Predictions of a generic implementation of a modulation-filterbank model and an autocorrelation model are compared to the data. Both models were too insensitive to high-frequency envelope or carrier periodicity and to infra-pitch carrier periodicity. Additionally, both models simulated modulation detection quite well but underestimated the detrimental effect of carrier periodicity on modulation detection. It is suggested that a hybrid model consisting of bandpass envelope filters with a ripple in their passband may provide a functionally successful and physiologically plausible basis for a unified model of auditory periodicity extraction.  相似文献   

12.
The nature of the neural processing underlying the extraction of pitch information from harmonic complex sounds is still unclear. Electrophysiological studies in the auditory nerve and many psychophysical and modeling studies suggest that pitch might be extracted successfully by applying a mechanism like autocorrelation to the temporal discharge patterns of auditory-nerve fibers. The current modeling study investigates the possible role of populations of sustained chopper (Chop-S) units located in the mammalian ventral cochlear nucleus (VCN) in this process. First, it is shown that computer simulations can predict responses to periodic and quasiperiodic sounds of individual Chop-S units recorded in the guinea-pig VCN. Second, it is shown that the fundamental period of a periodic or quasiperiodic sound is represented in the first-order, interspike interval statistics of a population of simulated Chop-S units. This is true across a wide range of characteristic frequencies when the chopping rate is equal to the f0 of the sound. The model was able to simulate the results of psychophysical studies involving the pitch height and pitch strength of iterated ripple noise, the dominance region of pitch, the effect of phase on pitch height and pitch strength, pitch of inharmonic stimuli, and of sinusoidally amplitude modulated noise. Simulation results indicate that changes in the interspike interval statistics of populations of Chop-S units compare well with changes in the pitch perceived by humans. It is proposed that Chop-S units in the ventral cochlear nucleus may play an important role in pitch extraction: They can convert a purely temporal pitch code as observed in the auditory nerve into a temporal place code of pitch in populations of cochlear-nucleus, Chop-S with different characteristic frequencies, and chopping rates. Thus, populations of cochlear-nucleus Chop-S units, together with their target units presumably located in the inferior colliculus, may serve to establish a stable rate-place code of pitch at the level of the auditory cortex.  相似文献   

13.
Pitch-intensity functions and psychophysical tuning curves (PTC's) were measured in ten listeners with sensorineural impairments of presumed cochlear origin. Masking patterns, frequency jnd's, diplacusis measurements, and octave adjustments were also obtained for selected conditions in selected listeners. The results showed a tendency for increased frequency jnd's and increased pitch-matching variability in frequency regions where frequency resolution, as determined by PTC Q10 estimates, was degraded. The results also showed exaggerated pitch-level effects, both in regions where frequency resolution was degraded and, in many cases, in regions where thresholds and frequency resolution were apparently normal. The usual manifestation of exaggerated pitch-level effect was an abnormally large negative pitch shift with increasing level, particularly at low frequencies. The limited data from diplacusis measurements and octave adjustments suggest that the exaggerated negative pitch shifts are the consequence of a large increase in pitch at low stimulus levels which "recruits" at higher levels. These results are difficult to explain with simple tonotopic models, or presently formulated temporal models, of pure-tone pitch encoding.  相似文献   

14.
Fundamental frequency (f0) difference limens (DLs) were measured as a function of f0 for sine- and random-phase harmonic complexes, bandpass filtered with 3-dB cutoff frequencies of 2.5 and 3.5 kHz (low region) or 5 and 7 kHz (high region), and presented at an average 15 dB sensation level (approximately 48 dB SPL) per component in a wideband background noise. Fundamental frequencies ranged from 50 to 300 Hz and 100 to 600 Hz in the low and high spectral regions, respectively. In each spectral region, f0 DLs improved dramatically with increasing f0 as approximately the tenth harmonic appeared in the passband. Generally, f0 DLs for complexes with similar harmonic numbers were similar in the two spectral regions. The dependence of f0 discrimination on harmonic number presents a significant challenge to autocorrelation (AC) models of pitch, in which predictions generally depend more on spectral region than harmonic number. A modification involving a "lag window"is proposed and tested, restricting the AC representation to a limited range of lags relative to each channel's characteristic frequency. This modified unitary pitch model was able to account for the dependence of f0 DLs on harmonic number, although this correct behavior was not based on peripheral harmonic resolvability.  相似文献   

15.
If two vowels with different fundamental frequencies (fo's) are presented simultaneously and monaurally, listeners often hear two talkers producing different vowels on different pitches. This paper describes the evaluation of four computational models of the auditory and perceptual processes which may underlie this ability. Each model involves four stages: (i) frequency analysis using an "auditory" filter bank, (ii) determination of the pitches present in the stimulus, (iii) segregation of the competing speech sources by grouping energy associated with each pitch to create two derived spectral patterns, and (iv) classification of the derived spectral patterns to predict the probabilities of listeners' vowel-identification responses. The "place" models carry out the operations of pitch determination and spectral segregation by analyzing the distribution of rms levels across the channels of the filter bank. The "place-time" models carry out these operations by analyzing the periodicities in the waveforms in each channel. In their "linear" versions, the place and place-time models operate directly on the waveforms emerging from the filters. In their "nonlinear" versions, analogous operations are applied to the output of an additional stage which applied a compressive nonlinearity to the filtered waveforms. Compared to the other three models, the nonlinear place-time model provides the most accurate estimates of the fo's of paris of concurrent synthetic vowels and comes closest to predicting the identification responses of listeners to such stimuli. Although the model has several limitations, the results are compatible with the idea that a place-time analysis is used to segregate competing sound sources.  相似文献   

16.
This paper describes the temporal responses of anteroventral cochlear nucleus (AVCN) units in the chinchilla to rippled noises. Rippled noise is generated when a wideband noise is delayed and added (cos+ noise) or subtracted (cos- noise) to the undelayed noise. Renewal densities were constructed to evaluate synchronous discharges at the delay. In response to rippled noise, AVCN units which show phase locking to best frequency (BF) tones gave renewal densities having major peaks at the delay for cos+ noise, but nulls at the delay for cos- noise. Most AVCN units which did not show BF phase locking gave renewal densities that did not contain features related to the rippled noise delay; a few of these nonphase-locked units did show peaks in renewal densities for both cos+ and cos- noises. Synchrony at the rippled noise delay was also demonstrated with evoked potential recording. Autocorrelation functions of the neurophonic potential showed peaks at the rippled noise delay for both cos+ and cos- noises. In addition, peaks could be observed in the autocorrelation functions of neurophonic potentials for rippled noises with delays as short as 1 ms; peaks were never observed in renewal densities of single units for ripple delays as short as 1 ms. The results show that a temporal representation of rippled noise delay does exist in the AVCN and are consistent with current hypotheses regarding functions of AVCN subsystems. The temporal representation of the delay is a presumptive neural code for the pitches of rippled noises.  相似文献   

17.
It is unclear whether the perceptual segregation of a mistuned harmonic from a periodic complex tone depends specifically on harmonic relations between the other components. A procedure used previously for harmonic complexes [W. M. Hartmann et al., J. Acoust. Soc. Am. 88, 1712-1724 (1990)] was adapted and extended to regular inharmonic complexes. On each trial, subjects heard a 12-component complex followed by a pure tone in a continuous loop. In experiment 1, a mistuning of +/- 4% was applied to one of the components 2-11. The complex was either harmonic, frequency shifted, or spectrally stretched. Subjects adjusted the pure tone to match the pitch of the mistuned component. Near matches were taken to indicate segregation, and were almost as frequent in the inharmonic conditions as in the harmonic case. Also, small but consistent mismatches, pitch shifts, were found in all conditions. These were similar in direction and size to earlier findings for harmonic complexes. Using a range of mistunings, experiment 2 showed that the segregation of components from regular inharmonic complexes could be sensitive to mistunings of 1.5% or less. These findings are consistent with the proposal that aspects of spectral regularity other than harmonic relations can also influence auditory grouping.  相似文献   

18.
The authors previously reported that same/different judgments on pitch sequences were more accurate for tones with resolved (low-rank) harmonics compared to unresolved (high-rank) harmonics, even when discriminability between tones was equated [Cousineau et al. (2009). J. Acoust. Soc. Am. 126, 3179-3187]. Here, peripheral resolvability, defined by the number of harmonics per cochlear filter, was contrasted with harmonic number. Tones were presented either diotically or dichotically. In the latter case, even and odd harmonics were presented to different ears, thus halving the number of harmonics per cochlear filter. Performance was better for dichotic than for diotic presentations. This indicates that peripheral resolvability is necessary and sufficient for efficient pitch-sequence processing.  相似文献   

19.
A method of calculating an autocorrelation function with extremely narrow peaks is described. This is done by including terms in the autocorrelation expression corresponding to delays at 2 tau, 3 tau, etc., in addition to the usual term with delay tau. Implications in the frequency domain are explored. Graphs of this autocorrelation function for a number of violin sound samples, including a two-octave scale, vibrato, and glissando, are presented. Graphs of the autocorrelation function for some synthetic sound samples are also included.  相似文献   

20.
We report on the observation of doubly charged helium ions produced by a nonlinear interaction between a helium atom and photons with a photon energy of 42 eV which are generated with the 27th harmonic of a femtosecond pulse from a Ti:sapphire laser. The number of ions is proportional to the square of the intensity of the 27th harmonic pulse, and thus two-photon double ionization should be dominantly induced as compared with other nonlinear processes accompanying sequential ionization via a singly charged ion. This phenomenon is utilized to measure the pulse duration of the 27th harmonic pulse by using an autocorrelation technique, for the first time to our knowledge, and as a result a duration of 8 fs is found.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号