首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 31 毫秒
1.
A principle of auditory perception that governs the detectability of changes in components in unfamiliar sequences of tones is demonstrated in four experiments. The proportion-of-the-total-duration (PTD) rule can be stated as follows: Each individual component of an unfamiliar sequence of tones is resolved with an accuracy that is a function of its proportion of the total duration of the sequence or "pattern." An adaptive-tracking frequency-discrimination task was used in all experiments. Experiment 1 demonstrated that the PTD rule holds over a wide range of total pattern durations, numbers of components, and component durations. Experiment 2 demonstrated that the PTD rule governs discrimination performance despite variation in the relative durations of context and target tones. Experiment 3, using a variable temporal position for the target, confirmed that the PTD effect does not require that a listener be able to anticipate the temporal location of the target tone. Experiment 4, using two target tones, showed that the PTD rules applies to the proportional duration of individual components within patterns and not to the total proportional duration of nonadjacent components within the pattern. These findings are incompatible with performance limitations based on a fixed-duration short-term memory capacity and with versions of informational limitations in which the amount of information in a pattern varies either with the number of components or with the total pattern duration. The PTD rule appears to reflect the way listeners distribute their attention when presented with unfamiliar complex sounds that have no structural properties (other than proportional duration) that significantly increase the salience of individual components.  相似文献   

2.
Learning to perceive pitch differences   总被引:2,自引:0,他引:2  
This paper reports two experiments concerning the stimulus specificity of pitch discrimination learning. In experiment 1, listeners were initially trained, during ten sessions (about 11,000 trials), to discriminate a monaural pure tone of 3000 Hz from ipsilateral pure tones with slightly different frequencies. The resulting perceptual learning (improvement in discrimination thresholds) appeared to be frequency-specific since, in subsequent sessions, new learning was observed when the 3000-Hz standard tone was replaced by a standard tone of 1200 Hz, or 6500 Hz. By contrast, a subsequent presentation of the initial tones to the contralateral ear showed that the initial learning was not, or was only weakly, ear-specific. In experiment 2, training in pitch discrimination was initially provided using complex tones that consisted of harmonics 3-7 of a missing fundamental (near 100 Hz for some listeners, 500 Hz for others). Subsequently, the standard complex was replaced by a standard pure tone with a frequency which could be either equal to the standard complex's missing fundamental or remote from it. In the former case, the two standard stimuli were matched in pitch. However, this perceptual relationship did not appear to favor the transfer of learning. Therefore, the results indicated that pitch discrimination learning is, at least to some extent, timbre-specific, and cannot be viewed as a reduction of an internal noise which would affect directly the output of a neural device extracting pitch from both pure tones and complex tones including low-rank harmonics.  相似文献   

3.
Thresholds (F0DLs) were measured for discrimination of the fundamental frequency (F0) of a group of harmonics (group B) embedded in harmonics with a fixed F0. Miyazono and Moore [(2009). Acoust. Sci. & Tech. 30, 383386] found a large training effect for tones with high harmonics in group B, when the harmonics were added in cosine phase. It is shown here that this effect was due to use of a cue related to pitch pulse asynchrony (PPA). When PPA cues were disrupted by introducing a temporal offset between the envelope peaks of the harmonics in group B and the remaining harmonics, F0DLs increased markedly. Perceptual learning was examined using a training stimulus with cosine-phase harmonics, F0 = 50 Hz, and high harmonics in group B, under conditions where PPA was not useful. Learning occurred, and it transferred to other cosine-phase tones, but not to random-phase tones. A similar experiment with F0 = 100 Hz showed a learning effect which transferred to a cosine-phase tone with mainly high unresolved harmonics, but not to cosine-phase tones with low harmonics, and not to random-phase tones. The learning found here appears to be specific to tones for which F0 discrimination is based on distinct peaks in the temporal envelope.  相似文献   

4.
Molecular psychophysics attempts to model the observer's response to stimuli as they vary from trial to trial. The approach has gained popularity in multitone pattern discrimination studies as a means of estimating the relative reliance or decision weight listeners give to different tones in the pattern. Various factors affecting decision weights have been examined, but one largely ignored is the relative level of tones in the pattern. In the present study listeners detected a level-increment in a sequence of 5, 100-ms, 2.0-kHz tone bursts alternating in level between 40 and 80 dB SPL. The level increment was made largest on the 40-dB tones, yet despite this all four highly-practiced listeners gave near exclusive weight to the 80-dB tones. The effect was the same when the tones were replaced by bursts of broadband Gaussian noise alternating in level. It was reduced only when the level differences were made <10 dB, and it was entirely reversed only when the low-level tones alternated with louder bursts of Gaussian noise. The results are discussed in terms of the effects of both sensory and perceptual factors on estimates of decision weights.  相似文献   

5.

Background  

Due to auditory experience, musicians have better auditory expertise than non-musicians. An increased neocortical activity during auditory oddball stimulation was observed in different studies for musicians and for non-musicians after discrimination training. This suggests a modification of synaptic strength among simultaneously active neurons due to the training. We used amplitude-modulated tones (AM) presented in an oddball sequence and manipulated their carrier or modulation frequencies. We investigated non-musicians in order to see if behavioral discrimination training could modify the neocortical activity generated by change detection of AM tone attributes (carrier or modulation frequency). Cortical evoked responses like N1 and mismatch negativity (MMN) triggered by sound changes were recorded by a whole head magnetoencephalographic system (MEG). We investigated (i) how the auditory cortex reacts to pitch difference (in carrier frequency) and changes in temporal features (modulation frequency) of AM tones and (ii) how discrimination training modulates the neuronal activity reflecting the transient auditory responses generated in the auditory cortex.  相似文献   

6.
Infant-directed speech (IDS) is believed to facilitate language learning. However, the benefit may be either due to clearer acoustic correlates to linguistic structures, or simply increased attention from infants induced by IDS exaggerated prosody. This study investigated the pure effect of IDS pitch on lexical tone learning, with attentional/affective factors removed by using artificial neural networks. Following training with the pitch of Mandarin tones in IDS versus adult-directed speech, the networks yielded equal tonal categorization for both registers. IDS pitch produced no additional linguistic support. IDS pitch appears to strictly play the non-linguistic role of attention/affect, which may indirectly benefit learning.  相似文献   

7.
These experiments address the following issues. (1) When two complex tones contain different harmonics, do the differences in timbre between them impair the ability to discriminate the pitches of the tones? (2) When two complex tones have only a single component in common, and that component is the most discriminable component in each tone, is the frequency discrimination of the component affected by differences in residue pitch between the two tones? (3) How good is the pitch discrimination of complex tones with no common components when each tone contains multiple harmonics, so as to avoid ambiguity of pitch? (4) Is the pitch discrimination of complex tones with common harmonics impaired by shifting the component frequencies to nonharmonic values? In all experiments, frequency difference limens (DLCs) were measured for multiple-component complex tones, using an adaptive two-interval, two-alternative, forced-choice task. Three highly trained subjects were used. The main conclusions are as follows. (1) When two tones have the first six harmonics in common, DLCs are larger when the upper harmonics are different than when the upper harmonics are in common or are absent. It appears that differences in timbre impair DLCs. (2) Discrimination of the frequency of a single common partial in two complex tones is worse when the two tones have different residue pitches than when they have the same residue pitch. (3) DLCs for complex tones with no common harmonics are generally larger than those for complex tones with common harmonics. For the former, large individual differences occur, probably because subjects are affected differently by differences in timbre. (4) DLCs for harmonic complex tones are smaller than DLCs for complex tones in which the components are mistuned from harmonic values. This can probably be attributed to the less distinct residue pitch of the inharmonic complexes, rather than to reduced discriminability of partials. Overall, the results support the idea that DLCs for complex tones with common harmonics depend on residue pitch comparisons, rather than on comparisons of the pitches of partials.  相似文献   

8.
Human observers were asked to judge whether or not two sequences of eight or more tones had the same serial pattern of frequencies. The temporal envelopes of the sequences were manipulated by randomly varying the tone durations or intertone gaps. In the correlated condition, the temporal envelopes of the sequences were varied across trials; the two sequences within each trial had the same temporal envelope. In the uncorrelated condition, the temporal envelopes were varied both across and within trials; every sequence had a unique temporal pattern. Performance in the uncorrelated condition decreased with increased variability in the temporal envelope. Performance in the correlated condition was independent of temporal variability, but decreased with increases in the time interval between the onsets of the two sequences. This pattern of results is consistent with an extension of a model of auditory discrimination developed by Durlach and Braida [J. Acoust. Soc. Am. 46, 372-383 (1969)], in which two processing modes are postulated: a trace mode and a context mode. When the tonal sequences had unique temporal patterns, context mode processing was dominant; when the sequences had identical temporal patterns, trace mode processing was preferred. The effect of variables such as the number of tones, the tone duration, the time gap between tones, and the time interval between sequences was consistent with the predictions of the discrimination model.  相似文献   

9.
Adult non-native speech perception is subject to influence from multiple factors, including linguistic and extralinguistic experience such as musical training. The present research examines how linguistic and musical factors influence non-native word identification and lexical tone perception. Groups of native tone language (Thai) and non-tone language listeners (English), each subdivided into musician and non-musician groups, engaged in Cantonese tone word training. Participants learned to identify words minimally distinguished by five Cantonese tones during training, also completing musical aptitude and phonemic tone identification tasks. First, the findings suggest that either musical experience or a tone language background leads to significantly better non-native word learning proficiency, as compared to those with neither musical training nor tone language experience. Moreover, the combination of tone language and musical experience did not provide an additional advantage for Thai musicians above and beyond either experience alone. Musicianship was found to be more advantageous than a tone language background for tone identification. Finally, tone identification and musical aptitude scores were significantly correlated with word learning success for English but not Thai listeners. These findings point to a dynamic influence of musical and linguistic experience, both at the tone dentification level and at the word learning stage.  相似文献   

10.
The relation between the auditory brain stem potential called the frequency-following response (FFR) and the low pitch of complex tones was investigated. Eleven complex stimuli were synthesized such that frequency content varied but waveform envelope periodicity was constant. This was accomplished by repeatedly shifting the components of a harmonic complex tone upward in frequency by delta f of 20 Hz, producing a series of six-component inharmonic complex tones with constant intercomponent spacing of 200 Hz. Pitch-shift functions were derived from pitch matches for these stimuli to a comparison pure tone for each of four normal hearing adults with extensive musical training. The FFRs were recorded for the complex stimuli that were judged most divergent in pitch by each subject and for pure-tone signals that were judged equal in pitch to these complex stimuli. Spectral analyses suggested that the spectral content of the FFRs elicited by the complex stimuli did not vary consistently with component frequency or the first effect of pitch shift. Furthermore, complex and pure-tone signals judged equal in pitch did not elicit FFRs of similar spectral content.  相似文献   

11.
Dichotic pitch perception reflects the auditory system's use of binaural cues to perceptually separate different sound sources and to determine the spatial location of sounds. Several studies were conducted to identify factors that influence children's dichotic pitch perception thresholds. An initial study of school children revealed an age-related improvement in thresholds for lateralizing dichotic pitch tones. In subsequent studies potential sensory and nonsensory limitations on young children's performance of dichotic pitch lateralization tasks were examined. A training study showed that with sufficient practice, young children lateralize dichotic pitch stimuli as well as adults, indicating an age difference in perceptual learning of the lateralization task. Changing the task requirements so that young children made a judgment about the pitch of dichotic pitch tones, rather than the spatial location of the tones, also resulted in significantly better thresholds. These findings indicate that nonsensory factors limit young children's performance of dichotic pitch tasks.  相似文献   

12.
Thresholds for discriminating the fundamental frequency (FO) of a complex tone, FODLs, are small when low harmonics are present, but increase when the number of the lowest harmonic, N, is above eight. To assess whether the relatively small FODLs for N in the range 8-10 are based on (partly) resolved harmonics or on temporal fine structure information, FODLs were measured as a function of N for tones with three successive harmonics which were added either in cosine or alternating phase. The center frequency was 2000 Hz, and N was varied by changing the mean FO. A background noise was used to mask combination tones. The value of FO was roved across trials to force subjects to make within-trial comparisons. N was roved by +/- 1 for every stimulus, to prevent subjects from using excitation pattern cues. FODLs were not influenced by component phase for N= 6 or 7, but were smaller for cosine than for alternating phase once N exceeded 7, suggesting that temporal fine structure plays a role in this range. When the center frequency was increased to 5000 Hz, performance was much worse for low N, suggesting that phase locking is important for obtaining low FODLs with resolved harmonics.  相似文献   

13.
Studies of pitch perception often involve measuring difference limens for complex tones (DLCs) that differ in fundamental frequency (F0). These measures are thought to reflect F0 discrimination and to provide an indirect measure of subjective pitch strength. However, in many situations discrimination may be based on cues other than the pitch or the F0, such as differences in the frequencies of individual components or timbre (brightness). Here, DLCs were measured for harmonic and inharmonic tones under various conditions, including a randomized or fixed lowest harmonic number, with and without feedback. The inharmonic tones were produced by shifting the frequencies of all harmonics upwards by 6.25%, 12.5%, or 25% of F0. It was hypothesized that, if DLCs reflect residue-pitch discrimination, these frequency-shifted tones, which produced a weaker and more ambiguous pitch than would yield larger DLCs than the harmonic tones. However, if DLCs reflect comparisons of component pitches, or timbre, they should not be systematically influenced by frequency shifting. The results showed larger DLCs and more scattered pitch matches for inharmonic than for harmonic complexes, confirming that the inharmonic tones produced a less consistent pitch than the harmonic tones, and consistent with the idea that DLCs reflect F0 pitch discrimination.  相似文献   

14.
The role of frequency and informational cues in the detection of tones of uncertain frequency was investigated using the probe-signal method [G.Z. Greenberg and W.D. Larkin, J. Acoust. Soc. Am. 44, 1513-1523 (1968)] with auditory and visual test patterns. Patterns consisted of a sequence of events, either tones or visual stimuli, preceding the tone to be detected. Both frequency and informational cues were available in the auditory patterns, whereas only informational cues were available in the visual patterns. Results indicated that observers in the auditory condition displayed trial-by-trial selective attention to one or another frequency band as a function of the cues provided by earlier pattern components. In contrast, listeners in the visual condition displayed simultaneous attention to the two separate frequency bands that could possibly contain the tone, regardless of the information provided by the cues. Results are discussed in terms of single- and multiple-band models of attention.  相似文献   

15.
This paper extends previous research on listeners' abilities to discriminate the details of brief tonal components occurring within sequential auditory patterns (Watson et al., 1975, 1976). Specifically, the ability to discriminate increments in the duration delta t of tonal components was examined. Stimuli consisted of sequences of ten sinusoidal tones: a 40-ms test tone to which delta t was added, plus nine context tones with individual durations fixed at 40 ms or varying between 20 and 140 ms. The level of stimulus uncertainty was varied from high (any of 20 test tones occurring in any of nine factorial contexts), through medium (any of 20 test tones occurring in ten contexts), to minimal levels (one test tone occurring in a single context). The ability to discriminate delta t depended strongly on the level of stimulus uncertainty, and on the listener's experience with the tonal context. Asymptotic thresholds under minimal uncertainty approached 4-6 ms, or 15% of the duration of the test tones; under high uncertainty, they approached 40 ms, or 10% of the total duration of the tonal sequence. Initial thresholds exhibited by inexperienced listeners are two-to-four times greater than the asymptotic thresholds achieved after considerable training (20,000-30,000 trials). Isochronous sequences, with context tones of uniform, 40-ms duration, yield lower thresholds than those with components of varying duration. The frequency and temporal position of the test tones had only minor effects on temporal discrimination. It is proposed that a major determinant of the ability to discriminate the duration of components of sequential patterns is the listener's knowledge about "what to listen for and where." Reduced stimulus uncertainty and extensive practice increase the precision of this knowledge, and result in high-resolution discrimination performance. Increased uncertainty, limited practice, or both, would allow only discrimination of gross changes in the temporal or spectral structure of the sequential patterns.  相似文献   

16.
Difference limens for complex tones (DLCs) that differ in F0 are widely regarded as a measure of periodicity-pitch discrimination. However, because F0 changes are inevitably accompanied by changes in the frequencies of the harmonics, DLCs may actually reflect the discriminability of individual components. To test this hypothesis, DLCs were measured for complex tones, the component frequencies of which were shifted coherently upward or downward by ΔF = 0%, 25%, 37.5%, or 50% of the F0, yielding fully harmonic (ΔF = 0%), strongly inharmonic (ΔF = 25%, 37.5%), or odd-harmonic (ΔF = 50%) tones. If DLCs truly reflect periodicity-pitch discriminability, they should be larger (worse) for inharmonic tones than for harmonic and odd harmonic tones because inharmonic tones have a weaker pitch. Consistent with this prediction, the results of two experiments showed a non-monotonic dependence of DLCs on ΔF, with larger DLCs for ΔF's of ± 25% or ± 37.5% than for ΔF's of 0 or ± 50% of F0. These findings are consistent with models of pitch perception that involve harmonic templates or with an autocorrelation-based model provided that more than just the highest peak in the summary autocorrelogram is taken into account.  相似文献   

17.
This study examined the perception and cortical representation of harmonic complex tones, from the perspective of the spectral fusion evoked by such sounds. Experiment 1 tested whether ferrets spontaneously distinguish harmonic from inharmonic tones. In baseline sessions, ferrets detected a pure tone terminating a sequence of inharmonic tones. After they reached proficiency, a small fraction of the inharmonic tones were replaced with harmonic tones. Some of the animals confused the harmonic tones with the pure tones at twice the false-alarm rate. Experiment 2 sought correlates of harmonic fusion in single neurons of primary auditory cortex and anterior auditory field, by comparing responses to harmonic tones with those to inharmonic tones in the awake alert ferret. The effects of spectro-temporal filtering were accounted for by using the measured spectrotemporal receptive field to predict responses and by seeking correlates of fusion in the predictability of responses. Only 12% of units sampled distinguished harmonic tones from inharmonic tones, a small percentage that is consistent with the relatively weak ability of the ferrets to spontaneously discriminate harmonic tones from inharmonic tones in Experiment 1.  相似文献   

18.
The tendency to hear a sequence of alternating low (L) and high (H) frequency tones as two streams can be increased by a preceding induction sequence, even one composed only of same-frequency tones. Four experiments used such an induction sequence (10 identical L tones) to promote segregation in a shorter test sequence comprising L and H tones. Previous studies have shown that the build-up of stream segregation is usually reduced greatly when a sudden change in acoustic properties distinguishes all of the induction tones from their test-sequence counterparts. Experiment 1 showed that a single deviant tone, created by altering the final inducer (in frequency, level, duration, or replacement with silence) reduced reported segregation, often substantially. Experiment 2 partially replicated this finding, using changes in temporal discrimination as a measure of streaming. Experiments 3 and 4 varied the size of a frequency change applied to the deviant tone; the extent of resetting varied with size only gradually. The results suggest that resetting begins to occur once the change is large enough to be noticeable. Since the prior inducers always remained unaltered in the deviant-tone conditions, it is proposed that a single change actively resets the build-up evoked by the induction sequence.  相似文献   

19.
This article describes further study of the finding reported by Green et al. [J. Acoust. Soc. Am. 73, 639-643 (1983)] and others that, in certain conditions, the threshold of detectability for an intensity increment to the center tone of a multitone reference spectrum decreased as the number of nonsignal tones increased. That result was considered remarkable since critical-band theory would predict that these nonsignal tones, spaced outside the "critical band" containing the signal, would have no effect on or, at most, slightly decrease within-band detectability--and certainly could not account for the result of improved detectability found in the study cited above. Recently, Henn and Turner [J. Acoust. Soc. Am. 88, 126-131 (1990)] were unable to replicate the result described above, concluding that the phenomenon exists only in "limited conditions" and that is "highly individual" in nature. Further, they speculated that the most likely reason for the discrepancy between their study and previous studies was the selection and/or training of the observers. The present study addressed the effects of the amount of subject training on the finding of Green et al. while controlling the potential effects of stimulus order. Specifically, for a group of three "naive" listeners, thresholds were measured for 3-, 7-, and 21-tone inharmonic complexes as a function of the amount of practice in a mixed-block design. In all cases the group mean thresholds decreased as the number of nonsignal tones increased both initially and after extensive practice for both fixed- and roving-level conditions. Thus the effect does not appear to be an artifact of the amount or order of training subjects receive. The possible role of subject sample size and the magnitude of individual differences in obtaining the effect remains an open question. Two hypotheses suggested to account for the improvement in threshold with increasing number of nonsignal tones were evaluated. The hypotheses were represented by simple mathematical models, referred to as the "multiple-comparison" and "pitch-cue" models. The predictions of both models were compared with the results of a series of detection experiments in which the independent variables were the number of nonsignal tones and amount of random, within-trial "amplitude perturbation" [cf. Kidd et al., J. Acoust. Soc. Am. 79, 1045-1053 (1986)] of the nonsignal tones. Neither model, as applied, provided a satisfactory account of the effects of the main variables of number of tones and amount of perturbation.(ABSTRACT TRUNCATED AT 400 WORDS)  相似文献   

20.
Mandarin perceivers were tested in visual lexical-tone identification before and after learning. Baseline performance was only slightly above chance, although there appeared to be some visual information in the speakers' neck and head movements. When participants were taught to use this visible information in two experiments, visual tone identification improved significantly. There appears to be a relationship between the production of lexical tones and the visible movements of the neck, head, and mouth, and this information can be effectively used after a short training session.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号