Similar literature
Found 20 similar articles; search time 234 ms
1.
The purpose of this cross-language study was to examine whether the online control of voice fundamental frequency (F(0)) during vowel phonation is influenced by language experience. Native speakers of Cantonese and Mandarin, both tonal languages spoken in China, participated in the experiments. Subjects were asked to vocalize the vowel sound /u/ at their comfortable habitual F(0), during which their voice pitch was unexpectedly shifted (±50, ±100, ±200, or ±500 cents, 200 ms duration) and fed back instantaneously to them over headphones. The results showed that Cantonese speakers produced significantly smaller responses than Mandarin speakers when the stimulus magnitude varied from 200 to 500 cents. Further, response magnitudes decreased as stimulus magnitude increased in Cantonese speakers, which was not observed in Mandarin speakers. These findings suggest that online control of voice F(0) during vocalization is sensitive to language experience. Further, systematic modulations of vocal responses across stimulus magnitude were observed in Cantonese speakers but not in Mandarin speakers, which indicates that this highly automatic feedback mechanism is sensitive to the specific tonal system of each language.
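The stimulus magnitudes above are given in cents, a logarithmic pitch unit in which 1200 cents equal one octave. The following is a minimal sketch of how a shift in cents maps to a frequency in Hz; the function names and the 200 Hz baseline are illustrative assumptions and do not come from the study.

```python
def cents_to_ratio(cents: float) -> float:
    """Frequency ratio for a pitch shift in cents (1200 cents = 1 octave)."""
    return 2.0 ** (cents / 1200.0)


def shifted_f0(f0_hz: float, cents: float) -> float:
    """F0 in Hz after shifting f0_hz by the given number of cents."""
    return f0_hz * cents_to_ratio(cents)


if __name__ == "__main__":
    baseline_hz = 200.0  # hypothetical habitual F0, not a value from the study
    for cents in (50, 100, 200, 500):
        down = shifted_f0(baseline_hz, -cents)
        up = shifted_f0(baseline_hz, cents)
        print(f"+/-{cents} cents around {baseline_hz:.0f} Hz: {down:.1f} Hz to {up:.1f} Hz")
```

Because the scale is logarithmic, an upward and a downward shift of the same size in cents are not symmetric in Hz.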

2.
Previous studies have demonstrated that motor control of segmental features of speech relies to some extent on sensory feedback. Control of voice fundamental frequency (F0) has been shown to be modulated by perturbations in voice pitch feedback during various phonatory tasks and in Mandarin speech. The present study was designed to determine if voice F0 is modulated in a task-dependent manner during production of suprasegmental features of English speech. English speakers received pitch-modulated voice feedback (+/-50, 100, and 200 cents, 200 ms duration) during a sustained vowel task and a speech task. Response magnitudes during speech (mean 31.5 cents) were larger than during the vowels (mean 21.6 cents), response magnitudes increased as a function of stimulus magnitude during speech but not vowels, and responses to downward pitch-shift stimuli were larger than those to upward stimuli. Response latencies were shorter in speech (mean 122 ms) compared to vowels (mean 154 ms). These findings support previous research suggesting the audio-vocal system is involved in the control of suprasegmental features of English speech by correcting for errors between voice pitch feedback and the desired F0.

3.
The present study was conducted to test the hypothesis that intrinsic laryngeal muscles are involved in producing voice fundamental frequency (F(0)) responses to perturbations in voice pitch auditory feedback. Electromyography (EMG) recordings of the cricothyroid and thyroarytenoid muscles were made with hooked-wire electrodes, while subjects sustained vowel phonations at three different voice F(0) levels (conversational, high pitch in head register, and falsetto register) and received randomized pitch shifts (±100 or ±300 cents) in their voice auditory feedback. The median latencies from stimulus onset to the peak in the EMG and voice F(0) responses were 167 and 224 ms, respectively. Among the three different F(0) levels, the falsetto register produced compensatory EMG responses that occurred prior to vocal responses and increased along with rising voice F(0) responses and decreased for falling F(0) responses. For the conversational and high voice levels, the EMG response timing was more variable than in the falsetto voice, and changes in EMG activity with relevance to the vocal responses did not follow the consistent trend observed in the falsetto condition. The data from the falsetto condition suggest that both the cricothyroid and thyroarytenoid muscles are involved in generating the compensatory vocal responses to pitch-shifted voice feedback.

4.
Previous studies have demonstrated that perturbations in voice pitch or loudness feedback lead to compensatory changes in voice F(0) or amplitude during production of sustained vowels. Responses to pitch-shifted auditory feedback have also been observed during English and Mandarin speech. The present study investigated whether Mandarin speakers would respond to amplitude-shifted feedback during meaningful speech production. Native speakers of Mandarin produced two-syllable utterances with focus on the first syllable, the second syllable, or none of the syllables, as prompted by corresponding questions. Their acoustic speech signal was fed back to them with loudness shifted by +/-3 dB for 200 ms durations. The responses to the feedback perturbations had mean latencies of approximately 142 ms and magnitudes of approximately 0.86 dB. Response magnitudes were greater and latencies were longer when emphasis was placed on the first syllable than when there was no emphasis. Since amplitude is not known for being highly effective in encoding linguistic contrasts, the fact that subjects reacted to amplitude perturbation just as fast as they reacted to F(0) perturbations in previous studies provides clear evidence that a highly automatic feedback mechanism is active in controlling both F(0) and amplitude of speech production.

5.
Recent research has found that while speaking, subjects react to perturbations in pitch of voice auditory feedback by changing their voice fundamental frequency (F0) to compensate for the perceived pitch-shift. The long response latencies (150-200 ms) suggest they may be too slow to assist in on-line control of the local pitch contour patterns associated with lexical tones on a syllable-to-syllable basis. In the present study, we introduced pitch-shifted auditory feedback to native speakers of Mandarin Chinese while they produced disyllabic sequences /ma ma/ with different tonal combinations at a natural speaking rate. Voice F0 response latencies (100-150 ms) to the pitch perturbations were shorter than syllable durations reported elsewhere. Response magnitudes increased from 50 cents during static tone to 85 cents during dynamic tone productions. Response latencies and peak times decreased in phrases involving a dynamic change in F0. The larger response magnitudes and shorter latency and peak times in tasks requiring accurate, dynamic control of F0 indicate this automatic system for regulation of voice F0 may be task-dependent. These findings suggest that auditory feedback may be used to help regulate voice F0 during production of bi-tonal Mandarin phrases.

6.
The purpose of the present study was to investigate the responsiveness of the pitch-shift reflex to small magnitude stimuli and voice fundamental frequency (F(0)) level. English speakers received pitch-shifted voice feedback (+/-10, 20, 30, 40, and 50 cents, 200 ms duration) during vowel phonations at a high and a low F(0) level. Mean pitch-shift response magnitude increased as a function of pitch-shift stimulus magnitude, but when expressed as a percent of stimulus magnitude, declined from 100% with +/-10 cents to 37% with +/-50 cents stimuli. Response magnitudes were larger and latencies were shorter with a high F(0) level (16 cents; 130 ms) compared to a low F(0) level (13 cents; 152 ms). Data from the present study demonstrate that vocal response magnitudes are equal to small perturbation magnitudes, and they are larger and faster with a high F(0) voice. These results suggest that the audio-vocal system is optimally suited for compensating for small rather than larger pitch perturbations. Data also suggest the sensitivity of the audio-vocal system to voice perturbation may vary with F(0) level.
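The percentages in this abstract express response magnitude relative to stimulus magnitude. As a rough sketch of that arithmetic, the snippet below converts a stimulus size and a reported percentage back into an implied mean response in cents. The function name is hypothetical, and the implied values are back-calculated from the two percentages quoted above, not figures reported by the study.

```python
def implied_response_cents(stimulus_cents: float, percent_of_stimulus: float) -> float:
    """Response magnitude in cents implied by a response given as a percent of the stimulus."""
    return stimulus_cents * percent_of_stimulus / 100.0


if __name__ == "__main__":
    # (stimulus in cents, response as percent of stimulus) pairs quoted in the abstract
    reported = [(10.0, 100.0), (50.0, 37.0)]
    for stimulus, percent in reported:
        response = implied_response_cents(stimulus, percent)
        print(f"{stimulus:.0f}-cent stimulus at {percent:.0f}%: about {response:.1f} cents")
```

Read this way, the absolute response grows with stimulus size while the proportion compensated shrinks, which is the pattern the abstract describes.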

7.
The present study was undertaken to examine if a subject's voice F0 responded not only to perturbations in pitch of voice feedback but also to changes in pitch of a side tone presented congruent with voice feedback. Small magnitude brief duration perturbations in pitch of voice or tone auditory feedback were randomly introduced during sustained vowel phonations. Results demonstrated a higher rate and larger magnitude of voice F0 responses to changes in pitch of the voice compared with a triangular-shaped tone (experiment 1) or a pure tone (experiment 2). However, response latencies did not differ across voice or tone conditions. Data suggest that subjects responded to the change in F0 rather than harmonic frequencies of auditory feedback because voice F0 response prevalence, magnitude, or latency did not statistically differ across triangular-shaped tone or pure-tone feedback. Results indicate the audio-vocal system is sensitive to the change in pitch of a variety of sounds, which may represent a flexible system capable of adapting to changes in the subject's voice. However, lower prevalence and smaller responses to tone pitch-shifted signals suggest that the audio-vocal system may resist changes to the pitch of other environmental sounds when voice feedback is present.

8.
Vocal vibrato and tremor are characterized by oscillations in voice fundamental frequency (F0). These oscillations may be sustained by a control loop within the auditory system. One component of the control loop is the pitch-shift reflex (PSR). The PSR is a closed-loop negative feedback reflex that is triggered in response to discrepancies between intended and perceived pitch with a latency of approximately 100 ms. Consecutive compensatory reflexive responses lead to oscillations in pitch every approximately 200 ms, resulting in approximately 5-Hz modulation of F0. Pitch-shift reflexes were elicited experimentally in six subjects while they sustained /u/ vowels at a comfortable pitch and loudness. Auditory feedback was sinusoidally modulated at discrete integer frequencies (1 to 10 Hz) with +/-25 cents amplitude. Modulated auditory feedback induced oscillations in voice F0 output of all subjects at rates consistent with vocal vibrato and tremor. Transfer functions revealed peak gains at 4 to 7 Hz in all subjects, with an average peak gain at 5 Hz. These gains occurred in the modulation frequency region where the voice output and auditory feedback signals were in phase. A control loop in the auditory system may sustain vocal vibrato and tremor-like oscillations in voice F0.
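The step from a 100 ms latency to a 5 Hz modulation can be written out as a short worked calculation. The sketch below assumes that one full oscillation consists of two consecutive compensatory responses, each taking one reflex latency; that reading, and the symbols $t_{\text{lat}}$, $T$, and $f$, are assumptions introduced here for illustration and are not stated in this form in the abstract.

```latex
% Assumption: one oscillation cycle = two consecutive reflex responses,
% each delayed by the reflex latency t_lat (about 100 ms in the abstract).
\begin{align*}
  T &\approx 2\, t_{\text{lat}} = 2 \times 100\ \text{ms} = 200\ \text{ms}, \\
  f &= \frac{1}{T} \approx \frac{1}{0.2\ \text{s}} = 5\ \text{Hz}.
\end{align*}
```

Under the same assumption, the reported peak-gain range of 4 to 7 Hz would correspond to periods of roughly 250 ms down to about 143 ms.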

9.
Journal of Voice, 2019, 33(6): 851-859
Purpose: The pitch-shift reflex (PSR) is the adaptation of the fundamental frequency during phonation and speech and describes the auditory feedback control. Speakers without voice and speech disorders mostly show a compensation of the pitch change in the auditory feedback and adapt their fundamental frequency in the opposite direction. Dysphonic patients often display problems with the auditory perception and control of their voice during therapy. Our study focuses on the auditory and kinesthetic control mechanisms of patients with muscle tension dysphonia (MTD) and speakers without voice and speech problems. The main purpose of the study is the analysis of the functionality of the control mechanisms within phonation and speech between patients with MTD and normal speakers.
Method: Sixty-one healthy subjects (17 male, 44 female) and 22 patients with MTD (7 male, 15 female) participated following two paradigms including a sustained phonation (vowel /a/) and speech ([ˈmama]). Within both paradigms the fundamental frequency of the auditory feedback was increased synthetically. For the analysis of the PSR the electroencephalogram, electroglottography, the voice signal, and the high-speed endoscopy data were recorded simultaneously. The PSR in the electroencephalogram was detected via the N100 and the mismatch negativity. Statistical tests were applied for the detection of the PSR in the physiological response within the electroglottography, voice, and high-speed endoscopy signals. The results were compared between both groups.
Results: No differences were found between the controls and patients with MTD regarding latency and magnitude of the perception of the pitch shift in both paradigms, but differences were found for the magnitude of the behavioral response. Differences could also be found for both groups between the "no pitch" and "pitch" conditions of the two paradigms regarding vocal fold dynamics and voice quality. Patients with MTD showed more vibrational irregularities during the PSR than the controls, especially regarding the symmetry of vocal fold dynamics.
Conclusion: Patients with MTD seem to have a disturbed interaction between the auditory and kinesthetic feedback inducing the execution of an overriding behavioral response.

10.
Values for acoustic voice measurements were obtained from 88 normal individuals and 98 pathological cases of mass lesions of vocal fold and 50 cases of unilateral vocal fold paralysis. Overall, all items reflecting perturbations of pitch and amplitude as well as glottal noise were significantly higher in the groups of patients compared with the normal group. The measurement of normalized noise energy (NNE) was found to be an optimum parameter for discrimination of normal/abnormal voices. The voices of patients with vocal fold nodules and vocal fold polyps were analyzed before endolaryngeal phonomicrosurgery (EPM) and 2 weeks after. Statistically significant (p < 0.01) improvement was achieved both in perceptual and acoustic analysis. EPM resulted in a significant decrease of mean jitter, shimmer, and NNE. Clinically, these measures provided documentable and measurable evidence of vocal function and were helpful for comparing patients with normal speakers. They also were useful for a thorough documentation of patient's voice pathology and for evaluation of the presurgical and postsurgical voice status.
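Jitter and shimmer are cycle-to-cycle perturbation measures of pitch period and amplitude, respectively. The abstract does not say which definitions or software were used, so the sketch below uses one common formulation as an assumption: the mean absolute difference between consecutive cycles, divided by the mean value and expressed as a percentage. The function name and the sample values are hypothetical.

```python
from typing import Sequence


def local_perturbation_percent(values: Sequence[float]) -> float:
    """Mean absolute difference between consecutive values, relative to the mean value, in percent.

    Applied to glottal cycle periods this gives a local jitter estimate;
    applied to cycle peak amplitudes it gives a local shimmer estimate.
    """
    if len(values) < 2:
        raise ValueError("need at least two cycles")
    diffs = [abs(b - a) for a, b in zip(values, values[1:])]
    mean_diff = sum(diffs) / len(diffs)
    mean_value = sum(values) / len(values)
    return 100.0 * mean_diff / mean_value


if __name__ == "__main__":
    # Hypothetical measurements for illustration only, not data from the study.
    periods_ms = [5.00, 5.02, 4.98, 5.01, 4.99]
    peak_amplitudes = [0.80, 0.82, 0.79, 0.81, 0.80]
    print(f"jitter:  {local_perturbation_percent(periods_ms):.2f} %")
    print(f"shimmer: {local_perturbation_percent(peak_amplitudes):.2f} %")
```

A perfectly periodic signal gives 0% on both measures; larger values indicate more cycle-to-cycle irregularity, which is the direction of change the abstract reports for the patient groups.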

11.
The present study tested whether subjects respond to unanticipated short perturbations in voice loudness feedback with compensatory responses in voice amplitude. The role of stimulus magnitude (+/-1, 3, or 6 dB SPL), stimulus direction (up vs down), and the ongoing voice amplitude level (normal vs soft) were compared across compensations. Subjects responded to perturbations in voice loudness feedback with a compensatory change in voice amplitude 76% of the time. Mean latency of amplitude compensation was 157 ms. Mean response magnitudes were smallest for 1-dB stimulus perturbations (0.75 dB) and greatest for 6-dB conditions (0.98 dB). However, expressed as gain, responses for 1-dB perturbations were largest and almost approached 1.0. Response magnitudes were larger for the soft voice amplitude condition compared to the normal voice amplitude condition. A mathematical model of the audio-vocal system captured the main features of the compensations. Previous research has demonstrated that subjects can respond to an unanticipated perturbation in voice pitch feedback with an automatic compensatory response in voice fundamental frequency. Data from the present study suggest that voice loudness feedback can be used in a similar manner to monitor and stabilize voice amplitude around a desired loudness level.

12.
Previous studies have shown that voice fundamental frequency (F0) is modified by changes in the pitch of vocal feedback and have demonstrated that the audio-vocal control system has both open- and closed-loop control properties. However, the extent to which this system operates in closed-loop fashion may have been underestimated in previous work. Because the step-type stimuli used were very rapid, and people are physically unable to change their voice F0 as rapidly as the stimuli, feedback responses might have been reduced or suppressed. In the present study, pitch-shift stimuli, consisting of a disparity between voice F0 and feedback pitch of varying ramp onset velocities, were presented to subjects vocalizing a steady /ah/ sound to examine the effect of stimulus onset on voice F0 responses. Results showed that response velocity covaried with stimulus velocity. Response latency and time of the peak response decreased with increases in stimulus velocity, while response magnitude decreased. A simple feedback model reproduced most features of these responses. These results strongly support previous suggestions that the audio-vocal system monitors auditory feedback and, through closed-loop negative feedback, adjusts voice F0 so as to cancel low-level fluctuations in F0.

13.
Measurements on the inverse filtered airflow waveform and of estimated average transglottal pressure and glottal airflow were made from syllable sequences in low, normal, and high pitch for 25 male and 20 female speakers. Correlation analyses indicated that several of the airflow measurements were more directly related to voice intensity than to fundamental frequency (F0). Results suggested that pressure may have different influences in low and high pitch in this speech task. It is suggested that unexpected results of increased pressure in low pitch were related to maintaining voice quality, that is, avoiding vocal fry. In high pitch, the increased pressure may serve to maintain vocal fold vibration. The findings suggested different underlying laryngeal mechanisms and vocal adjustments for increasing and decreasing F0 from normal pitch.

14.
How are listeners able to identify whether the pitch of a brief isolated sample of an unknown voice is high or low in the overall pitch range of that speaker? Does the speaker's voice quality convey crucial information about pitch level? Results and statistical models of two experiments that provide answers to these questions are presented. First, listeners rated the pitch levels of vowels taken over the full pitch ranges of male and female speakers. The absolute f0 of the samples was by far the most important determinant of listeners' ratings, but with some effect of the sex of the speaker. Acoustic measures of voice quality had only a very small effect on these ratings. This result suggests that listeners have expectations about f0s for average speakers of each sex, and judge voice samples against such expectations. Second, listeners judged speaker sex for the same speech samples. Again, absolute f0 was the most important determinant of listeners' judgments, but now voice quality measures also played a role. Thus it seems that pitch level judgments depend on voice quality mostly indirectly, through its information about sex. Absolute f0 is the most important information for deciding both pitch level and speaker sex.

15.
Although the problem of vocal fatigue is not uncommon in people with voice disorders, research on objective quantifiable indicators of vocal fatigue is limited. It has been suggested that a speaker's perception of increased phonatory effort associated with periods of prolonged voice use is related to increased lung pressure required to initiate and sustain phonation. The purpose of this study was to examine the relationship between perceived phonatory effort (PPE), which was used as a subjective index of vocal fatigue, and phonation threshold pressure (PTP), a quantifiable measure defined as the minimal lung pressure required to initiate and sustain vocal fold oscillation. PTP and PPE were recorded before, during, and after five adult male and five adult female speakers engaged in a prolonged oral reading task designed to induce vocal fatigue. The results supported a direct, moderately strong relationship between PTP and PPE, particularly when PTP was measured during speech produced at comfortable and low-speaking pitch levels. No gender effects were found. PTP returned to baseline levels within 1 hour after the fatiguing task. PPE returned to baseline within 1 day. The data support the use of PTP as an objective index of vocal fatigue.

16.
This paper examines an updated version of a lumped mucosal wave model of vocal fold oscillation during phonation. Threshold values of the subglottal pressure and the mean (DC) glottal airflow for the oscillation onset are determined. Depending on the nonlinear characteristics of the model, an oscillation hysteresis phenomenon may occur, with different values for the oscillation onset and offset thresholds. The threshold values depend on the oscillation frequency, but the occurrence of the hysteresis is independent of it. The results are tested against pressure data collected from a mechanical replica of the vocal folds, and oral airflow data collected from speakers producing intervocalic /h/. In the human speech data, observed differences between voice onset and offset may be attributed to variations in voice pitch, with a very small or nonexistent hysteresis phenomenon.

17.
This study investigated the perceptual and acoustical characteristics of vocal presentation in both the masculine and the feminine modes by the same group of male subjects. Listeners (N = 88) evaluated 22 voice samples by using 18 semantic differential scales and 57 adjectives. The 22 voice samples were provided by 11 biologically male speakers, who described themselves as heterosexual crossdressers. Each speaker read a standard passage under controlled conditions. In one reading, they demonstrated their typical masculine voice and in the other they spoke in their feminine voice. Acoustical analyses included mean fundamental frequency, frequency range, overall passage duration, and duration of a sample of stressed vowels. Results indicated that listeners heard significant differences between masculine and feminine presentations across the 11 speakers and the 18 semantic differential scales. Masculine-feminine and high-low pitch were the most salient scales in the perceptual judgments. Acoustical analyses indicated wide variation according to speaker and condition. Clinical applications are provided.

18.
To clarify the role of formant frequency in the perception of pitch in whispering, we conducted a preliminary experiment to determine (1) whether speakers change their pitch during whispering; (2) whether listeners can perceive differences in pitch; and (3) what the acoustical features are when speakers change their pitch. The listening test of whispered Japanese speech demonstrates that one can determine the perceived pitch of vowel /a/ as ordinary, high, or low. Acoustical analysis revealed that the perception of pitch corresponds to some formant frequencies. Further data with synthesized whispered voice are necessary to confirm the importance of the formant frequencies in detail for perceived pitch of whispered vowels.

19.
Auditory feedback has been suggested to be important for voice fundamental frequency (F0) control. The present study featured a new technique for testing this hypothesis by which the pitch of a subject's voice was modulated, fed back over earphones, and the resultant change in the emitted voice F0 was measured. The responses of 67 normal, healthy young adults were recorded as they attempted to ignore intermittent upward or downward shifts in pitch feedback while they sustained steady vowel sounds (/a/) or sang musical scales. Ninety-six percent of subjects increased their F0 when the feedback pitch was decreased, and 78% of subjects decreased their F0 when the pitch feedback was increased. Latencies of responses ranged from 104 to 223 ms. Results indicate people normally rely on pitch feedback to control voice F0.

20.
Hard or abrupt glottal attack (HGA) is one of the vocal behaviors often associated with benign lesions of the vocal folds. This study was designed to determine whether the frequency of HGA was different in hyperfunctional voice patients with and without vocal fold masses. One hundred and forty-seven subjects were studied. All subjects received a complete otolaryngological evaluation including strobovideolaryngoscopy, objective voice measures, and evaluation by a speech-language pathologist. Thirty-two patients were diagnosed with muscle tension dysphonia (19 male, 13 female) without vocal fold masses. Fifty-seven patients were diagnosed with unilateral vocal fold masses (29 male, 28 female), most of which were cysts. Fifty-eight patients were diagnosed with bilateral vocal fold masses (13 male, 45 female). Of the 45 females with bilateral vocal fold masses, 26 had a vocal cyst and reactive nodule and 19 had bilateral vocal fold nodules. The control group was balanced and matched based on sex and on percentage of singers and nonsingers. It consisted of 49 subjects with no vocal fold pathology (20 male, 29 female). The group was composed of professional speakers, singers, and nonprofessional speakers. All voice disordered groups demonstrated higher frequencies of HGA than the control group. Differences were found between the male and female subjects in this study. No differences were found between the various disorders. Differences were also found between the subgroups of bilateral masses, where the bilateral nodules group presented a higher frequency of HGA than the cyst and contralateral reactive nodule group.
