首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 46 毫秒
1.
Previous studies have demonstrated that motor control of segmental features of speech rely to some extent on sensory feedback. Control of voice fundamental frequency (F0) has been shown to be modulated by perturbations in voice pitch feedback during various phonatory tasks and in Mandarin speech. The present study was designed to determine if voice Fo is modulated in a task-dependent manner during production of suprasegmental features of English speech. English speakers received pitch-modulated voice feedback (+/-50, 100, and 200 cents, 200 ms duration) during a sustained vowel task and a speech task. Response magnitudes during speech (mean 31.5 cents) were larger than during the vowels (mean 21.6 cents), response magnitudes increased as a function of stimulus magnitude during speech but not vowels, and responses to downward pitch-shift stimuli were larger than those to upward stimuli. Response latencies were shorter in speech (mean 122 ms) compared to vowels (mean 154 ms). These findings support previous research suggesting the audio vocal system is involved in the control of suprasegmental features of English speech by correcting for errors between voice pitch feedback and the desired F0.  相似文献   

2.
In this study we have simultaneously measured subglottic air pressure, airflow, and vocal intensity during speech in nine healthy subjects. Subglottic air pressure was measured directly by puncture of the cricothyroid membrane. The results show that the interaction between these aerodynamic properties is much more complex that previously believed. Certain trends were seen in most individuals, such as an increase in vocal intensity with increased subglottic air pressure. However, there was considerable variability in the overall aerodynamic properties between subjects and at different frequency and intensity ranges. At certain frequencies several subjects were able to generate significantly louder voices without a comparable increase in subglottic air pressure. We hypothesize that these increases in vocal efficiency are due to changes in vocal fold vibration properties. The relationship between fundamental frequency and subglottic pressure was also noted to vary depending on vocal intensity. Possible mechanisms for these behaviors are discussed.  相似文献   

3.
Strained, strangled, and tremulous vocal qualities that are typically seen in adductor spasmodic dysphonia (ADSD), voice tremor (Tremor), and the spastic dysarthria of amyotrophic lateral sclerosis (ALS) may sound similar and be difficult to differentiate. The purpose of this study was to determine if these vocal qualities of neurologic origin could be differentiated on the basis of acoustic and motor speech parameters. Three groups of subjects (ADSD, ALS, and Tremor) were analyzed by the Motor Speech Profile System (Kay Elemetrics, Lincoln Park, NJ) for fundamental frequency (Fo), standard deviation of Fo, diadochokinetic rate (ddk), standard deviation of ddk, mean intensity and standard deviation of ddk, frequency and amplitude variability in connected speech, and speaking rate in connected speech. Profiles of the three groups are presented with the significant features that differentiated one from the other.  相似文献   

4.
An assessment of vocal impairment is presented for separating healthy people from persons with early untreated Parkinson's disease (PD). This study's main purpose was to (a) determine whether voice and speech disorder are present from early stages of PD before starting dopaminergic pharmacotherapy, (b) ascertain the specific characteristics of the PD-related vocal impairment, (c) identify PD-related acoustic signatures for the major part of traditional clinically used measurement methods with respect to their automatic assessment, and (d) design new automatic measurement methods of articulation. The varied speech data were collected from 46 Czech native speakers, 23 with PD. Subsequently, 19 representative measurements were pre-selected, and Wald sequential analysis was then applied to assess the efficiency of each measure and the extent of vocal impairment of each subject. It was found that measurement of the fundamental frequency variations applied to two selected tasks was the best method for separating healthy from PD subjects. On the basis of objective acoustic measures, statistical decision-making theory, and validation from practicing speech therapists, it has been demonstrated that 78% of early untreated PD subjects indicate some form of vocal impairment. The speech defects thus uncovered differ individually in various characteristics including phonation, articulation, and prosody.  相似文献   

5.
Supraglottic activity was rated from flexible endoscopic video recordings of subjects with normal laryngeal structure and function as they sustained vowels and repeated syllables and sentences. Judges rated these recordings for false vocal fold (FVF) adduction and anterior-to-posterior (A-P) compression at the initiation of the speech task, throughout the whole speech task (static supraglottic activity), and as brief individual adductions within a speech task (dynamic supraglottic activity). Significant differences in A-P (p < 0.0003) and FVF (p < 0.0000001) compression were found between tasks. Dynamic FVF activity was associated with glottal stops. Static A-P and FVF activities were present in males significantly more (p < 0.0001) than females. FVF activity associated with speech initiation was found in females significantly more (p = 0.0256) than males. Supraglottic activity plays a role in normal speech production, and should not necessarily be considered suggestive of a voice use pattern with excessive muscle tension.  相似文献   

6.
The relationship between auditory perception and vocal production has been typically investigated by evaluating the effect of either altered or degraded auditory feedback on speech production in either normal hearing or hearing-impaired individuals. Our goal in the present study was to examine this relationship in individuals with superior auditory abilities. Thirteen professional musicians and thirteen nonmusicians, with no vocal or singing training, participated in this study. For vocal production accuracy, subjects were presented with three tones. They were asked to reproduce the pitch using the vowel /a/. This procedure was repeated three times. The fundamental frequency of each production was measured using an autocorrelation pitch detection algorithm designed for this study. The musicians' superior auditory abilities (compared to the nonmusicians) were established in a frequency discrimination task reported elsewhere. Results indicate that (a) musicians had better vocal production accuracy than nonmusicians (production errors of 1/2 a semitone compared to 1.3 semitones, respectively); (b) frequency discrimination thresholds explain 43% of the variance of the production data, and (c) all subjects with superior frequency discrimination thresholds showed accurate vocal production; the reverse relationship, however, does not hold true. In this study we provide empirical evidence to the importance of auditory feedback on vocal production in listeners with superior auditory skills.  相似文献   

7.
Myotonic dystrophy type 1 (DM1) is a multisystemic disease involving multiple organ systems including central nervous system (CNS) and muscles. Few studies have focused on the central motor system in DM1, pointing to a subclinical abnormality in the CNS. The aim of our study was to investigate patterns of cerebral activation in DM1 during a motor task using functional MRI (fMRI). Fifteen DM1 patients, aged 20 to 59 years, and 15 controls of comparable age were scanned during a self-paced sequential finger-to-thumb opposition task of their dominant right hand. Functional MRI images were analyzed using SPM99. Patients underwent clinical and genetic assessment; all subjects underwent a conventional MR study. Myotonic dystrophy type 1 patients showed greater activation than controls in bilateral sensorimotor areas and inferior parietal lobules, basal ganglia and thalami, in the ipsilateral premotor area, insula and supplementary motor area (corrected P<.05). Analysis of the interaction between disease and age showed that correlation with age was significantly greater in patients than in controls in bilateral sensorimotor areas and in contralateral parietal areas. Other clinical and MR characteristics did not correlate with fMRI. Functional changes in DM1 may represent compensatory mechanisms such as reorganization and redistribution of functional networks to compensate for ultrastructural and neurochemical changes occurring as part of the accelerated aging process.  相似文献   

8.
The purpose of the present study was to examine the effect of prolonged loud reading, intended to induce fatigue, on vocal function in adults with unilateral vocal fold paralysis (UVFP). Subjects were 20 adults, 37–60 years old, with UVFP secondary to recurrent laryngeal nerve paralysis. Subjective ratings and instrumental measures of vocal function were obtained before and after reading. Statistical analysis revealed subjects rated their vocal quality and physical effort for voicing more severely following prolonged loud reading, whereas expert raters did not detect a significant perceptual difference in vocal quality. Reading fundamental frequency (Fo) was significantly increased following prolonged loud reading, as were mean airflow rates at all pitch conditions. Maximum phonation times for comfort and low pitches significantly decreased during posttests. Multiple regression analyses revealed significant associations between ratings of posttest physical effort and select posttest measures. Interpretation of results indicates the prolonged loud reading task was successful in vocally fatiguing most of the UVFP subjects. Key physiologic correlates of vocal fatigue, in individuals with UVFP, include further reduction of glottic efficiency, resulting in decreased regulation of glottic airflow and a temporary destabilization of speaking fundamental frequency.  相似文献   

9.
The purpose of this study was to investigate if there is an effect of task on determination of habitual loudness. Four tasks commonly used to elicit habitual loudness were compared (automatic speech, elicited speech, spontaneous speech, and reading aloud). Participants were adult female speakers (N=30) with normal voice. A one-way analysis of variance (ANOVA) revealed a statistically significant (p < 0.05) effect of task, with post-hoc analyses indicating that there was a statistically significant difference in habitual loudness elicited via automatic versus spontaneous speech (p < 0.05), and automatic speech versus reading aloud (p < 0.001). The issue of how habitual loudness is defined is considered. Implications of the use of one task for determination of habitual loudness are discussed, as is the possibility of a task effect on determination of other clinically useful vocal parameters.  相似文献   

10.
OBJECTIVES/HYPOTHESIS: The purpose of this study was to examine the temporal-acoustic differences between trained singers and nonsingers during speech and singing tasks. METHODS: Thirty male participants were separated into two groups of 15 according to level of vocal training (ie, trained or untrained). The participants spoke and sang carrier phrases containing English voiced and voiceless bilabial stops, and voice onset time (VOT) was measured for the stop consonant productions. RESULTS: Mixed analyses of variance revealed a significant main effect between speech and singing for /p/ and /b/, with VOT durations longer during speech than singing for /p/, and the opposite true for /b/. Furthermore, a significant phonatory task by vocal training interaction was observed for /p/ productions. CONCLUSIONS: The results indicated that the type of phonatory task influences VOT and that these influences are most obvious in trained singers secondary to the articulatory and phonatory adjustments learned during vocal training.  相似文献   

11.
Intonation stylization is studied using "chironomy," i.e., the analogy between hand gestures and prosodic movements. An intonation mimicking paradigm is used. The task of the ten subjects is to copy the intonation pattern of sentences with the help of a stylus on a graphic tablet, using a system for real-time manual intonation modification. Gestural imitation is compared to vocal imitation of the same sentences (seven for a male speaker, seven for a female speaker). Distance measures between gestural copies, vocal imitations, and original sentences are computed for performance assessment. Perceptual testing is also used for assessing the quality of gestural copies. The perceptual difference between natural and stylized contours is measured using a mean opinion score paradigm for 15 subjects. The results indicate that intonation contours can be stylized with accuracy by chironomic imitation. The results of vocal imitation and chironomic imitation are comparable, but subjects show better imitation results in vocal imitation. The best stylized contours using chironomy seems perceptually indistinguishable or almost indistinguishable from natural contours, particularly for female speech. This indicates that chironomic stylization is effective, and that hand movements can be analogous to intonation movements.  相似文献   

12.
Voice disorders, specifically vocal fatigue, are more commonly reported by women than by men. Previously, 4 women with normal untrained voices read loudly for 2 hours in an attempt to fatigue the voice. Vocal function deteriorated, as indicated by increases in phonation threshold pressure (PTP) and self-perceived phonatory effort. The increase in PTP was delayed or attenuated to some degree in 3 of the women when they drank ample amounts of water before the experiment. The current study examined the same vocal-loading task and water-drinking condition in 4 vocally normal men. PTP increased after the loud-reading task. Although 2 of the men appeared to benefit from increased systemic hydration (PTP increased more when they were underhydrated than well-hydrated), the other 2 men's data changed in the opposite direction. Phonatory effort correlated well with PTP; this varied across subject and pitch. Laryngeal endoscopy revealed an anterior glottal gap in two men after the loud-reading task. Amplitude of vocal fold vibration was judged to be reduced after the loud-reading task in three subjects when underhydrated and one subject when well hydrated. The high between-subject variability prohibits a conclusion that drinking water is beneficial to vocal function in men, but all subjects studied to date demonstrated detrimental vocal effects of prolonged loud talking.  相似文献   

13.
Measurements on the inverse filtered airflow waveform and of estimated average transglottal pressure and glottal airflow were made from syllable sequences in low, normal, and high pitch for 25 male and 20 female speakers. Correlation analyses indicated that several of the airflow measurements were more directly related to voice intensity than to fundamental frequency (F0). Results suggested that pressure may have different influences in low and high pitch in this speech task. It is suggested that unexpected results of increased pressure in low pitch were related to maintaining voice quality, that is, avoiding vocal fry. In high pitch, the increased pressure may serve to maintain vocal fold vibration. The findings suggested different underlying laryngeal mechanisms and vocal adjustments for increasing and decreasing F0 from normal pitch.  相似文献   

14.
There has been substantial progress over the last several years in understanding aspects of the functional neuroanatomy of language. Some of these advances are summarized in this review. It will be argued that recognizing speech sounds is carried out in the superior temporal lobe bilaterally, that the superior temporal sulcus bilaterally is involved in phonological-level aspects of this process, that the frontal/motor system is not central to speech recognition although it may modulate auditory perception of speech, that conceptual access mechanisms are likely located in the lateral posterior temporal lobe (middle and inferior temporal gyri), that speech production involves sensory-related systems in the posterior superior temporal lobe in the left hemisphere, that the interface between perceptual and motor systems is supported by a sensory-motor circuit for vocal tract actions (not dedicated to speech) that is very similar to sensory-motor circuits found in primate parietal lobe, and that verbal short-term memory can be understood as an emergent property of this sensory-motor circuit. These observations are considered within the context of a dual stream model of speech processing in which one pathway supports speech comprehension and the other supports sensory-motor integration. Additional topics of discussion include the functional organization of the planum temporale for spatial hearing and speech-related sensory-motor processes, the anatomical and functional basis of a form of acquired language disorder, conduction aphasia, the neural basis of vocabulary development, and sentence-level/grammatical processing.  相似文献   

15.
Classification of vocal fold vibrations is an essential task of the objective assessment of voice disorders. For historical reasons, the conventional clinical examination of vocal fold vibrations is done during stationary, sustained phonation. However, the conclusions drawn from a stationary phonation are restricted to the observed steady-state vocal fold vibrations and cannot be generalized to voice mechanisms during running speech. This study addresses the approach of classifying real-time recordings of vocal fold oscillations during a nonstationary phonation paradigm in the form of a pitch raise. The classification is based on asymmetry measures derived from a time-dependent biomechanical two-mass model of the vocal folds which is adapted to observed vocal fold motion curves with an optimization procedure. After verification of the algorithm performance the method was applied to clinical problems. Recordings of ten subjects with normal voice and ten dysphonic subjects have been evaluated during stationary as well as nonstationary phonation. In the case of nonstationary phonation the model-based classification into "normal" and "dysphonic" succeeds in all cases, while it fails in the case of sustained phonation. The nonstationary vocal fold vibrations contain additional information about vocal fold irregularities, which are needed for an objective interpretation and classification of voice disorders.  相似文献   

16.
Although the problem of vocal fatigue is not uncommon in people with voice disorders, research on objective quantifiable indicators of vocal fatigue is limited. It has been suggested that a speaker's perception of increased phonatory effort associated with periods of prolonged voice use is related to increased lung pressure required to initiate and sustain phonation. The purpose of this study was to examine the relationship among perceived phonatory effort (PPE), which was used as a subjective index of vocal fatigue, and phonation threshold pressure (PTP), a quantifiable measure defined as the minimal lung pressure required to initiate and sustain vocal fold oscillation. PTP and PPE were recorded before, during, and after five adult male and five adult female speakers engaged in a prolonged oral reading task designed to induce vocal fatigue. The results supported a direct, moderately strong relationship between PTP and PPE, particularly when PTP was measured during speech produced at comfortable and low-speaking pitch levels. No gender effects were found. PTP returned to baseline levels within 1 hour after the fatiguing task. PPE returned to baseline within 1 day. The data support the use of PTP as an objective index of vocal fatigue.  相似文献   

17.
A theory is outlined that explains the disruption that occurs when auditory feedback is altered. The key part of the theory is that the number of, and relationship between, inputs to a timekeeper, operative during speech control, affects speech performance. The effects of alteration to auditory feedback depend on the extra input provided to the timekeeper. Different disruption is predicted for auditory feedback that is out of synchrony with other speech activity (e.g., delayed auditory feedback, DAF) compared with synchronous forms of altered feedback (e.g., frequency shifted feedback, FSF). Stimulus manipulations that can be made synchronously with speech are predicted to cause equivalent disruption to the synchronous form of altered feedback. Three experiments are reported. In all of them, subjects repeated a syllable at a fixed rate (Wing and Kristofferson, 1973). Overall timing variance was decomposed into the variance of a timekeeper (Cv) and the variance of a motor process (Mv). Experiment 1 validated Wing and Kristofferson's method for estimating Cv in a speech task by showing that only this variance component increased when subjects repeated syllables at different rates. Experiment 2 showed DAF increased Cv compared with when no altered sound occurred (experiment 1) and compared with FSF. In experiment 3, sections of the subject's output sequence were increased in amplitude. Subjects just heard this sound in one condition and made a duration decision about it in a second condition. When no response was made, results were like those with FSF. When a response was made, Cv increased at longer repetition periods. The findings that the principal effect of DAF, a duration decision and repetition period is on Cv whereas synchronous alterations that do not require a decision (amplitude increased sections where no response was made and FSF) do not affect Cv, support the hypothesis that the timekeeping process is affected by synchronized and asynchronized inputs in different ways.  相似文献   

18.
Information about the acoustic properties of a talker's voice is available in optical displays of speech, and vice versa, as evidenced by perceivers' ability to match faces and voices based on vocal identity. The present investigation used point-light displays (PLDs) of visual speech and sinewave replicas of auditory speech in a cross-modal matching task to assess perceivers' ability to match faces and voices under conditions when only isolated kinematic information about vocal tract articulation was available. These stimuli were also used in a word recognition experiment under auditory-alone and audiovisual conditions. The results showed that isolated kinematic displays provide enough information to match the source of an utterance across sensory modalities. Furthermore, isolated kinematic displays can be integrated to yield better word recognition performance under audiovisual conditions than under auditory-alone conditions. The results are discussed in terms of their implications for describing the nature of speech information and current theories of speech perception and spoken word recognition.  相似文献   

19.
A method for the analysis of vocal tract parameters is developed, aimed to perform quantitative analysis of rigidity from speech signals of Parkinsonian patients. The cross-sectional area function of the vocal tract is calculated using pitch synchronous autoregressive moving average (ARMA) analysis. The changes in Parkinsonian subjects of the cross-sectional area during the utterance of sustained sounds are attributed to both Parkinsonian tremor and rigidity. In order to isolate the effects of the rigidity on the vocal tract from those of the tremor, an adaptive tremor cancellation (ATC) algorithm is developed, based on the correlation of tremor signals extracted from different locations of the speech production system.  相似文献   

20.
A new method for cancelling background noise from running speech was used to study voice production during realistic environmental noise exposure. Normal subjects, 12 women and 11 men, read a text in five conditions: quiet, soft continuous noise (75 dBA to 70 dBA), day-care babble (74 dBA), disco (87 dBA), and loud continuous noise (78 dBA to 85 dBA). The noise was presented over loudspeakers and then removed from the recordings in an off-line processing operation. The voice signals were analyzed acoustically with an automatic phonetograph and perceptually by four expert listeners. Subjective data were collected after each vocal loading task. The perceptual parameters press, instability, and roughness increased significantly as an effect of speaking loudly over noise, whereas vocal fry decreased. Having to make oneself heard over noise resulted in higher SPL and F0, as expected, and in higher phonation time. The total reading time was slightly longer in continuous noise than in intermittent noise. The women had 4 dB lower voice SPL overall and increased their phonation time more in noise than did the men. Subjectively, women reported less success making themselves heard and higher effort. The results support the contention that female voices are more vulnerable to vocal loading in background noise.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号