首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 31 毫秒
1.
2.
3.
Head extension with protruded tongue is the position for video-laryngoscopy and simultaneous glottographic recordings including photoglottographic signals. This study investigated the effect of head extension and tongue protrusion on the measures of fundamental frequency, frequency perturbation (jitter), and amplitude perturbation (shimmer). Acoustic signals recorded during sustained vowels were obtained from 49 women and 66 men with no speech or voice disorders in different head-tongue positions. Head extension was associated with increased fundamental frequency and decreased shimmer. In men, head extension did not appear to affect jitter. When the tongue was protruded, head extension tended to lower jitter. For both genders, tongue protrusion was associated with decreased fundamental frequency with head extension. In the men, tongue protrusion tended to increase shimmer when the head was in the neutral position. In the women, tongue protrusion was associated with increased jitter and increased shimmer and was most evident in the head-neutral position. These findings supported a physical linkage hypothesis of the relationship between vocal tract configuration and vocal fold vibration, suggesting that head-tongue position must be taken into account when comparing voice measures.  相似文献   

4.
Previous investigations have shown that one mechanism of irregular vocal fold vibration may be a desynchronization of two or more vibratory modes of the vocal fold tissues. In the current investigation, mechanisms of irregular vibration were further examined using a self-oscillating, physical model of vocal fold vibration, a hemi-model methodology, and high-speed, stereoscopic, digital imaging. Using the method of empirical eigen-functions, a spatiotemporal analysis revealed mechanisms of irregular vibration in subharmonic phonation and biphonation, which were not disclosed in a standard acoustic spectrum.  相似文献   

5.
Vocal quality factors: analysis, synthesis, and perception.   总被引:4,自引:0,他引:4  
The purpose of this study was to examine several factors of vocal quality that might be affected by changes in vocal fold vibratory patterns. Four voice types were examined: modal, vocal fry, falsetto, and breathy. Three categories of analysis techniques were developed to extract source-related features from speech and electroglottographic (EGG) signals. Four factors were found to be important for characterizing the glottal excitations for the four voice types: the glottal pulse width, the glottal pulse skewness, the abruptness of glottal closure, and the turbulent noise component. The significance of these factors for voice synthesis was studied and a new voice source model that accounted for certain physiological aspects of vocal fold motion was developed and tested using speech synthesis. Perceptual listening tests were conducted to evaluate the auditory effects of the source model parameters upon synthesized speech. The effects of the spectral slope of the source excitation, the shape of the glottal excitation pulse, and the characteristics of the turbulent noise source were considered. Applications for these research results include synthesis of natural sounding speech, synthesis and modeling of vocal disorders, and the development of speaker independent (or adaptive) speech recognition systems.  相似文献   

6.
There has been a lack of objective data on the singing voice registers, particularly on the so called "whistle" register, occurring in the top part of the female pitch range, which is accessible only to some singers. This study offers unique strobolaryngoscopic and high-speed (7812.5 imagess) videokymographic data on the vocal fold behavior of an untrained female singer capable of producing three distinct voice qualities, i.e., the chest, head and whistle registers. The sound was documented spectrographically. The transition from chest to head register, accompanied by pitch jumps, occurred around tones B4-C#5 (500-550 Hz) and was found to be associated with a slight decrease in arytenoids adduction, resulting in decrease of the closed quotient. The register shifts from head to whistle, also accompanied by pitch jumps, occurred around tones E5-B5 (670-1000 Hz) without any noticeable changes in arytenoids adduction. Some evidence was found for the vocal tract influence on this transition. The mechanism of the vocal fold vibration in whistle register was found principally similar to that at lower registers: vibrations along the whole glottal length and vertical phase differences (indicated by sharp lateral peaks in videokymography) were seen on the vocal folds up to the highest tone G6 (1590 Hz).  相似文献   

7.
SUMMARY: The aim of this study was to investigate how different acoustic parameters, extracted both from speech pressure waveforms and glottal flows, can be used in measuring vocal loading in modern working environments and how these parameters reflect the possible changes in the vocal function during a working day. In addition, correlations between objective acoustic parameters and subjective voice symptoms were addressed. The subjects were 24 female and 8 male customer-service advisors, who mainly use telephone during their working hours. Speech samples were recorded from continuous speech four times during a working day and voice symptom questionnaires were completed simultaneously. Among the various objective parameters, only F0 resulted in a statistically significant increase for both genders. No correlations between the changes in objective and subjective parameters appeared. However, the results encourage researchers within the field of occupational voice use to apply versatile measurement techniques in studying occupational voice loading.  相似文献   

8.
We describe an arrangement for simultaneous recording of speech and vocal tract geometry in patients undergoing surgery involving this area. Experimental design is considered from an articulatory phonetic point of view. The speech signals are recorded with an acoustic-electrical arrangement. The vocal tract is simultaneously imaged with MRI. A MATLAB-based system controls the timing of speech recording and MR image acquisition. The speech signals are cleaned from acoustic MRI noise by an adaptive signal processing algorithm. Finally, a vowel data set from pilot experiments is qualitatively compared both with validation data from the anechoic chamber and with Helmholtz resonances of the vocal tract volume, obtained using FEM.  相似文献   

9.
Quantifiable aspects of vocal fold vibration may be inferred by means of the electrolaryngograph. Changes in inferred vocal fold closed quotient are considered as a possible correlate of acoustic efficiency variation; automatic measures of closed quotient compare favorably with results obtained from inverse filtering of the speech pressure waveform. This article describes closed quotient measures based on electrolaryngographic analysis of 18 trained and untrained men singers, and results show a significant difference in mean vocal fold closed quotient between trained and untrained singers.  相似文献   

10.
Measurements on the inverse filtered airflow waveform and of estimated average transglottal pressure and glottal airflow were made from syllable sequences in low, normal, and high pitch for 25 male and 20 female speakers. Correlation analyses indicated that several of the airflow measurements were more directly related to voice intensity than to fundamental frequency (F0). Results suggested that pressure may have different influences in low and high pitch in this speech task. It is suggested that unexpected results of increased pressure in low pitch were related to maintaining voice quality, that is, avoiding vocal fry. In high pitch, the increased pressure may serve to maintain vocal fold vibration. The findings suggested different underlying laryngeal mechanisms and vocal adjustments for increasing and decreasing F0 from normal pitch.  相似文献   

11.
Vocal fold vibratory asymmetry is often associated with inefficient sound production through its impact on source spectral tilt. This association is investigated in both a computational voice production model and a group of 47 human subjects. The model provides indirect control over the degree of left-right phase asymmetry within a nonlinear source-filter framework, and high-speed videoendoscopy provides in vivo measures of vocal fold vibratory asymmetry. Source spectral tilt measures are estimated from the inverse-filtered spectrum of the simulated and recorded radiated acoustic pressure. As expected, model simulations indicate that increasing left-right phase asymmetry induces steeper spectral tilt. Subject data, however, reveal that none of the vibratory asymmetry measures correlates with spectral tilt measures. Probing further into physiological correlates of spectral tilt that might be affected by asymmetry, the glottal area waveform is parameterized to obtain measures of the open phase (open/plateau quotient) and closing phase (speed/closing quotient). Subjects' left-right phase asymmetry exhibits low, but statistically significant, correlations with speed quotient (r=0.45) and closing quotient (r=-0.39). Results call for future studies into the effect of asymmetric vocal fold vibration on glottal airflow and the associated impact on voice source spectral properties and vocal efficiency.  相似文献   

12.
In this paper, the acoustic-phonetic characteristics of steady apical trills--trill sounds produced by the periodic vibration of the apex of the tongue--are studied. Signal processing methods, namely, zero-frequency filtering and zero-time liftering of speech signals, are used to analyze the excitation source and the resonance characteristics of the vocal tract system, respectively. Although it is natural to expect the effect of trilling on the resonances of the vocal tract system, it is interesting to note that trilling influences the glottal source of excitation as well. The excitation characteristics derived using zero-frequency filtering of speech signals are glottal epochs, strength of impulses at the glottal epochs, and instantaneous fundamental frequency of the glottal vibration. Analysis based on zero-time liftering of speech signals is used to study the dynamic resonance characteristics of vocal tract system during the production of trill sounds. Qualitative analysis of trill sounds in different vowel contexts, and the acoustic cues that may help spotting trills in continuous speech are discussed.  相似文献   

13.
This paper presents a Hilbert transform-based approach to analyze vocal fold vibrations in human subjects exhibiting normal and abnormal voice productions. This new approach is applied to the analysis of glottal area waveform (GAW) and is capable of providing useful information on the vocal fold vibration. The GAW is extracted from high-speed laryngeal images by delineating the glottal edge for each image frame. An analytic signal is generated through the Hilbert transform of the GAW, which yields a recognizable pattern of the vocal fold vibration in the analytic phase plane. The vibratory pattern is comprehensive and can be correlated with specific voice conditions. Quantitative measures of the glottal perturbation are introduced using the analytic amplitude and instantaneous frequency obtained from the analysis. Examples of clinical voice recordings are used to evaluate and test the effectiveness of this approach in providing qualitative representation and quantitative characteristics of vocal fold vibratory behavior. The results demonstrate the potential of using this new analytical tool incorporated with the high-speed laryngeal imaging modality for clinical voice assessment.  相似文献   

14.
《Journal of voice》2014,28(4):440-448
ObjectiveTo correlate change in Voice Handicap Index (VHI)-10 scores with corresponding voice laboratory measures across five voice disorders.Study DesignRetrospective study.MethodsOne hundred fifty patients aged >18 years with primary diagnosis of vocal fold lesions, primary muscle tension dysphonia-1, atrophy, unilateral vocal fold paralysis (UVFP), and scar. For each group, participants with the largest change in VHI-10 between two periods (TA and TB) were selected. The dates of the VHI-10 values were linked to corresponding acoustic/aerodynamic and audio-perceptual measures. Change in voice laboratory values were analyzed for correlation with each other and with VHI-10.ResultsVHI-10 scores were greater for patients with UVFP than other disorders. The only disorder-specific correlation between voice laboratory measure and VHI-10 was average phonatory airflow in speech for patients with UVFP. Average airflow in repeated phonemes was strongly correlated with average airflow in speech (r = 0.75). Acoustic measures did not significantly change between time points.ConclusionsThe lack of correlations between the VHI-10 change scores and voice laboratory measures may be due to differing constructs of each measure; namely, handicap versus physiological function. Presuming corroboration between these measures may be faulty. Average airflow in speech may be the most ecologically valid measure for patients with UVFP. Although aerodynamic measures changed between the time points, acoustic measures did not. Correlations to VHI-10 and change between time points may be found with other acoustic measures.  相似文献   

15.
A hypophonic voice, characterized perceptually as weak and breathy, is associated with voice disorders such as vocal fold atrophy and unilateral vocal fold paralysis. Although voice therapy programs for hypophonia typically address the vocal folds or the sound source, twang voice quality was examined in this study as an alternative technique for increasing vocal power by altering the epilarynx or the sound filter. OBJECTIVE: This study investigated the effect of twang production on physiologic, acoustic, and perceived voice handicap measures in speakers with hypophonia. DESIGN/METHODS: This prospective pilot study compared the vocal outcomes of six participants with hypophonia at pre- and posttreatment time points. Outcome measures included mean airflow rate, intensity in dB sound pressure level (SPL), maximum phonation time, and self-report of voice handicap. RESULTS: All subjects improved in at least three of the four vocal outcome measures. Wilcoxon signed-rank test of paired differences revealed significant differences between pre- and posttherapy group means for airflow rate, SPL, and Voice Handicap Index scores. CONCLUSION: The twang voice quality as a manipulation of the sound filter offers a clinical complement to traditional voice therapies that primarily address the sound source.  相似文献   

16.
While vocal fold adduction is an important parameter in speech, relatively little has been known on the adjustment of the vocal fold adduction in singing. This study investigates the possibility of separate adjustments of cartilaginous and membranous vocal fold adduction in singing. Six female and seven male subjects, singers and non-singers, were asked to imitate an instructor in producing four phonation types: "aBducted falsetto" (FaB), "aDducted falsetto" (FaD), "aBducted Chest" (CaB), and "aDducted Chest" (CaD). The phonations were evaluated using videostroboscopy, videokymography (VKG), electroglottography (EGG), and audio recordings. All the subjects showed less posterior (cartilaginous) vocal fold adduction in phonation types FaB and CaB than in FaD and CaD, and less membranous vocal fold adduction (smaller closed quotient) in FaB and FaD than in CaB and CaD. The findings indicate that the exercises enabled the singers to separately manipulate (a) cartilaginous adduction and (b) membranous medialization of the glottis though vocal fold bulging. Membranous adduction (monitored via videokymographic closed quotient) was influenced by both membranous medialization and cartilaginous adduction. Individual control over these types of vocal fold adjustments allows singers to create different vocal timbres.  相似文献   

17.
18.
Teachers have a high percentage of voice problems. For voice disordered teachers, resonant voice therapy is hypothesized to reduce voice problems. No research has been done on the physiological, acoustic, and aerodynamic effects of resonant voice therapy for school teachers. The purpose of this study is to investigate resonant voice therapy outcome from perceptual, physiological, acoustic, aerodynamic, and functional aspects for female teachers with voice disorders. A prospective study was designed for this research. The research subjects were 24 female teachers in Taipei. All subjects received resonant voice therapy in groups of 4 subjects, 90 minutes per session, and 1 session per week for 8 weeks. The outcome of resonant voice therapy was assessed from auditory perceptual judgment, videostroboscopic examination, acoustic measurements, aerodynamic measurements, and functional measurements before and after therapy. After therapy the severity of roughness, strain, monotone, resonance, hard attack, and glottal fry in auditory perceptual judgments, the severity of vocal fold pathology, mucosal wave, amplitude, and vocal fold closure in videostroboscopic examinations, phonation threshold pressure, and the score of physical scale in the Voice Handicap Index were significantly reduced. The speaking Fo, maximum range of speaking Fo, and maximum range of speaking intensity were significantly increased after therapy. No significant change was found in perturbation and breathiness measurements after therapy. Resonant voice therapy is effective for school teachers and is suggested as one of the therapy approaches in clinics for this population.  相似文献   

19.
《Journal of voice》2019,33(6):851-859
PurposeThe pitch-shift reflex (PSR) is the adaptation of the fundamental frequency during phonation and speech and describes the auditory feedback control. Speakers without voice and speech disorders mostly show a compensation of the pitch change in the auditory feedback and adapt their fundamental frequency to the opposite direction. Dysphonic patients often display problems with the auditory perception and control of their voice during therapy. Our study focuses on the auditory and kinesthetic control mechanisms of patients with muscle tension dysphonia (MTD) and speakers without voice and speech problems. Main purpose of the study is the analysis of the functionality of the control mechanisms within phonation and speech between patients with MTD and normal speakers.MethodSixty-one healthy subjects (17 male, 44 female) and 22 patients with MTD (7 male, 15 female) participated following two paradigms including a sustained phonation (vowel /a/) and speech ([‘mama]). Within both paradigms the fundamental frequency of the auditory feedback was increased synthetically. For the analysis of the PSR the electroencephalogram, electroglottography, the voice signal, and the high-speed endoscopy data were recorded simultaneously. The PSR in the electroencephalogram was detected via the N100 and the mismatch negativity. Statistical tests were applied for the detection of the PSR in the physiological response within the electroglottography, voice, and high-speed endoscopy signals. The results were compared between both groups.ResultsNo differences were found between the controls and patients with MTD regarding latency and magnitude of the perception of the pitch shift in both paradigms, but for the magnitude of the behavioral response. Differences also could be found for both groups between the “no pitch” and “pitch” condition of the two paradigms regarding vocal fold dynamics and voice quality. Patients with MTD showed more vibrational irregularities during the PSR than the controls, especially regarding the symmetry of vocal fold dynamics.ConclusionPatients with MTD seem to have a disturbed interaction between the auditory and kinesthetic feedback inducing the execution of an overriding behavioral response.  相似文献   

20.
Acoustic and glottographic measures may provide important information that could enhance clinical management and documentation of vocal dysfunction. Acoustic measures such as jitter and shimmer reflect “short-term” perturbations, or instabilities of the voice, and the coefficients of variation for frequency and for amplitude reflect “long-term” perturbations. Interpretations of these acoustic measures are based on the assumption that vocal perturbations may be related to laryngeal tissue abnormalities, asymmetries in vocal fold movement, or neuromuscular fluctuations in the respiratory, laryngeal, or vocal tract systems. The abduction quotient is a glottographic measure related to laryngeal adduction and is obtained from an analysis of the electroglottograph signal. The adduction measure appears to be independent of the acoustic perturbation measures. Interpretations of the acoustic and adductory measures may, therefore, complement each other for greater understanding of a patient's laryngeal behavior. Visual displays of the acoustic and glottographic signals also are discussed to demonstrate their value in voice signal interpretations. Case studies illustrate potential interpretations of the acoustic perturbation and abduction quotient measures.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号