首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 31 毫秒
1.
This paper presents a Hilbert transform-based approach to analyze vocal fold vibrations in human subjects exhibiting normal and abnormal voice productions. This new approach is applied to the analysis of glottal area waveform (GAW) and is capable of providing useful information on the vocal fold vibration. The GAW is extracted from high-speed laryngeal images by delineating the glottal edge for each image frame. An analytic signal is generated through the Hilbert transform of the GAW, which yields a recognizable pattern of the vocal fold vibration in the analytic phase plane. The vibratory pattern is comprehensive and can be correlated with specific voice conditions. Quantitative measures of the glottal perturbation are introduced using the analytic amplitude and instantaneous frequency obtained from the analysis. Examples of clinical voice recordings are used to evaluate and test the effectiveness of this approach in providing qualitative representation and quantitative characteristics of vocal fold vibratory behavior. The results demonstrate the potential of using this new analytical tool incorporated with the high-speed laryngeal imaging modality for clinical voice assessment.  相似文献   

2.
The term “compensatory falsetto”, for the purpose of this investigation, refers to the development of an abnormally high-pitched voice in the presence of laryngeal pathology where more socially acceptable lower pitched voice production is possible. The purpose of this investigation was to compare laryngeal compensations and their effects on objective measures of vocal function during production of compensatory falsetto voice. Eighteen patients with abnormally high-pitched voice in the presence of underlying laryngeal pathology were evaluated in the Department of Otolaryngology at the University of Miami School of Medicine from January 1988 through December 1992 and were diagnosed with “compensatory falsetto”. Vocal fold paralysis (n = 11) was the most common laryngeal pathology. Vibratory characteristics were evaluated through videostrobolaryngoscopic examination. Acoustic and aerodynamic parameters assessed included fundamental frequency, jitter rate, harmonic-to-noise ratio, glottal air flow, and maximum phonation time. Production of a higher-pitched voice appeared to improve glottic closure and decrease the amount of air loss during phonation. A corresponding increase in maximum phonation time and improvement in acoustic characteristics of jitter and harmonic-to-noise ratio was also observed.  相似文献   

3.
The present study was conducted to test the hypothesis that intrinsic laryngeal muscles are involved in producing voice fundamental frequency (F(0)) responses to perturbations in voice pitch auditory feedback. Electromyography (EMG) recordings of the cricothyroid and thyroarytenoid muscles were made with hooked-wire electrodes, while subjects sustained vowel phonations at three different voice F(0) levels (conversational, high pitch in head register, and falsetto register) and received randomized pitch shifts (±100 or ±300 cents) in their voice auditory feedback. The median latencies from stimulus onset to the peak in the EMG and voice F(0) responses were 167 and 224 ms, respectively. Among the three different F(0) levels, the falsetto register produced compensatory EMG responses that occurred prior to vocal responses and increased along with rising voice F(0) responses and decreased for falling F(0) responses. For the conversational and high voice levels, the EMG response timing was more variable than in the falsetto voice, and changes in EMG activity with relevance to the vocal responses did not follow the consistent trend observed in the falsetto condition. The data from the falsetto condition suggest that both the cricothyroid and thyroarytenoid muscles are involved in generating the compensatory vocal responses to pitch-shifted voice feedback.  相似文献   

4.
On acoustic and fiberscopic studies of stop consonants, voice onset time and glottal width have been shown to be greatest in heavily aspirated stops, next greater for slightly aspirated stops, and least for unaspirated stops. Integrated activity of the thyroarytenoid and posterior cricoarytenoid muscles has been reported to be involved in differentiating aspirate characteristics of the stops. However, the fine movement of mucosal edges of vocal folds during the production of stops has not been well documented. In recent years, a new method for high-speed digital recording of laryngeal dynamics has made this possible. In the current study, the movements of vocal fold edges were documented during the period of stop production using a fiberscopic system of high-speed digital images. By observing the glottal width and the visual vibratory movements of vocal folds before voice onset, the heavily aspirated stop was characterized as being more prominent and dynamic than the slightly aspirated and unaspirated stops.  相似文献   

5.
The purpose of this exploratory study was to determine if laryngeal transillumination in combination with stroboscopy (strobophotoglottography; SPGG) is useful for (1) the visualization of vocal fold vibration (VFV) opening patterns, (2) the localization of initial vocal fold opening in horizontal glottal thirds (anterior, midmembranous, and posterior), (3) determination of the temporal correspondence of the so-called electroglottography (EGG)-knee and initial vocal fold separation, and, finally, (4) automatized quantitative measurements of glottal area function within endoscopic images. With stroboscopic transillumination, initial inferior vocal fold separation was detectable during the "closed" phase, where the vocal folds were still closed in the upper portion and therefore initial inferior vocal fold separation could not be visualized with usual laryngoscopy techniques. In the horizontal plane within similar fundamental frequencies in modal voice registers in two male subjects, localization of initial glottal opening depended on the voice types used (soft, normal, or pressed phonation). We found zipperlike posterior-to-anterior openings, initial midmembranous openings, initial anterior openings, as well as simultaneous initial opening of all three portions in the two healthy male adults examined. This technique proved to add temporal and spatial information to vocal fold opening patterns and extends our examination techniques to the very beginning of vocal fold opening at the inferior portion. Simultaneous electroglottogram tracking and comparison with bidirectionally illuminated stroboscopic images revealed a time-locked correspondence of the EGG-knee with the aforementioned initial inferior vocal fold separation. Bidirectional illumination combined with digital color extraction techniques allowed for image separation of subglottally and supraglottally illuminated structures. This facilitated vocal fold contour detection and automatized image processing, for example, for determination of glottal area function, and is considered to be a further step to objective automatized quantitative measurements within endoscopic images.  相似文献   

6.
Different source-related factors can lead to vocal fold instabilities and bifurcations referred to as voice breaks. Nonlinear coupling in phonation suggests that changes in acoustic loading can also be responsible for this unstable behavior. However, no in vivo visualization of tissue motion during these acoustically induced instabilities has been reported. Simultaneous recordings of laryngeal high-speed videoendoscopy, acoustics, aerodynamics, electroglottography, and neck skin acceleration are obtained from a participant consistently exhibiting voice breaks during pitch glide maneuvers. Results suggest that acoustically induced and source-induced instabilities can be distinguished at the tissue level. Differences in vibratory patterns are described through kymography and phonovibrography; measures of glottal area, open/speed quotient, and amplitude/phase asymmetry; and empirical orthogonal function decomposition. Acoustically induced tissue instabilities appear abruptly and exhibit irregular vocal fold motion after the bifurcation point, whereas source-induced ones show a smoother transition. These observations are also reflected in the acoustic and acceleration signals. Added aperiodicity is observed after the acoustically induced break, and harmonic changes appear prior to the bifurcation for the source-induced break. Both types of breaks appear to be subcritical bifurcations due to the presence of hysteresis and amplitude changes after the frequency jumps. These results are consistent with previous studies and the nonlinear source-filter coupling theory.  相似文献   

7.
《Journal of voice》2023,37(2):298.e11-298.e29
IntroductionTypical singing registers are the chest and falsetto; however, trained singers have an additional register, namely, the mixed register. The mixed register, which is also called “mixed voice” or “mix,” is an important technique for singers, as it can help bridge from the chest voice to falsetto without noticeable voice breaks.ObjectiveThe present study aims to reveal the nature of the voice-production mechanism of the different registers (chest, mix, and falsetto) using high-speed digital imaging (HSDI), electroglottography (EGG), and acoustic and aerodynamic measurements.Study DesignCross-sectional study.MethodsAerodynamic measurements were acquired for twelve healthy singers (six men and women) during the phonation of a variety of pitches using three registers. HSDI and EGG devices were simultaneously used on three healthy singers (two men and one woman) from which an open quotient (OQ) and speed quotient (SQ) were detected. Audio signals were recorded for five sustained vowels, and a spectral analysis was conducted to determine the amplitude of each harmonic component. Furthermore, the absolute (not relative) value of the glottal volume flow was estimated by integrating data obtained from the HSDI and aerodynamic studies.ResultsFor all singers, the subglottal pressure (PSub) was the highest for the chest in the three registers, and the mean flow rate (MFR) was the highest for the falsetto. Conversely, the PSub of the mix was as low as the falsetto, and the MFR of the mix was as low as the chest. The HSDI analysis showed that the OQ differed significantly among the registers, even when the fundamental frequency was the same; the OQ of the mix was higher than that of the chest but lower than that of the falsetto. The acoustic analysis showed that, for the mix, the harmonic structure was intermediate between the chest and falsetto. The results of the glottal volume-flow analysis revealed that the maximum volume velocity was the least for the mix register at every fundamental frequency. The first and second harmonic (H1-H2) difference of the voice source spectrum was the greatest for the falsetto, then the mix, and finally, the chest.ConclusionsWe found differences in the registers in terms of the aeromechanical mechanisms and vibration patterns of the vocal folds. The mixed register proved to have a distinct voice-production mechanism, which can be differentiated from those of the chest or falsetto registers.  相似文献   

8.
There has been a lack of objective data on the singing voice registers, particularly on the so called "whistle" register, occurring in the top part of the female pitch range, which is accessible only to some singers. This study offers unique strobolaryngoscopic and high-speed (7812.5 imagess) videokymographic data on the vocal fold behavior of an untrained female singer capable of producing three distinct voice qualities, i.e., the chest, head and whistle registers. The sound was documented spectrographically. The transition from chest to head register, accompanied by pitch jumps, occurred around tones B4-C#5 (500-550 Hz) and was found to be associated with a slight decrease in arytenoids adduction, resulting in decrease of the closed quotient. The register shifts from head to whistle, also accompanied by pitch jumps, occurred around tones E5-B5 (670-1000 Hz) without any noticeable changes in arytenoids adduction. Some evidence was found for the vocal tract influence on this transition. The mechanism of the vocal fold vibration in whistle register was found principally similar to that at lower registers: vibrations along the whole glottal length and vertical phase differences (indicated by sharp lateral peaks in videokymography) were seen on the vocal folds up to the highest tone G6 (1590 Hz).  相似文献   

9.
We present a comprehensive, functional analysis of clinical voice data derived from both high-speed digital imaging (HSDI) of the larynx and simultaneously acquired acoustic recordings. The goals of this study are to: (1) correlate dynamic characteristics of the vocal folds derived from direct laryngeal imaging with indirectly acquired acoustic measurements; (2) define the advantages of using a combined imaging/acoustic approach for the analysis of voice condition; and (3) identify new quantitative measures to evaluate the regularity of the vocal fold vibration and the complexity of the vocal output -- these measures will be key to successful diagnosis of vocal abnormalities. Image- and acoustic-based analyses are performed using an analytic phase plot approach previously introduced by our group (referred to as 'Nyquist' plot). Fast Fourier Transform (FFT) spectral analyses are performed on the same data for a comparison. Clinical HSDI and acoustic recordings from subjects having normal and specific voice pathologies, including muscular tension dysphonia (MTD) and recurrent respiratory papillomatosis (RRP) were analyzed using the Nyquist plot approach. The results of these analyses show that a combined imaging/acoustic analysis approach provides better characterization of the vibratory behavior of the vocal folds as it correlates with vocal output and pathology.  相似文献   

10.
Vocal quality factors: analysis, synthesis, and perception.   总被引:4,自引:0,他引:4  
The purpose of this study was to examine several factors of vocal quality that might be affected by changes in vocal fold vibratory patterns. Four voice types were examined: modal, vocal fry, falsetto, and breathy. Three categories of analysis techniques were developed to extract source-related features from speech and electroglottographic (EGG) signals. Four factors were found to be important for characterizing the glottal excitations for the four voice types: the glottal pulse width, the glottal pulse skewness, the abruptness of glottal closure, and the turbulent noise component. The significance of these factors for voice synthesis was studied and a new voice source model that accounted for certain physiological aspects of vocal fold motion was developed and tested using speech synthesis. Perceptual listening tests were conducted to evaluate the auditory effects of the source model parameters upon synthesized speech. The effects of the spectral slope of the source excitation, the shape of the glottal excitation pulse, and the characteristics of the turbulent noise source were considered. Applications for these research results include synthesis of natural sounding speech, synthesis and modeling of vocal disorders, and the development of speaker independent (or adaptive) speech recognition systems.  相似文献   

11.
Videokymographic images of deviant or irregular vocal fold vibration, including diplophonia, the transition from falsetto to modal voice, irregular vibration onset and offset, and phonation following partial laryngectomy were compared with the synchronously recorded acoustic speech signals. A clear relation was shown between videokymographic image sequences and acoustic speech signals, and the effect of irregular or incomplete vocal fold vibration patterns was recognized in the amount of perceived breathiness and roughness and by the harmonics-to-noise ratio in the speech signal. Mechanisms causing roughness are the presence of mucus, phase differences between the left and right vocal fold, and short-term frequency and amplitude modulation. It can be concluded that the use of simultaneously recorded videokymographic image sequences and speech signals contributes to the understanding of the effect of irregular vocal fold vibration on voice quality.  相似文献   

12.
Occurrences of period-doubling are found in human phonation, in particular for pathological and some singing phonations such as Sardinian A Tenore Bassu vocal performance. The combined vibration of the vocal folds and the ventricular folds has been observed during the production of such low pitch bass-type sound. The present study aims to characterize the physiological correlates of this acoustical production and to provide a better understanding of the physical interaction between ventricular fold vibration and vocal fold self-sustained oscillation. The vibratory properties of the vocal folds and the ventricular folds during phonation produced by a professional singer are analyzed by means of acoustical and electroglottographic signals and by synchronized glottal images obtained by high-speed cinematography. The periodic variation in glottal cycle duration and the effect of ventricular fold closing on glottal closing time are demonstrated. Using the detected glottal and ventricular areas, the aerodynamic behavior of the laryngeal system is simulated using a simplified physical modeling previously validated in vitro using a larynx replica. An estimate of the ventricular aperture extracted from the in vivo data allows a theoretical prediction of the glottal aperture. The in vivo measurements of the glottal aperture are then compared to the simulated estimations.  相似文献   

13.
In this study, we evaluated the relationship between laryngeal function measures and glottal gap ratio and normalized measures of supraglottic behaviors in patients with unilateral vocal fold paresis (UVFP). Thirty-one patients were found to have unilateral vocal fold paresis by videoendoscopy and laryngeal electromyography, and 13 controls participated in this study. Patients with UVFP demonstrated significantly larger glottal gap ratios (p = 0.016) than control subjects. The nonparalyzed or contralateral vocal fold was associated with significantly more static false vocal fold compression (p = 0.03) compared with the paralyzed vocal fold or with the controls. Patients with unilateral vocal fold paresis were divided into subgroups: those with normal or abnormal maximum phonation time, flow, or pressure measures. Smaller glottal gap ratios were identified in patients with normal maximum phonation times and flow measures. Greater false vocal fold activity was identified in unilateral vocal fold paresis patients with normal laryngeal function measures than in unilateral vocal fold paresis patients with abnormal measures. These findings suggest that some patients with documented unilateral paresis and glottal incompetence can compensate for vocal fold weakness such that their acoustic and aerodynamic measures are normal.  相似文献   

14.
Vocal fold vibratory asymmetry is often associated with inefficient sound production through its impact on source spectral tilt. This association is investigated in both a computational voice production model and a group of 47 human subjects. The model provides indirect control over the degree of left-right phase asymmetry within a nonlinear source-filter framework, and high-speed videoendoscopy provides in vivo measures of vocal fold vibratory asymmetry. Source spectral tilt measures are estimated from the inverse-filtered spectrum of the simulated and recorded radiated acoustic pressure. As expected, model simulations indicate that increasing left-right phase asymmetry induces steeper spectral tilt. Subject data, however, reveal that none of the vibratory asymmetry measures correlates with spectral tilt measures. Probing further into physiological correlates of spectral tilt that might be affected by asymmetry, the glottal area waveform is parameterized to obtain measures of the open phase (open/plateau quotient) and closing phase (speed/closing quotient). Subjects' left-right phase asymmetry exhibits low, but statistically significant, correlations with speed quotient (r=0.45) and closing quotient (r=-0.39). Results call for future studies into the effect of asymmetric vocal fold vibration on glottal airflow and the associated impact on voice source spectral properties and vocal efficiency.  相似文献   

15.
Mongolian "throat singing" can be performed in different modes. In Mongolia, the bass-type is called Kargyraa. The voice source in bass-type throat singing was studied in one male singer. The subject alternated between modal voice and the throat singing mode. Vocal fold vibrations were observed with high-speed photography, using a computerized recording system. The spectral characteristics of the sound signal were analyzed. Kymographic image data were compared to the sound signal and flow inverse filtering data from the same singer were obtained on a separate occasion. It was found that the vocal folds vibrated at the same frequency throughout both modes of singing. During throat singing the ventricular folds vibrated with complete but short closures at half the frequency of the true vocal folds, covering every second vocal fold closure. Kymographic data confirmed the findings. The spectrum contained added subharmonics compared to modal voice. In the inverse filtered signal the amplitude of every second airflow pulse was considerably lowered. The ventricular folds appeared to modulate the sound by reducing the glottal flow of every other vocal fold vibratory cycle.  相似文献   

16.

Purpose

Professional male altos (countertenors) mostly use a register function, which is considered to be derived from falsetto. However, the sound produced differs in professional altos compared with the modal register or falsetto of untrained voices. The aim of this study was to analyze differences of the vocal tract shapes in male alto register functions.

Material and Methods

Dynamic real-time magnetic resonance imaging of eight frames per second was used to analyze the vocal tract profile in seven professional male altos who sang on the vowel /a/, an ascending and descending scale from G3 (196 Hz) to E4 (330 Hz). The scale included their register transition from modal register to stage (counter) falsetto and naïve falsetto.

Results

Register transitions from modal register to stage falsetto were associated with increased lip opening, jaw retraction, elevation and back positioning of the tongue, pharynx narrowing, uvula elevation, drop of larynx height, and tilting of the larynx. Differences between stage and naïve falsetto were found mostly with regard to lip opening and pharynx width.

Conclusions

The differences between the vocal tract configurations might have an impact on the acoustic characteristics observed in professional male alto register functions.  相似文献   

17.
高速摄影成像分析声带振动发声的前后不对称性   总被引:1,自引:0,他引:1       下载免费PDF全文
张宇  杨帅  黄楠木  李琳 《声学学报》2017,42(3):341-347
高速摄影成像直接观察到声带振动的前后不对称性。将11个离体狗喉声带进行发声实验,设置3组声门下压分别为10 cm H2O,20 cm H2O和30 cm H2O,利用高速摄像仪和传声器,分别记录不同声门下压的声带振动图像和声信号.对高速摄影成像与同步采集的声信号基频进行定量分析和比较,基频均随声门下压的增大而增加。此外,对两种测量方法得到的基频进行相关分析比较,得到在同一声门下压下两种方法的基频相关系数均大于0.9,表明高速摄影成像得到的基频与声信号的基频具有高度相关性。高速摄影成像能直观地测量声带振动行为,对研究声带振动发声机理提供了有价值的测量手段。高速摄影获得的声带线性结构上25%,50%,75%位置处的振动幅度,显示了声带前后振动不对称且声门下压较低时振动不对称较明显。   相似文献   

18.
Simultaneous measurements of mean airflow rate, vocal intensityand fundamental frequency were made during flexible video endoscopic recording of the vowel /i/ sustained in two vocal registers, modal and falsetto. The glottal closure patterns of four males and four females were evaluated by visually inspecting the video images. Acoustic signals were recorded and analyzed to verify the frequency and intensity criteria. Aerodynamic analysis of mean airflow rate was done via Rothenberg mask and commercial software. Incomplete glottic closure was common in both males and females. The degree of closure was significantly higher for modal samples than for falsetto samples with frequency and intensity held constant. The shape of the glottal closure did not vary with changes in the mode of phonation. As expected, the mean airflow rate increased with decreased glottal closure. The results suggest that incomplete glottic closure should be considered as a normal glottal configuration in high frequency modal and falsetto phonation. Moreover, hourglass and spindle glottal configurations may also be found in both the modal and falsetto registers of normal subjects. These results also confirm the positive relationships between degree of glottal gap and mean airflow rate. Thus, mean airflow rate may be regarded as a criterion for judging degree of glottal closure.  相似文献   

19.
Sustained high notes, diminishing gradually from the loudest to the softest phonation within a maneuver called messa di voce, are examined in two contrasting professional tenor voices. Signals of the sound pressure level, electroglottograph, and mean esophageal pressure are recorded, and similar maneuvers by the same subjects are examined stroboscopically. The lyric voice is found to make a gradual diminuendo while maintaining nearly constant posture of the vocal tract together with a phase of complete closure in the glottal cycle. The robust voice, by contrast, passes abruptly from a production of high subglottal pressure and a high closed quotient to one of low pressure and incomplete closure, and the transition is marked by a sudden opening of the previously constricted laryngeal collar. It is proposed that the mode of soft voice production demonstrated by the robust voice be recognized as a distinct register of the singing voice.  相似文献   

20.
Mongolian “throat singing” can be performed in different modes. In Mongolia, the bass-type is called Kargyraa. The voice source in bass-type throat singing was studied in one male singer. The subject alternated between modal voice and the throat singing mode. Vocal fold vibrations were observed with high-speed photography, using a computerized recording system. The spectral characteristics of the sound signal were analyzed. Kymographic image data were compared to the sound signal and flow inverse filtering data from the same singer were obtained on a separate occasion. It was found that the vocal folds vibrated at the same frequency throughout both modes of singing. During throat singing the ventricular folds vibrated with complete but short closures at half the frequency of the true vocal folds, covering every second vocal fold closure. Kymographic data confirmed the findings. The spectrum contained added subharmonics compared to modal voice. In the inverse filtered signal the amplitude of every second airflow pulse was considerably lowered. The ventricular folds appeared to modulate the sound by reducing the glottal flow of every other vocal fold vibratory cycle.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号