Similar articles
 20 similar articles found (search time: 46 ms)
1.
Vocal quality factors: analysis, synthesis, and perception.   Total citations: 4, self-citations: 0, citations by others: 4
The purpose of this study was to examine several factors of vocal quality that might be affected by changes in vocal fold vibratory patterns. Four voice types were examined: modal, vocal fry, falsetto, and breathy. Three categories of analysis techniques were developed to extract source-related features from speech and electroglottographic (EGG) signals. Four factors were found to be important for characterizing the glottal excitations for the four voice types: the glottal pulse width, the glottal pulse skewness, the abruptness of glottal closure, and the turbulent noise component. The significance of these factors for voice synthesis was studied and a new voice source model that accounted for certain physiological aspects of vocal fold motion was developed and tested using speech synthesis. Perceptual listening tests were conducted to evaluate the auditory effects of the source model parameters upon synthesized speech. The effects of the spectral slope of the source excitation, the shape of the glottal excitation pulse, and the characteristics of the turbulent noise source were considered. Applications for these research results include synthesis of natural-sounding speech, synthesis and modeling of vocal disorders, and the development of speaker-independent (or adaptive) speech recognition systems.

2.
Synchronized videostroboscopy and electroglottography were applied to the measurement of anterior-to-posterior open glottal length in four groups of patients: two with no clinically significant voice disorder, one with vocal fold polyps, and one with vocal fold nodules. The data showed that the groups did not differ significantly when open glottal length was measured at the time of minimum glottal opening. The pathological groups had significantly lower open glottal length measurements, however, when measurements were obtained at the time that vocal fold contact was initiated during the glottal cycle. The findings are preliminary evidence that vocal fold neoplasms may not have the effect of reducing glottal closure, as previously suggested in the literature. The data also highlight the importance of examining differential effects of vocal fold neoplasms at various points throughout the glottal cycle.

3.
Noninvasive measures of vocal fold activity are useful for describing normal and disordered voice production. Measures of open and speed quotient from glottal airflow and electroglottographic (EGG) waveforms have been used to describe timing events associated with vocal fold vibration. To date, there has been little consistency in the measurement criteria used to calculate quotient values. In this study, criteria of 20% and 50% were applied to the AC amplitude of glottal airflow and inverted EGG waveforms for measurement of open quotient. Criteria of 20%, 50%, and 80%, and a midslope criterion that segmented the waveform between 20% and 80% of the waveform amplitude, were used for the calculation of speed quotient. Subjects produced waveforms at sound pressure levels (SPL) of 70, 75, 80, and 85 dB. Results indicated that approximations of open quotient obtained from the glottal airflow waveform significantly decreased using both the 20% and 50% criteria as SPL increased from 80 to 85 dB. No significant changes were found in open quotient from the EGG waveform as a function of SPL. Results of speed quotient measures from the glottal airflow and EGG waveforms showed a generally increasing trend as SPL increased, although the differences were not statistically significant. The data suggest that the signal type, measurement criterion, and SPL must be considered in interpreting quotient measures.
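
The dependence of open quotient on the chosen amplitude criterion can be illustrated with a minimal sketch. It is not the study's procedure: the function name `open_quotient`, the half-wave-rectified sine used as a stand-in for one airflow cycle, and the rule "samples above baseline plus criterion times peak-to-peak amplitude count as open" are all assumptions made for illustration.

```python
import math

def open_quotient(cycle, criterion=0.5):
    """Fraction of one cycle spent above baseline + criterion * AC amplitude.

    `cycle` holds the samples of a single glottal cycle (hypothetical input);
    `criterion` is the amplitude fraction, e.g. 0.20 or 0.50.
    """
    lo, hi = min(cycle), max(cycle)
    if hi == lo:
        raise ValueError("flat cycle has no AC amplitude")
    threshold = lo + criterion * (hi - lo)
    open_samples = sum(1 for x in cycle if x > threshold)
    return open_samples / len(cycle)

# Stand-in for one airflow cycle: a flow pulse followed by a closed phase.
N = 1000
cycle = [max(0.0, math.sin(2 * math.pi * n / N)) for n in range(N)]

oq20 = open_quotient(cycle, 0.20)  # lower criterion, longer apparent open phase
oq50 = open_quotient(cycle, 0.50)  # higher criterion, shorter apparent open phase
```

On the same cycle the 20% criterion yields a larger open quotient than the 50% criterion, which is one reason values computed with different criteria are not directly comparable.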

4.
During vocal fold vibration, there may be a mucosal wave in the superior-inferior (vertical) direction, resulting in a convergent shape during opening and a divergent shape during closing. Most of our understanding of the converging/diverging shape of the glottis has come from studies in a hemilarynx model. Previous work has shown that vibratory patterns in the full excised larynx differ from those in the hemilarynx. This study characterized the dynamics of the medial glottal wall geometry during vibrations in the full excised canine larynx model. Using particle image velocimetry, the intraglottal geometry was measured at the midmembranous coronal plane in an excised canine larynx model. Measurements of the glottal area were taken simultaneously using high-speed imaging. The results show that skewing of the glottal area waveform occurs without the presence of a vocal tract and that the phase-lag of the superior edge relative to the inferior edge is smaller than reported and depends on the subglottal pressure. In addition, they show that the glottal divergence angle during closing is proportional to the magnitude of the acoustic intensity and the intraglottal negative pressure. These preliminary data suggest that more studies are needed to determine the important mechanisms determining the relationship between intraglottal flow, intraglottal geometry, and acoustics.

5.
Interpretation of electroglottography (EGG) as an index of glottal contact area has been complicated by difficulty obtaining independent validation measures. The purpose of this research was to implement a new simultaneous EGG/videostroboscopic technique for the evaluation of the relationship between a discontinuity in the opening phase of the EGG waveform and the onset of glottal opening viewed via videostroboscopy. The results support previous suggestions that this EGG discontinuity, when observed in nonpathologic individuals, usually marks the onset of glottal opening along the superior surface of the vocal folds.

6.
This paper presents a Hilbert transform-based approach to analyze vocal fold vibrations in human subjects exhibiting normal and abnormal voice productions. This new approach is applied to the analysis of glottal area waveform (GAW) and is capable of providing useful information on the vocal fold vibration. The GAW is extracted from high-speed laryngeal images by delineating the glottal edge for each image frame. An analytic signal is generated through the Hilbert transform of the GAW, which yields a recognizable pattern of the vocal fold vibration in the analytic phase plane. The vibratory pattern is comprehensive and can be correlated with specific voice conditions. Quantitative measures of the glottal perturbation are introduced using the analytic amplitude and instantaneous frequency obtained from the analysis. Examples of clinical voice recordings are used to evaluate and test the effectiveness of this approach in providing qualitative representation and quantitative characteristics of vocal fold vibratory behavior. The results demonstrate the potential of using this new analytical tool incorporated with the high-speed laryngeal imaging modality for clinical voice assessment.
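
As a rough illustration of the analytic-signal idea, the sketch below builds an analytic signal with a plain DFT and reads off the analytic amplitude and instantaneous frequency. It is not the paper's implementation: the function names, the 1000 Hz sampling rate, and the zero-mean cosine standing in for a mean-removed GAW are assumptions, and the O(n^2) DFT is chosen only to keep the example dependency-free.

```python
import cmath
import math

def dft(x, inverse=False):
    """Naive discrete Fourier transform (O(n^2)); adequate for short signals."""
    n = len(x)
    sign = 1 if inverse else -1
    out = []
    for k in range(n):
        s = sum(x[t] * cmath.exp(sign * 2j * math.pi * k * t / n) for t in range(n))
        out.append(s / n if inverse else s)
    return out

def analytic_signal(x):
    """Return x + j*H{x} by zeroing the negative-frequency half of the spectrum."""
    n = len(x)
    weights = [0.0] * n
    weights[0] = 1.0                      # keep DC
    if n % 2 == 0:
        weights[n // 2] = 1.0             # keep Nyquist bin for even length
        upper = n // 2
    else:
        upper = (n + 1) // 2
    for k in range(1, upper):
        weights[k] = 2.0                  # double the positive frequencies
    spectrum = dft(x)
    return dft([s * w for s, w in zip(spectrum, weights)], inverse=True)

def amplitude_and_frequency(x, fs):
    """Analytic amplitude per sample and instantaneous frequency (Hz) per step."""
    z = analytic_signal(x)
    amplitude = [abs(v) for v in z]
    freq = [
        cmath.phase(z[i + 1] * z[i].conjugate()) * fs / (2 * math.pi)
        for i in range(len(z) - 1)
    ]
    return amplitude, freq

# Hypothetical mean-removed GAW: a 100 Hz cosine sampled at 1000 Hz.
fs = 1000
x = [math.cos(2 * math.pi * 100 * n / fs) for n in range(200)]
amplitude, freq = amplitude_and_frequency(x, fs)
```

For this perfectly periodic input the amplitude stays at 1 and the instantaneous frequency at 100 Hz; cycle-to-cycle deviations from such constant values are the kind of perturbation the abstract refers to.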

7.
A method for analyzing and displaying electroglottographic (EGG) signals (and their first derivative, DEGG) is introduced: the electroglottographic wavegram ("wavegram" hereafter). To construct a wavegram, the time-varying fundamental frequency is measured and consecutive individual glottal cycles are identified. Each cycle is locally normalized in duration and amplitude, the signal values are encoded by color intensity, and the cycles are concatenated to display the entire voice sample in a single image, similar to sound spectrography. The wavegram provides an intuitive means for quickly assessing vocal fold contact phenomena and their variation over time. Variations in vocal fold contact appear here as a sequence of events rather than single phenomena, taking place over a certain period of time, and changing with pitch, loudness, and register. Multiple DEGG peaks are revealed in wavegrams to behave systematically, indicating subtle changes of vocal fold oscillatory regime. As such, EGG wavegrams promise to reveal more information on vocal fold contacting and de-contacting events than previous methods.
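
The construction steps described above (cycle identification, normalization in duration and amplitude, concatenation) can be sketched as follows. This is an assumption-laden outline rather than the authors' code: cycle boundaries are taken as given instead of being derived from a fundamental-frequency tracker, linear interpolation is used for duration normalization, and the names `resample`, `normalize`, and `wavegram` are hypothetical. Rendering the columns as color intensities is left out.

```python
import math

def resample(cycle, n_points):
    """Linearly interpolate one cycle onto n_points evenly spaced positions."""
    last = len(cycle) - 1
    out = []
    for i in range(n_points):
        pos = i * last / (n_points - 1)
        j = min(int(pos), last - 1)
        frac = pos - j
        out.append(cycle[j] * (1 - frac) + cycle[j + 1] * frac)
    return out

def normalize(values):
    """Scale values to the range 0..1 (all zeros for a flat input)."""
    lo, hi = min(values), max(values)
    span = hi - lo
    return [(v - lo) / span if span else 0.0 for v in values]

def wavegram(signal, boundaries, n_points=50):
    """Return one duration- and amplitude-normalized column per glottal cycle."""
    if n_points < 2:
        raise ValueError("n_points must be at least 2")
    columns = []
    for start, end in zip(boundaries, boundaries[1:]):
        cycle = signal[start:end]
        if len(cycle) < 2:
            continue  # too short to interpolate
        columns.append(normalize(resample(cycle, n_points)))
    return columns

# Two synthetic cycles with different lengths and amplitudes.
signal = [math.sin(2 * math.pi * n / 8) for n in range(8)]
signal += [3 * math.sin(2 * math.pi * n / 12) for n in range(12)]
boundaries = [0, 8, 20]          # assumed cycle start indices plus the end index
columns = wavegram(signal, boundaries, n_points=16)
```

Each entry of `columns` would form one vertical strip of the image, so cycles of different length and amplitude become directly comparable side by side.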

8.
Geometry of the human vocal folds strongly influences their oscillatory motion. While the effect of intraglottal geometry on phonation has been widely investigated, the study of the geometry of the inferior surface of the vocal folds has been limited. In this study the way in which the inferior vocal fold surface angle affects vocal fold vibration was explored using a two-dimensional, self-oscillating finite element vocal fold model. The geometry was parameterized to create models with five different inferior surface angles. Four of the five models exhibited self-sustained oscillations. Comparisons of model motion showed increased vertical displacement and decreased glottal width amplitude with decreasing inferior surface angle. In addition, glottal width and air flow rate waveforms changed as the inferior surface angle was varied. Structural, rather than aerodynamic, effects are shown to be the cause of the changes in model response as the inferior surface angle was varied. Supporting data including glottal pressure distribution, average intraglottal pressure, energy transfer, and flow separation point locations are discussed, and suggestions for future research are given.

9.
Journal of Voice, 2020, 34(4): 503-526
Electroglottography (EGG) is a low-cost, noninvasive technology for measuring changes of relative vocal fold contact area during laryngeal voice production. EGG was introduced about 60 years ago and has gone through a “golden era” of increased scientific attention in the late 1980s and early 90s. During that period, four eminent review papers were written. Here, an update to these reviews is given, recapitulating some earlier landmark contributions and documenting noteworthy developments during the past 25 years. After presenting an algorithmic bibliographic analysis, some methodological aspects pertaining to measurement technology, qualitative and quantitative analysis, and respective interpretation are discussed. In particular, the interpretation of landmarks in the (first derivative of the) EGG waveform is critically examined. It is argued that because of inferior-superior and anterior-posterior phase differences of vocal fold vibration, vocal fold (de)contacting does not occur instantaneously, but over an interval of time. For this reason, instants of vocal fold closing and opening cannot be resolved exactly from the EGG signal. Consequently, any quantitative analysis parameter relying on the determination of (de)contacting events (such as the EGG contact quotient) should be interpreted with care. Finally, recent developments are reviewed for the various fields of application of EGG, including basic voice science and voice production physiology, speech signal processing and classification, clinical practice including swallowing, phonetics, hearing sciences, psychology, singing, trumpet playing, and mammalian and avian bioacoustics. Overall, EGG has over the past six decades developed into a mature technology with a wide range of applications. However, due to current limitations, the full potential of the methodology has not yet been fully exploited.
Future development may occur on three levels: (a) rigorous validation of existent measurement approaches; (b) introduction and rigorous validation of novel quantitative and interpretative approaches; and (c) advancement of the measurement technology itself.

10.
Journal of Voice, 2020, 34(4): 645.e19-645.e39
Intraglottal pressure is the driving force of vocal fold vibration. Its time course during the open phase of the vibratory cycle is essential in the mechanics of phonation, but measuring it directly is difficult and may hinder spontaneous voicing. However, it can be computed from the in vivo measured transglottal flow and glottal area (hence the air particle velocity) on the basis of the Bernoulli energy law and the interaction with the inertance of the vocal tract. As to sustained modal phonation, calculations are presented for the two possible shapes of glottal duct: convergent and divergent, including absolute calibration in order to obtain quantitative physical values. Whatever the glottal duct configuration, the calculations based on measured values of glottal area and air flow show that the integrated intraglottal pressure during the opening phase systematically exceeds that during the closing phase, which is the basic condition for sustaining vocal fold oscillation. The key point is that the airflow curve is skewed to the right relative to the glottal area curve. The skewing results from air compressibility and vocal tract inertance. The intraglottal pressure becomes negative during the closing phase. As to the soft (or physiological) voice onset, a similar approach shows that the integrated pressure differences (opening phase − closing phase) actually increase as the onset progresses, and this applies to the results based on Bernoulli's energy law as well as to those based on the interaction with the inertance of the vocal tract. Furthermore and similarly, the phase lead of the pressure wave with respect to the glottal opening progressively increases. The underlying explanation lies in the progressively increasing skewing of the airflow curve to the right with respect to the glottal area curve.
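
The link between flow, area, particle velocity, and pressure mentioned above can be made concrete with a deliberately simplified sketch. It assumes steady, incompressible, lossless flow from a subglottal region where the air is nearly at rest, so that the Bernoulli relation reduces to P = P_sub − ½ρv² with v = U/A. The article's actual calculation (convergent and divergent duct shapes, vocal tract inertance, calibration) is more involved and is not reproduced here; the function names, the air density of 1.2 kg/m³, and the numerical values are illustrative assumptions.

```python
RHO_AIR = 1.2  # kg/m^3, approximate density of air (assumed value)

def particle_velocity(flow_m3_s, area_m2):
    """Mean air particle velocity (m/s) from volume flow and glottal area."""
    if area_m2 <= 0:
        raise ValueError("glottal area must be positive (glottis open)")
    return flow_m3_s / area_m2

def bernoulli_pressure(p_sub_pa, flow_m3_s, area_m2, rho=RHO_AIR):
    """Static pressure (Pa) in the glottis under the simplified Bernoulli relation."""
    v = particle_velocity(flow_m3_s, area_m2)
    return p_sub_pa - 0.5 * rho * v * v

# Illustrative numbers only: 200 mL/s through a 10 mm^2 glottis, 800 Pa subglottal.
flow = 200e-6        # m^3/s
area = 10e-6         # m^2
velocity = particle_velocity(flow, area)              # about 20 m/s
pressure = bernoulli_pressure(800.0, flow, area)      # about 560 Pa
```

Because the dynamic term grows with the square of U/A, a flow curve that peaks later than the area curve produces different pressures at equal areas during opening and closing, which is the asymmetry the abstract describes.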

11.
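A parameter inversion method for a vocal fold dynamic model is proposed in order to distinguish voices affected by vocal fold lesions from the standpoint of the phonation mechanism. A vocal fold dynamic model is built from vocal fold physiology and Bernoulli's law, the vector of model parameters to be optimized is determined, and the model is coupled with the glottal airflow to obtain the model glottal waveform. The glottal waveform of the actual voice, obtained with the iterative adaptive inverse filtering algorithm, serves as the target glottal waveform. A genetic optimization algorithm is then used to invert the model parameters by matching the feature parameters of the target and model glottal waveforms. Experimental results show that the relative matching error of each time-domain and frequency-domain parameter characterizing the glottal waveform does not exceed 2%. Based on the inverted model parameters, an average normalized scaling coefficient that removes the influence of subglottal pressure is proposed; it overcomes the shortcomings of vocal fold asymmetry features in distinguishing pathological voices and achieves comprehensive and effective discrimination of pathological voices.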

12.
Electroglottography (EGG) is a method to monitor the vibrations of the vocal folds by measuring the varying impedance to a weak alternating current through the tissues of the neck. The paper is an attempt to give a state-of-the-art report of how electroglottography is used in the clinic. It is based on a search of the pertinent literature as well as on an inquiry to 17 well-known specialists in the field. The EGG techniques are described and limitations to the method are pointed out. Attempts to document voice quality by EGG are recognized and computerized methods to obtain information about vibratory perturbations and/or the vibratory frequency of the vocal folds are described. The author's personal conclusion is that the EGG signal is especially well suited for measurements of the glottal vibratory period. In the clinic such measurements are useful for periodicity analysis, as a basis for recording intonation contours, and to establish the characteristics of the voice fundamental frequency.

13.
The electroglottogram (EGG) has been conjectured to be related to the area of contact between the vocal folds. This hypothesis has been substantiated only partially via direct and indirect observations. In this paper, a simple model of vocal fold vibratory motion is used to estimate the vocal fold contact area as a function of time. This model employs a limited number of vocal fold vibratory features extracted from ultra high-speed laryngeal films. These characteristics include the opening and closing vocal fold angles and the lag (phase difference) between the upper and lower vocal fold margins. The electroglottogram is simulated using the contact area, and the EGG waveforms are compared to measured EGGs for normal male voices producing both modal and pulse register tones. The model also predicts EGG waveforms for vocal fold vibration associated with a nodule or polyp.

14.
Thyroplasty type I is one of several surgical treatments in which improving the voice in unilateral vocal fold paralysis is the ultimate objective. The goal of the surgery is the medialization of the paralyzed vocal fold. The purpose of this study is to evaluate the effectiveness of thyroplasty type I through acoustical analysis, aerodynamic measures, and quantitative videostroboscopic measurements. We report on 20 patients with unilateral vocal cord paralysis who underwent thyroplasty type I. We performed preoperative and postoperative video image analysis (normalized glottal gap area) and computer-assisted voice analysis (fundamental frequency, jitter, shimmer, noise-to-harmonic ratio, mean phonation time, mean flow rate, mean subglottic pressure) in all patients. The glottal gap was significantly reduced after thyroplasty type I. Postoperative voice quality was characterized by improved pitch and amplitude perturbation (jitter and shimmer), phonation time (mean phonation time), and subglottic pressure (mean subglottic pressure). Thyroplasty type I is an effective method for regaining glottal closure and vocal function.

15.
The primary purpose of the study was to explore a methodology for measuring vocal fold impact stress (SI) in awake humans, and to provide information about the general magnitude of SIs that may occur at the midpoint of the membranous vocal folds during phonation. A secondary purpose was to examine the potential use of the electroglottographic closed quotient (EGG CQ) to indirectly reflect SI. Seven male and 13 female adults were enrolled as subjects, of whom 18 had normal larynges and normal voices, 1 had nodules, and 1 had vocal fold paresis and bowing. Subjects attempted to produce 3 different voice types (pressed, normal, breathy), at 3 different pitches (low, medium, high) and 3 different loudness levels (quiet, medium, loud). For a first set of trials, only EGG data were collected. For a second set, a sensor was also introduced to the midmembranous glottis for the collection of SI data. The primary findings were that (1) endolaryngeal sensor placement was achieved during phonation trials for 17 of 20 subjects; however, grossly consistent anteroposterior positioning was accomplished, and analyzable data were obtained, for only 7 subjects; (2) SIs ranged from less than 1 kPa to about 3 kPa for those 7 subjects; and (3) no relation was detected between simultaneous CQs and SIs for individual data, although a relation was reported in a prior canine study. One possible reason for the failure to show such a relation in the present study was subtle variations in vertical as well as anteroposterior positioning of the sensor during the trials. Future studies should focus on developing a methodology for ensuring invariant 3-dimensional sensor positioning between the membranous folds, so that the stability of both SI and simultaneous CQ data can be improved.

16.
Aerobic instructors frequently experience vocal fatigue and are at risk for the development of vocal fold pathology. Six female aerobic instructors, three with self-reported voice problems and three without, served as subjects. Measures of vocal function (perturbation and EGG) were obtained before and after a 30-minute exercise session. Results showed that the group with self-reported voice problems had greater amounts of jitter, lower harmonic-to-noise ratios, and less periodicity in sustained vowels overall, but no significant differences in measures of perturbation and EGG were found before and immediately after instruction. Measures of vocal parameters showed that subjects with self-reported voice problems projected with relatively greater vocal intensity and phonated for a greater percentage of time across beginning, middle, and ending periods of aerobic instruction than subjects with no reported voice problems.

17.
The mucosal upheaval (MU), where the mucosal wave starts and propagates upward, appears only when the vocal fold vibrates. The location of the MU histologically and the effect of changes in mean air flow rate (MFR) and vocal fold length on occurrence of the MU were studied in twelve excised canine larynges. The lower surface of the vocal fold was marked to serve as a landmark for subsequent study. Cricothyroid approximation was performed to lengthen the vocal fold. After taking high-speed pictures or recording stroboscopic images from the tracheal side, a small cut wound was made at the mark. This wound served to compare the position of the MU with the histologically identified location of the mark. The larynx was then sectioned in the frontal plane. Before lengthening the vocal fold, the MU occurred on the area where the lamina propria became thinner and where the muscular layer neared the epithelial layer. After lengthening the vocal fold, the MU actually shifted medially compared with its original position. The subglottic area surrounded by the bilateral MUs became longer and thinner. Whether or not complete glottal closure during a vibratory cycle was achieved did not alter these findings. In contrast, with a fixed vocal fold length the MU appeared more laterally as MFR increased, but, based on the relation with the mark, its location on the vocal fold did not change from its original position before increase of MFR.

18.
Journal of Voice, 2020, 34(2): 294-299
Objective: This study aimed to investigate the correlation between morphological features of vocal fold polyps (VFPs) and subjective/objective voice parameters. Methods: Perceptual evaluations and aerodynamic and acoustic tests were performed on 47 patients with VFPs. Still images were captured from video and the morphological features associated with the size of VFP were quantified. To reveal the correlation between size-related morphological features (length of polyp base, the ratio of polyp base to vocal fold length, glottal gap area) and objective/subjective parameters of voice, Pearson's and Spearman's tests were carried out. Results: This cohort was composed of 30 (63.8%) male and 17 (36.2%) female patients with mean ages of 45.2 years and 41.3 years, respectively. No correlation was found between the morphological features of VFPs and any of the perceptual, aerodynamic, and acoustic voice parameters. Conclusions: Our findings indicated that controversies still exist regarding the role of vocal fold polyp morphology in clinical decision making.

19.
Stroboscopic signs were systematically rated for a group of 80 patients with benign vocal fold lesions, most of whom had either a nodule or a polyp. Each group revealed a characteristic pattern of ranking of signs and exhibited differences of most predominant signs. The results of the ratings were submitted to a multiple discriminant analysis to determine if post hoc stroboscopic ratings could be used to correctly classify patients into one of four diagnostic groups and into one of two treatment groups. All patients except one were correctly classified into the diagnostic groups, and all were correctly classified into the treatment groups. The important signs for classifying patients into the diagnostic groups were roughness of the edge of the affected vocal fold, phase closure pattern, and phase symmetry. The important signs for classifying patients into the treatment groups were roughness of the edge of the affected vocal fold, glottal closure configuration, and vibration characteristics of the affected (or more affected) vocal fold. The results suggest that objective evaluation of stroboscopic examinations can be valuable in correctly diagnosing patients and in selecting the proper treatment regimen for the patient.

20.
Posterior closure insufficiency of the glottis is often mentioned in connection with permanent voice disorders. Recently published studies have revealed that an incomplete closure of the glottis can be found also in normal-speaking voices, especially in women. However, the effect of glottal closure configuration on vocal efficacy is not sufficiently clarified. The purpose of this study was to determine the effect of glottal closure configuration on singing and speaking voice characteristics. Overall, 520 young female normal-speaking subjects were examined by videostroboscopy for different phonation conditions in the combination of soft, loud, low, and/or high phonation and by voice range profile measurements. According to the videostroboscopic analysis, the subjects were subdivided into four groups: complete closure of the vocal folds already in soft phonation (group 1), closure of the vocal fold with increasing intensity (group 2), persistent closure insufficiencies despite increasing intensity (group 3), and hourglass-shaped closure in subjects with vocal nodules (group 4). Subjects in which the glottal closure could not be evaluated sufficiently were subclassified into group 5 (missing values).

Selected criteria of the singing and speaking voice were evaluated and statistically processed according to the mentioned subclassification. Group 1 reached significantly higher maximum sound pressure levels (SPLmax) than the other groups for the singing voice as well as for the shouting voice. Group 3 showed a limited capacity to increase the intensity of the singing and speaking voice. The results gathered in this study objectify the relationship between insufficient glottal closure and reduced vocal capabilities. As long as no conclusive data on long-term consequences of insufficient glottal closure are available, a prophylactic improvement of the laryngeal situation by voice therapy should be recommended, especially in female professional voice users.

