Similar literature
1.
EGGW is a phonatory parameter that can be derived from electroglottographic (EGG) signals and used to infer the relative degree of vocal fold contact. Vocal fold models predict that men will exhibit medial bulging of their vocal folds during phonation but women will not. These models lead us to expect gender differences in the magnitude of EGGW. Nevertheless, significant gender differences in EGGW for adults with normal voices have not been documented in previous studies when EGGW was computed from criterion lines placed at 25%-40% of the amplitude of the uninverted EGG waveform. We hypothesized that EGGW would better reflect gender differences in vocal fold adductory patterns if EGGW was computed from portions of the waveform that were associated with more vocal fold contact. EGGW was measured for seven men and seven women with normal voices. When EGGW was computed from segments of the waveform that were associated with relatively greater vocal fold contact (i.e., using criterion levels of ≥ 55%), findings were consistent with the gender-specific adductory patterns that have been proposed from vocal fold models. Guidelines for appropriate placement of criterion lines when computing EGGW are discussed.
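The abstract does not give the exact formula for EGGW, so the following is only a minimal sketch of the criterion-line idea it describes: place a line at a chosen fraction of the peak-to-peak amplitude of one uninverted EGG cycle and measure how much of the cycle lies at or above it. The function name `contact_width` and the synthetic triangular cycle are assumptions for illustration, not the study's method or data.

```python
def contact_width(cycle, criterion):
    """Fraction of one EGG cycle at or above a criterion line.

    cycle:     samples of a single period of the uninverted EGG waveform
               (assumed here: larger value = more vocal fold contact).
    criterion: criterion level as a fraction (0..1) of peak-to-peak amplitude.
    """
    lo, hi = min(cycle), max(cycle)
    if hi == lo:
        raise ValueError("flat cycle has no defined criterion line")
    threshold = lo + criterion * (hi - lo)
    above = sum(1 for sample in cycle if sample >= threshold)
    return above / len(cycle)


# Synthetic triangular cycle (hypothetical data, not from the study).
cycle = [0, 1, 2, 3, 4, 3, 2, 1]
print(contact_width(cycle, 0.25))  # 0.875
print(contact_width(cycle, 0.55))  # 0.375
```

Raising the criterion level restricts the measure to the part of the cycle with greater contact, which is the region where the abstract reports that gender differences emerged.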

2.
The purpose of this study was to explore the possible use of the electroglottographic closed quotient (EGG CQ) as a noninvasive estimate of vocal fold impact stress (SI). Two excised canine larynges were used. Each larynx was mounted and vocal fold oscillation was induced using a humidified air source. Twenty-seven experimental trials were conducted for each larynx. Trials involved variations in vocal process gap, vocal fold elongation, and subglottic pressure. Simultaneous measures were made of vocal fold SI at the midpoint of the membranous vocal folds, and EGG CQ (dimensionless ratio). The results indicated that when threshold and saturation effects were excluded, the SI and the CQ were strongly related (linear correlation r = .83 and .96 for the two individual larynges, and .81 for the combined data). Within the region of linear relation, an increase of .15 in the CQ corresponded to about 1 kPa increase in SI for the combined data. Discussion focuses on possible clinical implications and the likely reasons for threshold and saturation phenomena.

3.
The electroglottogram (EGG) has been conjectured to be related to the area of contact between the vocal folds. This hypothesis has been substantiated only partially via direct and indirect observations. In this paper, a simple model of vocal fold vibratory motion is used to estimate the vocal fold contact area as a function of time. This model employs a limited number of vocal fold vibratory features extracted from ultra high-speed laryngeal films. These characteristics include the opening and closing vocal fold angles and the lag (phase difference) between the upper and lower vocal fold margins. The electroglottogram is simulated using the contact area, and the EGG waveforms are compared to measured EGGs for normal male voices producing both modal and pulse register tones. The model also predicts EGG waveforms for vocal fold vibration associated with a nodule or polyp.

4.
Journal of Voice, 2020, 34(4): 503-526
Electroglottography (EGG) is a low-cost, noninvasive technology for measuring changes of relative vocal fold contact area during laryngeal voice production. EGG was introduced about 60 years ago and has gone through a “golden era” of increased scientific attention in the late 1980s and early 90s. During that period, four eminent review papers were written. Here, an update to these reviews is given, recapitulating some earlier landmark contributions and documenting noteworthy developments during the past 25 years. After presenting an algorithmic bibliographic analysis, some methodological aspects pertaining to measurement technology, qualitative and quantitative analysis, and respective interpretation are discussed. In particular, the interpretation of landmarks in the (first derivative of the) EGG waveform is critically examined. It is argued that because of inferior-superior and anterior-posterior phase differences of vocal fold vibration, vocal fold (de)contacting does not occur instantaneously, but over an interval of time. For this reason, instants of vocal fold closing and opening cannot be resolved exactly from the EGG signal. Consequently, any quantitative analysis parameter relying on the determination of (de)contacting events (such as the EGG contact quotient) should be interpreted with care. Finally, recent developments are reviewed for the various fields of application of EGG, including basic voice science and voice production physiology, speech signal processing and classification, clinical practice including swallowing, phonetics, hearing sciences, psychology, singing, trumpet playing, and mammalian and avian bioacoustics. Overall, EGG has over the past six decades developed into a mature technology with a wide range of applications. However, due to current limitations, the full potential of the methodology has not yet been exploited.
Future development may occur on three levels: (a) rigorous validation of existent measurement approaches; (b) introduction and rigorous validation of novel quantitative and interpretative approaches; and (c) advancement of the measurement technology itself.

5.
Noninvasive measures of vocal fold activity are useful for describing normal and disordered voice production. Measures of open and speed quotient from glottal airflow and electroglottographic (EGG) waveforms have been used to describe timing events associated with vocal fold vibration. To date, there has been little consistency in the measurement criteria used to calculate quotient values. In this study, criteria of 20% and 50% were applied to the AC amplitude of glottal airflow and inverted EGG waveforms for measurement of open quotient. Criteria of 20%, 50%, and 80%, and a midslope criterion that segmented the waveform between 20% and 80% of the waveform amplitude, were used for the calculation of speed quotient. Subjects produced waveforms at sound pressure levels (SPL) of 70, 75, 80 and 85 dB. Results indicated that approximations of open quotient obtained from the glottal airflow waveform significantly decreased using both the 20% and 50% criteria as SPL increased from 80 to 85 dB. No significant changes were found in open quotient from the EGG waveform as a function of SPL. Results of speed quotient measures from the glottal airflow and EGG waveforms showed a generally increasing trend as SPL increased, although the differences were not statistically significant. The data suggest that the signal type, measurement criterion and SPL must be considered in interpreting quotient measures.

6.
7.
Ma EP, Baken RJ, Roark RM, Li PM. Journal of Voice, 2012, 26(5): 670.e1-670.e6
Vocal attack time (VAT) is the time lag between the growth of the sound pressure signal and the development of physical contact of vocal folds at vocal initiation. It can be derived by a cross-correlation of short-time amplitude changes occurring in the sound pressure and electroglottographic (EGG) signals. Cantonese is a tone language in which tone determines the lexical meaning of the syllable. Such linguistic function of tone has implications for the physiology of tone production. The aim of the present study was to investigate the possible effects of Cantonese tones on VAT. Sound pressure and EGG signals were simultaneously recorded from 59 native Cantonese speakers (31 females and 28 males). The subjects were asked to read aloud 12 disyllabic words comprising homophone pairs of the six Cantonese lexical tones. Results revealed a gender difference in VAT values, with the mean VAT significantly smaller in females than in males. There was also a significant difference in VAT values between the two tone categories, with the mean VAT values of the three level tones (tones 1, 3, and 6) significantly smaller than those of the three contour tones (tones 2, 4, and 5). The findings support the notion that norms and interpretations based on nontone European languages may not be directly applied to tone languages.
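The abstract states only that VAT can be derived by cross-correlating short-time amplitude changes in the two signals; the published procedure is not reproduced here. The sketch below illustrates the generic step of finding the lag that maximizes a cross-correlation between two already-computed amplitude-change sequences. The function name `best_lag`, the sign convention, and the toy sequences are assumptions.

```python
def best_lag(x, y, max_lag):
    """Return the lag (in samples) that maximizes the cross-correlation.

    A positive result means y is delayed relative to x (convention chosen
    for this sketch). x and y are equal-rate sequences, for example
    short-time amplitude changes of two simultaneously recorded signals.
    """
    best, best_score = 0, float("-inf")
    for lag in range(-max_lag, max_lag + 1):
        score = 0.0
        for i, xv in enumerate(x):
            j = i + lag
            if 0 <= j < len(y):
                score += xv * y[j]
        if score > best_score:
            best, best_score = lag, score
    return best


# Toy sequences (hypothetical): y is x delayed by two samples.
x = [0, 0, 1, 2, 1, 0, 0, 0, 0, 0]
y = [0, 0, 0, 0, 1, 2, 1, 0, 0, 0]
print(best_lag(x, y, max_lag=4))  # 2
```

Dividing the returned lag by the sampling rate converts it to seconds; which signal is expected to lead in real recordings is a physiological question that this sketch does not settle.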

8.
9.
The primary purpose of the study was to explore a methodology for measuring vocal fold impact stress (SI) in awake humans, and to provide information about the general magnitude of SIs that may occur at the midpoint of the membranous vocal folds during phonation. A secondary purpose was to examine the potential use of the electroglottographic closed quotient (EGG CQ) to indirectly reflect SI. Seven male and 13 female adults were enrolled as subjects, of whom 18 had normal larynges and normal voices, 1 had nodules, and 1 had vocal fold paresis and bowing. Subjects attempted to produce 3 different voice types (pressed, normal, breathy), at 3 different pitches (low, medium, high) and 3 different loudness levels (quiet, medium, loud). For a first set of trials, only EGG data were collected. For a second set, a sensor was also introduced to the midmembranous glottis for the collection of SI data. The primary findings were that (1) endolaryngeal sensor placement was achieved during phonation trials for 17 of 20 subjects; however, grossly consistent anteroposterior positioning was accomplished, and analyzable data were obtained, for only 7 subjects; (2) SIs ranged from less than 1 kPa to about 3 kPa for those 7 subjects; and (3) no relation was detected between simultaneous CQs and SIs for individual data, although a relation was reported in a prior canine study. One possible reason for the failure to show such a relation in the present study was subtle variations in vertical as well as anteroposterior positioning of the sensor during the trials. Future studies should focus on developing a methodology for ensuring invariant 3-dimensional sensor positioning between the membranous folds, so that the stability of both SI and simultaneous CQ data can be improved.

10.
Vocal quality factors: analysis, synthesis, and perception. (Total citations: 4; self-citations: 0; citations by others: 4)
The purpose of this study was to examine several factors of vocal quality that might be affected by changes in vocal fold vibratory patterns. Four voice types were examined: modal, vocal fry, falsetto, and breathy. Three categories of analysis techniques were developed to extract source-related features from speech and electroglottographic (EGG) signals. Four factors were found to be important for characterizing the glottal excitations for the four voice types: the glottal pulse width, the glottal pulse skewness, the abruptness of glottal closure, and the turbulent noise component. The significance of these factors for voice synthesis was studied and a new voice source model that accounted for certain physiological aspects of vocal fold motion was developed and tested using speech synthesis. Perceptual listening tests were conducted to evaluate the auditory effects of the source model parameters upon synthesized speech. The effects of the spectral slope of the source excitation, the shape of the glottal excitation pulse, and the characteristics of the turbulent noise source were considered. Applications for these research results include synthesis of natural-sounding speech, synthesis and modeling of vocal disorders, and the development of speaker-independent (or adaptive) speech recognition systems.

11.
Electroglottography (EGG) was used to monitor vocal fold vibration patterns in normal subjects and patients with various laryngeal disorders. In order to evaluate the regularity of vocal fold vibration, frequency and amplitude perturbation of EGG waves during sustained phonation were measured with a laboratory computer. The data were compared to the degree of hoarseness evaluated by auditory perception and by sound spectrographic analysis. Frequency and amplitude perturbation measures showed some overlap between normal and pathological groups. However, there was a close relation between perturbation analysis of EGG waves and degree of hoarseness (Spearman's rank correlation coefficient rs = 0.73, p < 0.0005). Amplitude perturbation was found to be a more sensitive measure of the irregularity of vocal fold vibration than frequency perturbation.
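The abstract does not say which perturbation formula was used. As an illustration only, the sketch below computes one widely used cycle-to-cycle measure: the mean absolute difference between consecutive values divided by the mean value. Applied to cycle periods it gives a jitter-like figure, and applied to cycle peak amplitudes a shimmer-like figure. The function name and the sample values are assumptions.

```python
def relative_perturbation(values):
    """Mean absolute cycle-to-cycle difference, relative to the mean value.

    values: one number per glottal cycle, e.g. period durations (frequency
            perturbation) or peak amplitudes (amplitude perturbation).
    """
    if len(values) < 2:
        raise ValueError("need at least two cycles")
    diffs = [abs(b - a) for a, b in zip(values, values[1:])]
    mean_diff = sum(diffs) / len(diffs)
    mean_value = sum(values) / len(values)
    return mean_diff / mean_value


# Hypothetical period durations in milliseconds.
steady = [10.0, 10.0, 10.0, 10.0]
alternating = [8.0, 12.0, 8.0, 12.0]
print(relative_perturbation(steady))       # 0.0
print(relative_perturbation(alternating))  # 0.4
```

A perfectly regular vibration yields zero, and larger values indicate more irregular cycles, which is the sense in which the abstract relates perturbation to hoarseness.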

12.
Aerobic instructors frequently experience vocal fatigue and are at risk for the development of vocal fold pathology. Six female aerobic instructors, three with self-reported voice problems and three without, served as subjects. Measures of vocal function (perturbation and EGG) were obtained before and after a 30-minute exercise session. Results showed that the group with self-reported voice problems had greater amounts of jitter, lower harmonic-to-noise ratios, and less periodicity in sustained vowels overall, but no significant differences in measures of perturbation and EGG were found before and immediately after instruction. Measures of vocal parameters showed that subjects with self-reported voice problems projected with relatively greater vocal intensity and phonated for a greater percentage of time across beginning, middle, and ending periods of aerobic instruction than subjects with no reported voice problems.

13.
Newly developed glottographic sensors, utilizing high-frequency propagating electromagnetic waves, were compared to a well-established electroglottographic device. The comparison was made on four male subjects under different phonation conditions, including three levels of vocal fold adduction (normal, breathy, and pressed), three different registers (falsetto, chest, and fry), and two different pitches. Agreement between the sensors was always found for the glottal closure event, but for the general wave shape the agreement was better for falsetto and breathy voice than for pressed voice and vocal fry. Differences are attributed to the field patterns of the devices. Whereas the electroglottographic device can operate only in a conduction mode, the electromagnetic device can operate in either the forward scattering (diffraction) mode or in the backward scattering (reflection) mode. Results of our tests favor the diffraction mode because a more favorable angle imposed on receiving the scattered (reflected) signal did not improve the signal strength. Several observations are made on the uses of the electromagnetic sensors for operation without skin contact and possibly in an array configuration for improved spatial resolution within the glottis.

14.
Subharmonics are an important class of voice signals, relevant for speech, pathological voice, singing, and animal bioacoustics. They arise from special cases of amplitude (AM) or frequency modulation (FM) of the time-domain signal. Surprisingly, to date there is only one open source subharmonics detector available to the scientific community: Sun’s subharmonic-to-harmonic ratio (SHR). Here, this algorithm was subjected to a formal evaluation with two data sets of synthesized and empirical speech samples. Both data sets consisted of electroglottographic (EGG) signals, i.e., a physiological correlate of vocal fold oscillation that bypasses vocal tract acoustics. Data Set I contained 2560 synthesized EGG signals with varying degrees of AM and FM, fundamental frequency (fo), periodicity, and signal-to-noise ratio (SNR). Data Set II was made up of 25 EGG samples extracted from the CMU Arctic speech database. For a “ground truth” of subharmonicity, these samples were manually annotated by a group of five external experts. Analysis of the synthesized data suggested that the SHR metric is relatively robust as long as the subharmonic modulation extent is below 0.35 and 0.7 for the FM and AM scenarios, respectively. In the CMU Arctic speech data samples, the SHR analysis reached a maximum sensitivity of about 87% at a specificity of over 90%, but only for adaptive algorithm parameter settings. In contrast, the algorithm’s default parameter settings could only successfully classify about 9% of all subharmonic instances. The SHR is a useful metric for assessing the degree of subharmonics contained in voice signals, but only at adaptive parameter settings. In particular, the frequency ceiling should be set to five times the highest fo, and the frame length to at least five times the largest fundamental period of the analyzed signal. For subharmonic classification a threshold of SHR ≥ 0.01 is recommended.
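The parameter recommendations at the end of this abstract reduce to simple arithmetic, sketched below. The helper names `shr_settings` and `is_subharmonic` and the dictionary keys are hypothetical; they are not the interface of the SHR implementation, and the fo range in the example is invented.

```python
def shr_settings(fo_min_hz, fo_max_hz):
    """Derive analysis settings from the fo range of the signal.

    Frequency ceiling: five times the highest fo.
    Frame length: at least five times the largest fundamental period,
    i.e. five divided by the lowest fo.
    """
    if not 0 < fo_min_hz <= fo_max_hz:
        raise ValueError("expected 0 < fo_min_hz <= fo_max_hz")
    return {
        "ceiling_hz": 5 * fo_max_hz,
        "min_frame_s": 5 / fo_min_hz,
    }


def is_subharmonic(shr, threshold=0.01):
    """Classify a frame as subharmonic when SHR meets the threshold."""
    return shr >= threshold


# Hypothetical fo range of 80 to 300 Hz.
settings = shr_settings(80, 300)
print(settings["ceiling_hz"])   # 1500
print(settings["min_frame_s"])  # 0.0625
print(is_subharmonic(0.02))     # True
```

Deriving both settings from the measured fo range of each recording is one way to realize the "adaptive parameter settings" that the abstract contrasts with the defaults.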

15.
A new set of parameters is described for analysis and synthesis of glottal area, vocal fold contact area, and glottal volume flow. Parameters are all nondimensionalized and consist of an abduction quotient, a shape quotient, a phase quotient, and a load quotient in addition to fundamental frequency and vibrational amplitude. The parameters show promise in interpretation of electroglottographic, photoglottographic, and inverse filtered volume velocity waveforms in terms of the glottal configuration. Some comparisons between modeled and measured glottographic waveforms are made.

16.
This study compared acoustic and electroglottographic (EGG) jitter from [a] vowels of 103 dysphonic speakers. The EGG recordings were chosen according to their intensity, signal-to-noise ratio, and percentage of unvoiced intervals, while acoustic signals were selected based on voicing detection and the reliability of jitter extraction. The agreement between jitter measures was expressed numerically as a normalized difference. In 63.1% (65/103) of the cases the differences fell within ±22.5%. Positive differences above +22.5% were associated with increased acoustic jitter and occurred in 12.6% (13/103) of the speakers. These were, typically, cases of small nodular lesions without problems in the posterior larynx. On the other hand, substantial rises in EGG jitter leading to differences below -22.5% took place in 24.3% (25/103) of the speakers and were related to hyperfunctional voices, creaky-like voices, small laryngeal asymmetries affecting the arytenoids, or small-to-moderate glottal chinks. A clinically relevant outcome of the study was the possibility of detecting gentle laryngeal asymmetries among cases of large unilateral increase in EGG jitter. These asymmetries can be linked with vocal problems that are often overlooked in endoscopic examinations.

17.
While vocal fold adduction is an important parameter in speech, relatively little is known about the adjustment of vocal fold adduction in singing. This study investigates the possibility of separate adjustments of cartilaginous and membranous vocal fold adduction in singing. Six female and seven male subjects, singers and non-singers, were asked to imitate an instructor in producing four phonation types: "aBducted falsetto" (FaB), "aDducted falsetto" (FaD), "aBducted Chest" (CaB), and "aDducted Chest" (CaD). The phonations were evaluated using videostroboscopy, videokymography (VKG), electroglottography (EGG), and audio recordings. All the subjects showed less posterior (cartilaginous) vocal fold adduction in phonation types FaB and CaB than in FaD and CaD, and less membranous vocal fold adduction (smaller closed quotient) in FaB and FaD than in CaB and CaD. The findings indicate that the exercises enabled the singers to separately manipulate (a) cartilaginous adduction and (b) membranous medialization of the glottis through vocal fold bulging. Membranous adduction (monitored via videokymographic closed quotient) was influenced by both membranous medialization and cartilaginous adduction. Individual control over these types of vocal fold adjustments allows singers to create different vocal timbres.

18.
Measures of vocal function during changes in vocal effort level (Total citations: 4; self-citations: 0; citations by others: 4)
The purpose of this article is to present the results of a controlled study of the day-to-day variabilities of three acoustic parameters (jitter, shimmer, and normalized noise energy), and two electroglottographic parameters (contact quotient and contact quotient perturbation) for vowels produced at three vocal efforts (low, normal, high). Data were obtained with use of a sophisticated bilinear interpolation pitch detection method. A repeated measures design required subjects to produce the vowels // and /a/ five times a day over 3 days at each vocal effort level. The jitter, shimmer, and normalized noise energy values from acoustic measures and contact quotient and contact quotient perturbation values varied significantly among the three vocal effort levels. The clinical implication of this finding is that vocal effort must be controlled in order to obtain consistent clinical measures. Furthermore, day-to-day variability must be taken into account if representative measures are to be obtained for clinical use.

19.
Occurrences of period-doubling are found in human phonation, in particular for pathological and some singing phonations such as Sardinian A Tenore Bassu vocal performance. The combined vibration of the vocal folds and the ventricular folds has been observed during the production of such low pitch bass-type sound. The present study aims to characterize the physiological correlates of this acoustical production and to provide a better understanding of the physical interaction between ventricular fold vibration and vocal fold self-sustained oscillation. The vibratory properties of the vocal folds and the ventricular folds during phonation produced by a professional singer are analyzed by means of acoustical and electroglottographic signals and by synchronized glottal images obtained by high-speed cinematography. The periodic variation in glottal cycle duration and the effect of ventricular fold closing on glottal closing time are demonstrated. Using the detected glottal and ventricular areas, the aerodynamic behavior of the laryngeal system is simulated using a simplified physical model previously validated in vitro using a larynx replica. An estimate of the ventricular aperture extracted from the in vivo data allows a theoretical prediction of the glottal aperture. The in vivo measurements of the glottal aperture are then compared to the simulated estimations.

20.
The purpose of this exploratory study was to determine if laryngeal transillumination in combination with stroboscopy (strobophotoglottography; SPGG) is useful for (1) the visualization of vocal fold vibration (VFV) opening patterns, (2) the localization of initial vocal fold opening in horizontal glottal thirds (anterior, midmembranous, and posterior), (3) determination of the temporal correspondence of the so-called electroglottography (EGG)-knee and initial vocal fold separation, and, finally, (4) automatized quantitative measurements of glottal area function within endoscopic images. With stroboscopic transillumination, initial inferior vocal fold separation was detectable during the "closed" phase, where the vocal folds were still closed in the upper portion and therefore initial inferior vocal fold separation could not be visualized with usual laryngoscopy techniques. In the horizontal plane within similar fundamental frequencies in modal voice registers in two male subjects, localization of initial glottal opening depended on the voice types used (soft, normal, or pressed phonation). We found zipperlike posterior-to-anterior openings, initial midmembranous openings, initial anterior openings, as well as simultaneous initial opening of all three portions in the two healthy male adults examined. This technique proved to add temporal and spatial information to vocal fold opening patterns and extends our examination techniques to the very beginning of vocal fold opening at the inferior portion. Simultaneous electroglottogram tracking and comparison with bidirectionally illuminated stroboscopic images revealed a time-locked correspondence of the EGG-knee with the aforementioned initial inferior vocal fold separation. Bidirectional illumination combined with digital color extraction techniques allowed for image separation of subglottally and supraglottally illuminated structures. 
This facilitated vocal fold contour detection and automatized image processing, for example, for determination of glottal area function, and is considered to be a further step to objective automatized quantitative measurements within endoscopic images.
