期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

1.

Normalized amplitude quotient for parametrization of the glottal flow

Alku P Bäckström T Vilkman E 《The Journal of the Acoustical Society of America》2002,112(2):701-710

Normalized amplitude quotient (NAQ) is presented as a method to parametrize the glottal closing phase using two amplitude-domain measurements from waveforms estimated by inverse filtering. In this technique, the ratio between the amplitude of the ac flow and the negative peak amplitude of the flow derivative is first computed using the concept of equivalent rectangular pulse, a hypothetical signal located at the instant of the main excitation of the vocal tract. This ratio is then normalized with respect to the length of the fundamental period. Comparison between NAQ and its counterpart among the conventional time-domain parameters, the closing quotient, shows that the proposed parameter is more robust against distortion such as measurement noise that make the extraction of conventional time-based parameters of the glottal flow problematic. Experiments with breathy, normal, and pressed vowels indicate that NAQ is also able to separate the type of phonation effectively. 相似文献

2.

Characteristics of vocal fold adduction related to voice onset

Ayanna Cooke Christy L. Ludlow Nicholas Hallett W. Scott Selbie 《Journal of voice》1997,11(1):12-22

This study examined whether vocal fold kinematics prior to phonation differed between hard (glottal), normal, or breathy onsets in men and women. Glottal landmarks were identified and digitized from videotape recorded with a rigid laryngoscope during different voice onset types. Significant linear relationships (p 0.0055) were found among onset types on measures of (a) gesture duration when moving from 80% to 20% of maximum distance during adduction, (b) maximum velocity, (c) duration between the completion of adduction and phonation onset, and (d) ratios of maximum velocity to maximum distance between the vocal processes, an estimate of stiffness. The gesture duration was greatest for breathy onsets and least for hard onsets, while the maximum velocity, latency between adduction and phonation onset, and estimated stiffness were greatest for hard onsets and least for breathy onsets. The results suggest that one trajectory seems to be used with increases in gesture duration being accompanied by decreases in articulator stiffness when moving from hard to normal to breathy voice onset types. 相似文献

3.

A comparison of type I thyroplasty and arytenoid adduction 总被引：1，自引：0，他引：1

Steven Bielamowicz Gerald S. Berke Bruce R. Gerratt 《Journal of voice》1995,9(4):466-472

Glottal incompetence is a common laryngeal disorder causing impaired swallowing and phonation. The resultant voice has been characterized as weak and breathy with a restricted pitch range. Currently, medialization thyroplasty and arytenoid adduction are two of the surgical treatments for patients with glottal incompetence. However, few studies have evaluated the changes in objective measures of speech with type I thyroplasty and arytenoid adduction. In this study, 59 patients with glottal incompetence underwent either type I thyroplasty or arytenoid adduction. Acoustic (jitter, shimmer, and harmonics-to-noise ratio) and aerodynamic (airflow, subglottic pressure, and glottal resistance) measures were obtained both pre- and postoperatively. No significant differences were found among acoustic or aerodynamic measures for operation type. However, a significant pre/postsurgery effect was observed for translaryngeal airflow. In addition, no significant differences were found among the measures for patients with traditional compared with nontraditional operative indications. Patients who developed glottal insufficiency due to previous laryngeal surgery (e.g., vocal fold stripping) demonstrated no statistically significant improvement in acoustic or aerodynamic measures following thyroplasty or arytenoid adduction. 相似文献

4.

Comparison of Two Inverse Filtering Methods in Parameterization of the Glottal Closing Phase Characteristics in Different Phonation Types 总被引：1，自引：0，他引：1

Laura Lehto Matti Airas Eva Bjrkner Johan Sundberg Paavo Alku 《Journal of voice》2007,21(2):138-150

SUMMARY: Inverse filtering (IF) is a common method used to estimate the source of voiced speech, the glottal flow. This investigation aims to compare two IF methods: one manual and the other semiautomatic. Glottal flows were estimated from speech pressure waveforms of six female and seven male subjects producing sustained vole /a/ in breathy, normal, and pressed phonation. The closing phase characteristics of the glottal pulse were parameterized using two time-based parameters: the closing quotient (C1Q) and the normalized amplitude quotient (NAQ). The information given by these two parameters indicates a strong correlation between the two IF methods. The results are encouraging in showing that the parameterization of the voice source in different speech sounds can be performed independently of the technique used for inverse filtering. 相似文献

5.

Glottal flow through a two-mass model: comparison of Navier-Stokes solutions with simplified models

de Vries MP Schutte HK Veldman AE Verkerke GJ 《The Journal of the Acoustical Society of America》2002,111(4):1847-1853

A new numerical model of the vocal folds is presented based on the well-known two-mass models of the vocal folds. The two-mass model is coupled to a model of glottal airflow based on the incompressible Navier-Stokes equations. Glottal waves are produced using different initial glottal gaps and different subglottal pressures. Fundamental frequency, glottal peak flow, and closed phase of the glottal waves have been compared with values known from the literature. The phonation threshold pressure was determined for different initial glottal gaps. The phonation threshold pressure obtained using the flow model with Navier-Stokes equations corresponds better to values determined in normal phonation than the phonation threshold pressure obtained using the flow model based on the Bernoulli equation. Using the Navier-Stokes equations, an increase of the subglottal pressure causes the fundamental frequency and the glottal peak flow to increase, whereas the fundamental frequency in the Bernoulli-based model does not change with increasing pressure. 相似文献

6.

Relationship between aerodynamic measures of glottal efficiency and stroboscopic findings in asymptomatic singing students

Donna S. Lundy Soham Roy Roy R. Casiano Joseph Evans Paula A. Sullivan Jun W. Xue 《Journal of voice》2000,14(2):178-183

Singing requires exquisite coordination between the respiratory and phonatory systems to efficiently control glottal airflow. Asymptomatic singing students underwent pulmonary function testing (PFT), videostrobolaryngoscopic examination, and measures of glottal efficiency (maximum phonation time [MPT], glottal flow rate [GFR], and phonation quotient [PQ]) performed in both a sung and spoken tone. Pulmonary function and glottal efficiency values were within reported normative data for professional singers. However, sung tones were made with significantly higher GFR and PQ and lower PQ than spoken tones. The mean GFR was not related to the degree of glottal closure (by videostrobolaryngoscopy) or underlying pulmonary support. 相似文献

7.

Glottal closure, transglottal airflow, and voice quality in healthy middle-aged women

Maria Sdersten Stellan Hertega rd Britta Hammarberg 《Journal of voice》1995,9(2)

Seventeen healthy women, 45 to 61 years old, were examined using videofiberstroboscopy during phonation at three loudness levels. Two phoniatricians evaluated glottal closure using category and ratio scales. Transglottal airflow was studied by inverse filtering of the oral airflow signal recorded in a flow mask (Glottal Enterprises System) during the spoken phrase /ba:pa:pa:pa:p/ at three loudness levels. Subglottal pressure was estimated from the intraoral pressure during p occlusion. Running speech and the repeated /pa:/ syllables were perceptually evaluated by three speech pathologists regarding breathiness, hypo-, and hyperfunction, using continuous scales. Incomplete glottal closure was found in 35 of 46 phonations (76%). The degree of glottal closure increased significantly with raised loudness. Half of the women closed the glottis completely during loud phonation. Posterior glottal chink (PGC) was the most common gap configuration and was found in 28 of 46 phonations (61%). One third of the PGCs were in the cartilaginous glottis (PGCc) only. Two thirds extended into the membranous portion (PGCm); most of these occurred during soft phonation. Peak flow, peak-to-peak (AC) flow, and the maximum rate of change for the flow in the closing phase increased significantly with raised loudness. Minimum flow decreased significantly from normal to loud voice. Breathiness decreased with increased loudness. The results suggest that the incomplete closure patterns PGCc and PGCm during soft phonation ought primarily to be regarded as normal for Swedish women in this age group. 相似文献

8.

The abduction quotient related to vocal quality

Ronald C. Scherer Ingo R. Titze 《Journal of voice》1987,1(3)

The abduction quotient, a measure of effective glottal width, was obtained for electroglottographic recordings from a professional operatic baritone singer. The subject produced repeated tokens of the voice qualities breathy, normal, and pressed (or constricted) in both a speech and a singing manner. In the singing manner, the subject produced the three vocal qualities at three pitch levels and three loudness levels. The abduction quotient decreased from breathy to pressed voice, suggesting that the measure corresponds to effective glottal width. The measure was found to be consistently low during all conditions of singing, suggesting that the subject produced all singing tokens with relatively strong laryngeal adduction at the vocal process level. Although the results of this study support the validity and usefulness of the abduction quotient, further verification is needed. 相似文献

9.

Electroglottographic tracking of phonatory response to botox™

Kimberly V. Fisher Ronald C. Scherer Paul R. Swank Cheryl Giddens Dava Patten 《Journal of voice》1999,13(2):203

Botox injection into the thyroarytenoid muscle is thought to alter the glottal competence and laryngeal adduction of patients with adductor spasmodic dysphonia (ADSD). Hypofunctional responses to treatment have been rated subjectively and inferred from postinjection breathy voice, aphonia, midline glottal gap, or subclinical aspiration. Clinical experience suggests that temporary hypofunction varies in duration and severity among patients. This study used electroglottographic measures to examine changes over time in glottal competence during the relatively stable phonation produced by 5 patients with ADSD. Hierarchical linear modeling was used to test 3 hypotheses: (a) that reduced laryngeal adduction would occur during the first 3 weeks postinjection, followed by a reversal; (b) that patients' hypofunctional response curves would differ one from another; and (c) that changes in adduction, if present, would be related to changes in severity ratings of ADSD symptoms. For 3 participants, significant hypoadduction occurred after injection and reversed toward preinjection level over an 8-week period. Two participants demonstrated a flat or increasing vocal fold contact response curve during the early postinjection period. Observations were consistent with the previously reported differences and possibly complex relation between the resolution of breathy hypofunction and ultimate return of ADSD symptoms. 相似文献

10.

In Vivo Quantification of the Intraglottal Pressure: Modal Phonation and Voice Onset

《Journal of voice》2020,34(4):645.e19-645.e39

Intraglottal pressure is the driving force of vocal fold vibration. Its time course during the open phase of the vibratory cycle is essential in the mechanics of phonation, but measuring it directly is difficult and may hinder spontaneous voicing. However, it can be computed from the in vivo measured transglottal flow and glottal area (hence the air particle velocity) on the basis of the Bernoulli energy law and the interaction with the inertance of the vocal tract. As to sustained modal phonation, calculations are presented for the two possible shapes of glottal duct: convergent and divergent, including absolute calibration in order to obtain quantitative physical values. Whatever the glottal duct configuration, the calculations based on measured values of glottal area and air flow show that the integrated intraglottal pressure during the opening phase systematically exceeds that during the closing phase, which is the basic condition for sustaining vocal fold oscillation. The key point is that the airflow curve is skewed to the right relative to the glottal area curve. The skewing results from air compressibility and vocal tract inertance. The intraglottal pressure becomes negative during the closing phase. As to the soft (or physiological) voice onset, a similar approach shows that the integrated pressure differences (opening phase − closing phase) actually increase as the onset progresses, and this applies to the results based on Bernoulli's energy law as well as to those based on the interaction with the inertance of the vocal tract. Furthermore and similarly, the phase lead of the pressure wave with respect to the glottal opening progressively increases. The underlying explanation lies in the progressively increasing skewing of the airflow curve to the right with respect to the glottal area curve. 相似文献

11.

Electroglottographic tracking of phonatory response to Botox.

K V Fisher R C Scherer P R Swank C Giddens D Patten 《Journal of voice》1999,13(2):203-218

Botox injection into the thyroarytenoid muscle is thought to alter the glottal competence and laryngeal adduction of patients with adductor spasmodic dysphonia (ADSD). Hypofunctional responses to treatment have been rated subjectively and inferred from postinjection breathy voice, aphonia, midline glottal gap, or subclinical aspiration. Clinical experience suggests that temporary hypofunction varies in duration and severity among patients. This study used electroglottographic measures to examine changes over time in glottal competence during the relatively stable phonation produced by 5 patients with ADSD. Hierarchical linear modeling was used to test 3 hypotheses: (a) that reduced laryngeal adduction would occur during the first 3 weeks postinjection, followed by a reversal; (b) that patients' hypofunctional response curves would differ one from another; and (c) that changes in adduction, if present, would be related to changes in severity ratings of ADSD symptoms. For 3 participants, significant hypoadduction occurred after injection and reversed toward preinjection level over an 8-week period. Two participants demonstrated a flat or increasing vocal fold contact response curve during the early postinjection period. Observations were consistent with the previously reported differences and possibly complex relation between the resolution of breathy hypofunction and ultimate return of ADSD symptoms. 相似文献

12.

Relationship among glottal area, static supraglottic compression, and laryngeal function studies in unilateral vocal fold paresis and paralysis 总被引：1，自引：0，他引：1

Steven Bielamowicz Ritu Kapoor Jerome Schwartz Sheila V. Stager 《Journal of voice》2004,18(1):138-145

In this study, we evaluated the relationship between laryngeal function measures and glottal gap ratio and normalized measures of supraglottic behaviors in patients with unilateral vocal fold paresis (UVFP). Thirty-one patients were found to have unilateral vocal fold paresis by videoendoscopy and laryngeal electromyography, and 13 controls participated in this study. Patients with UVFP demonstrated significantly larger glottal gap ratios (p = 0.016) than control subjects. The nonparalyzed or contralateral vocal fold was associated with significantly more static false vocal fold compression (p = 0.03) compared with the paralyzed vocal fold or with the controls. Patients with unilateral vocal fold paresis were divided into subgroups: those with normal or abnormal maximum phonation time, flow, or pressure measures. Smaller glottal gap ratios were identified in patients with normal maximum phonation times and flow measures. Greater false vocal fold activity was identified in unilateral vocal fold paresis patients with normal laryngeal function measures than in unilateral vocal fold paresis patients with abnormal measures. These findings suggest that some patients with documented unilateral paresis and glottal incompetence can compensate for vocal fold weakness such that their acoustic and aerodynamic measures are normal. 相似文献

13.

Simultaneous analysis of vocal fold vibration and transglottal airflow: exploring a new experimental setup

Svante Granqvist Stellan Hertegrd Hans Larsson Johan Sundberg 《Journal of voice》2003,17(3):319-330

The purpose of this study was to develop an analysis system for studying the relationship between vocal fold vibration and the associated transglottal airflow. Recordings of airflow, electroglottography (EGG), oral air pressure, and acoustic signals were performed simultaneously with high-speed imaging at a rate of approximately 1900 frames/s. Inverse filtered airflow is compared with the simultaneous glottal area extracted from the high-speed image sequence. The accuracy of the synchronization between the camera images and the foot pedal synchronization pulse was examined, showing that potential synchronization errors increase with time distance to the synchronization pulse. Therefore, analysis was limited to material near the synchronization pulse. Results corroborate previous predictions that air flow lags behind area, but also they reveal that relationships between these two entities may be complex and apparently varying with phonation mode. 相似文献

14.

The effects of rehydration on phonation in excised canine larynges

Jack Jiang Jennie Ng David Hanson 《Journal of voice》1999,13(1):51-59

Experiments using excised canine larynges were conducted to study the restoration of vocal efficiency in dehydrated larynges. Excised larynges were dehydrated with warm, dry air to the point that airflow through the approximated vocal folds would not entrain the folds to produce phonation. The dehydrated vocal folds were then bathed in a saline solution. The rehydrated larynges were then remounted on the bench apparatus that enabled phonation with a constant humidified airflow, and measurements were made of phonation threshold pressure, glottal airflow, and amplitude. Hydration resulted in significantly increased efficiency and decrease in phonation threshold pressure. The findings confirm clinical impressions that hydration is critical in the physiology of normal phonation. 相似文献

15.

Estimating perceived phonatory pressedness in singing from flow glottograms

Johan Sundberg Margareta Thaln Paavo Alku Erkki Vilkman 《Journal of voice》2004,18(1):56-62

The normalized amplitude quotient (NAQ), defined as the ratio between the peak-to-peak amplitude of the flow pulse and the negative peak amplitude of the differentiated flow glottogram and normalized with respect to period time, has been shown to be related to glottal adduction. Glottal adduction, in turn, affects mode of phonation and hence perceived phonatory pressedness. The relationship between NAQ and perceived phonatory pressedness was analyzed in a material collected from a professional female singer and singing teacher who sang a triad pattern in breathy, flow, neutral, and pressed phonation in three different loudness conditions (soft, middle, loud). In addition, she also sang the same triad pattern in four different styles of singing, classical, pop, jazz, and blues, in the same three loudness conditions. A panel of experts rated the degree of perceived phonatory press along visual analogue scales. Comparing the obtained mean rated pressedness ratings with the mean NAQ values for the various triads showed that about 73% of the variation in perceived pressedness could be accounted for by variations of NAQ. 相似文献

16.

Glottal configuration associated withfundamental frequency and vocal register

Thomas Murry Jie Jie Xu Gayle E. Woodson 《Journal of voice》1998,12(1):44-49

Simultaneous measurements of mean airflow rate, vocal intensityand fundamental frequency were made during flexible video endoscopic recording of the vowel /i/ sustained in two vocal registers, modal and falsetto. The glottal closure patterns of four males and four females were evaluated by visually inspecting the video images. Acoustic signals were recorded and analyzed to verify the frequency and intensity criteria. Aerodynamic analysis of mean airflow rate was done via Rothenberg mask and commercial software. Incomplete glottic closure was common in both males and females. The degree of closure was significantly higher for modal samples than for falsetto samples with frequency and intensity held constant. The shape of the glottal closure did not vary with changes in the mode of phonation. As expected, the mean airflow rate increased with decreased glottal closure. The results suggest that incomplete glottic closure should be considered as a normal glottal configuration in high frequency modal and falsetto phonation. Moreover, hourglass and spindle glottal configurations may also be found in both the modal and falsetto registers of normal subjects. These results also confirm the positive relationships between degree of glottal gap and mean airflow rate. Thus, mean airflow rate may be regarded as a criterion for judging degree of glottal closure. 相似文献

17.

Vocal intensity in speakers and singers.

I R Titze J Sundberg 《The Journal of the Acoustical Society of America》1992,91(5):2936-2946

Vocal intensity is studied as a function of fundamental frequency and lung pressure. A combination of analytical and empirical models is used to predict sound pressure levels from glottal waveforms of five professional tenors and twenty five normal control subjects. The glottal waveforms were obtained by inverse filtering the mouth flow. Empirical models describe features of the glottal flow waveform (peak flow, peak flow derivative, open quotient, and speed quotient) in terms of lung pressure and phonation threshold pressure, a key variable that incorporates the Fo dependence of many of the features of the glottal flow. The analytical model describes the contributions to sound pressure levels SPL by the vocal tract. Results show that SPL increases with Fo at a rate of 8-9 dB/octave provided that lung pressure is raised proportional to phonation threshold pressure. The SPL also increases at a rate of 8-9 dB per doubling of excess pressure over threshold, a new quantity that assumes considerable importance in vocal intensity calculations. For the same excess pressure over threshold, the professional tenors produced 10-12 dB greater intensity than the male nonsingers, primarily because their peak airflow was much higher for the same pressure. A simple set of rules is devised for predicting SPL from source waveforms. 相似文献

18.

Voice Characteristics of Young Girl Role in Kunqu Opera

《Journal of voice》2019,33(6):945.e19-945.e25

Three electroglottographic parameters, fundamental frequency, contact quotient, and speed quotient were analyzed for two singers of Young girl role in Kunqu Opera. Each singer performed three conditions, singing, stage speech, and reading lyrics. The phonation types adopted in different conditions were explored based on electroglottographic parameters. Fundamental frequency, contact quotient, and speed quotient showed different distributions among conditions. Five phonation types were used in singing and stage speech, which include (1) breathy voice, (2) modal voice with low degree of posterior glottal adduction, (3) modal voice, (4) falsetto, and (5) falsetto with high degree of posterior glottal adduction. The phonation strategies partly showed differences between singers. Different phonation type collocations were employed in singing and stage speech. The relationship between phonation types and pitch was complex. The phonation types actually used were different from and more complex than those in traditional Kunqu Opera singing theory. 相似文献

19.

Accuracy and variability of acoustic measures of voicing onset

Francis AL Ciocca V Yu JM 《The Journal of the Acoustical Society of America》2003,113(2):1025-1032

Five commonly used methods for determining the onset of voicing of syllable-initial stop consonants were compared. The speech and glottal activity of 16 native speakers of Cantonese with normal voice quality were investigated during the production of consonant vowel (CV) syllables in Cantonese. Syllables consisted of the initial consonants /ph/, /th/, /kh/, /p/, /t/, and /k/ followed by the vowel /a/. All syllables had a high level tone, and were all real words in Cantonese. Measurements of voicing onset were made based on the onset of periodicity in the acoustic waveform, and on spectrographic measures of the onset of a voicing bar (f0), the onset of the first formant (F1), second formant (F2), and third formant (F3). These measurements were then compared against the onset of glottal opening as determined by electroglottography. Both accuracy and variability of each measure were calculated. Results suggest that the presence of aspiration in a syllable decreased the accuracy and increased the variability of spectrogram-based measurements, but did not strongly affect measurements made from the acoustic waveform. Overall, the acoustic waveform provided the most accurate estimate of voicing onset; measurements made from the amplitude waveform were also the least variable of the five measures. These results can be explained as a consequence of differences in spectral tilt of the voicing source in breathy versus modal phonation. 相似文献

20.

Glottal characteristics of male speakers: acoustic correlates and comparison with female data.

H M Hanson E S Chuang 《The Journal of the Acoustical Society of America》1999,106(2):1064-1077

Acoustic measurements believed to reflect glottal characteristics were made on recordings collected from 21 male speakers. The waveforms and spectra of three nonhigh vowels (/ae, lambda, epsilon/) were analyzed to obtain acoustic parameters related to first-formant bandwidth, open quotient, spectral tilt, and aspiration noise. Comparisons were made with previous results obtained for 22 female speakers [H. M. Hanson, J. Acoust. Soc. Am. 101, 466-481 (1997)]. While there is considerable overlap across gender, the male data show lower average values and less interspeaker variation for all measures. In particular, the amplitude of the first harmonic relative to that of the third formant is 9.6 dB lower for the male speakers than for the female speakers, suggesting that spectral tilt is an especially significant parameter for differentiating male and female speech. These findings are consistent with fiberscopic studies which have shown that males tend to have a more complete glottal closure, leading to less energy loss at the glottis and less spectral tilt. Observations of the speech waveforms and spectra suggest the presence of a second glottal excitation within a glottal period for some of the male speakers. Possible causes and acoustic consequences of these second excitations are discussed. 相似文献