首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 46 毫秒
1.
The relationship of lung pressure, fundamental frequency, peak airflow, open quotient, and maximal flow declination rate to vocal intensity for a normal speaking, young male control group and an elderly male group was investigated. The control group consisted of 17 healthy male subjects with a mean age of 30 years and the elderly group consisted of 11 healthy male subjects with a mean age of 77 years. Data were collected at three levels of vocal intensity: soft, comfortable, and loud, corresponding to 25%, 50%, and 75% of dynamic range, respectively. Phonational threshold pressure and lung pressure were obtained using the intraoral technique. The oral airflow waveform was inverse filtered to provide an approximation to the glottal airflow waveform from which measures of fundamental frequency, peak airflow, open quotient, and maximal flow declination rate were determined. Excess lung pressure was calculated as lung pressure minus estimated phonational threshold pressure. The results show for both groups an increase in sound pressure level across the conditions, with corresponding increases in lung pressure, excess lung pressure, fundamental frequency, peak airflow, and maximal flow declination rate. Open quotient decreased with increasing vocal intensity. Lung pressure, sound pressure level, and peak airflow were all found to be significantly greater for the control group than for the elderly group at each condition. Open quotient was found to be significantly lower in the control group than in the elderly group at each condition. No significant difference was observed for excess lung pressure, phonational threshold pressure, fundamental frequency, or maximal flow declination rate between the two groups. These results show that a difference in vocal intensity does exist between young and elderly voices and that this difference is the result of differences in lung pressure, peak airflow, and open quotient.  相似文献   

2.
Measurements on the inverse filtered airflow waveform (the "glottal waveform") and of estimated average transglottal pressure and glottal airflow were made from noninvasive recordings of productions of syllable sequences in soft, normal, and loud voice for 25 male and 20 female speakers. Statistical analyses showed that with change from normal to loud voice, both males and females produced loud voice with increased pressure, accompanied by increased ac flow and increased maximum airflow declination rate. With change from normal voice, soft voice was produced with decreased pressure, ac flow and maximum airflow declination rate, and increased dc and average flow. Within the loudness conditions, there was no significant male-female difference in air pressure. Several glottal waveform parameters separated males and females in normal and loud voice. The data indicate higher ac flow and higher maximum airflow declination rate for males. In soft voice, the male and female glottal waveforms were more alike, and there was no significant difference in maximum airflow declination rate. The dc flow did not differ significantly between males and females. Possible relevance to biomechanical differences and differences in voice source characteristics between males and females and across loudness conditions is discussed.  相似文献   

3.
Acoustic measurements believed to reflect glottal characteristics were made on recordings collected from 21 male speakers. The waveforms and spectra of three nonhigh vowels (/ae, lambda, epsilon/) were analyzed to obtain acoustic parameters related to first-formant bandwidth, open quotient, spectral tilt, and aspiration noise. Comparisons were made with previous results obtained for 22 female speakers [H. M. Hanson, J. Acoust. Soc. Am. 101, 466-481 (1997)]. While there is considerable overlap across gender, the male data show lower average values and less interspeaker variation for all measures. In particular, the amplitude of the first harmonic relative to that of the third formant is 9.6 dB lower for the male speakers than for the female speakers, suggesting that spectral tilt is an especially significant parameter for differentiating male and female speech. These findings are consistent with fiberscopic studies which have shown that males tend to have a more complete glottal closure, leading to less energy loss at the glottis and less spectral tilt. Observations of the speech waveforms and spectra suggest the presence of a second glottal excitation within a glottal period for some of the male speakers. Possible causes and acoustic consequences of these second excitations are discussed.  相似文献   

4.
Normative measures of open quotient, speed quotient, maximum flow declination rate (MFDR), and subglottal pressure were determined for 75 children between the ages of 6 years 0 months and 10 years 11 months. The participants produced a sustained /a/ at low, comfort, and high pitches for a minimum of 5 seconds, and five to seven repetitions of /pa/ at low, comfort, and high pitches. No statistically significant differences were found in the mean measures of any aerodynamic variables (open quotient, speed quotient, maximum flow declination rate, subglottal pressure) between the frequency levels (low, comfort, high pitches). Also, no strong evidence (P > .05) exists that age or sex effect differed between the frequency levels (low, comfort, high) for any of the aerodynamic measures. For /a/ response tasks, mean open quotient measures increased slightly from low to comfort frequency and from comfort to high frequency. Mean speed quotient measures showed minimal differences between low and comfort frequency, with decreased mean measures for high frequency. Mean MFDR measures increased from low to comfort frequency and from comfort to high frequency. Mean subglottal pressure measures increased slightly from low to comfort frequency and from comfort to high frequency.  相似文献   

5.
A stratified random sample of 20 males and 20 females matched for physiologic factors and cultural-linguistic markers was examined to determine differences in formant frequencies during prolongation of three vowels: [a], [i], and [u]. The ethnic and gender breakdown included four sets of 5 male and 5 female subjects comprised of Caucasian and African American speakers of Standard American English, native Hindi Indian speakers, and native Mandarin Chinese speakers. Acoustic measures were analyzed using the Computerized Speech Lab (4300B) from which formant histories were extracted from a 200-ms sample of each vowel token to obtain first formant (F1), second formant (F2), and third formant (F3) frequencies. Significant group differences for the main effect of culture and race were found. For the main effect gender, sexual dimorphism in vowel formants was evidenced for all cultures and races across all three vowels. The acoustic differences found are attributed to cultural-linguistic factors.  相似文献   

6.
A stratified random sample of 20 males and 20 females matched for physiological factors and cultural-linguistic markers were examined to determine differences in fundamental frequency and spectral characteristics during prolongation of three vowels: [a], [i], and [u]. The ethnic-gender breakdown included four sets of five male and five female subjects comprised of Caucasian and African-American speakers of standard American English, native Hindi Indian speakers, and native Mandarin Chinese speakers. Acoustic measures were analyzed using the Multidimensional Voice Program (Kay Elemetrics, Lincoln Park, NJ) (Model 4305) from which fundamental frequency and associated acoustic spectra were extracted from a 200-ms sample of each vowel token. Statistically significant group differences for the main effects of culture, race, and gender were found. The acoustic differences found are attributed to biomechanical, physiological, cultural, and linguistic factors.  相似文献   

7.
Noninvasive measures of vocal fold activity are useful for describingnormal and disordered voice production. Measures of open and speed quotient from glottal airflow and electroglottographic (EGG) waveforms have been used to describe timing events associated with vocal fold vibration. To date, there has been little consistency in the measurement criteria used to calculate quotient values. In this study, criteria of 20% and 50% were applied to the AC amplitude of glottal airflow and inverted EGG waveforms for measurement of open quotient. Criteria of 20%, 50%, and 80%, and a midslope criterion that segmented the waveform between 20% and 80% of the waveform amplitude, were used for the calculation of speed quotient. Subjects produced waveforms at sound pressure levels (SPL) of 70, 75, 80 and 85 dB. Results indicated that approximations of open quotient obtained from the glottal airflow waveform significantly decreased using both the 20% and 50% criteria as SPL increased from 80 to 85 dB. No significant changes were found in open quotient from the EGG waveform as a function of SPL. Results of speed quotient measures from the glottal airflow and EGG waveforms showed a generally increasing trend as SPL increased, although the differences were not statistically significant. The data suggest that the signal type, measurement criterion and SPL must be considered in interpreting quotient measures.  相似文献   

8.
Changes in magnitude and variability of duration, fundamental frequency, formant frequencies, and spectral envelope of children's speech are investigated as a function of age and gender using data obtained from 436 children, ages 5 to 17 years, and 56 adults. The results confirm that the reduction in magnitude and within-subject variability of both temporal and spectral acoustic parameters with age is a major trend associated with speech development in normal children. Between ages 9 and 12, both magnitude and variability of segmental durations decrease significantly and rapidly, converging to adult levels around age 12. Within-subject fundamental frequency and formant-frequency variability, however, may reach adult range about 2 or 3 years later. Differentiation of male and female fundamental frequency and formant frequency patterns begins at around age 11, becoming fully established around age 15. During that time period, changes in vowel formant frequencies of male speakers is approximately linear with age, while such a linear trend is less obvious for female speakers. These results support the hypothesis of uniform axial growth of the vocal tract for male speakers. The study also shows evidence for an apparent overshoot in acoustic parameter values, somewhere between ages 13 and 15, before converging to the canonical levels for adults. For instance, teenagers around age 14 differ from adults in that, on average, they show shorter segmental durations and exhibit less within-subject variability in durations, fundamental frequency, and spectral envelope measures.  相似文献   

9.
SUMMARY: Acoustic pharyngometry evaluates the geometry of the vocal tract with acoustic reflections and provides information about vocal tract cross-sectional area and volume from lip to the glottis. Variations in vocal tract diameters are needed for speech scientists to validate various acoustic models and for medical professionals since the advent of endoscopic surgical techniques. Race is known to be one of the most important factors affecting the oral and nasal structures. This study compared vocal tract dimensions of White American, African American, and Chinese male and female speakers. One hundred and twenty healthy adult subjects with equal numbers of men and women were divided among three races. Subjects were controlled for age, gender, height, and weight. Six dimensional parameters of the speakers' vocal tract cavities were measured with acoustic reflection technology (AR). Significant gender and race main effects were found in certain vocal tract dimensions. The findings of this study now provide speech scientists, speech-language pathologists, and other health professionals with a new anatomical database of vocal tract variations for adult speakers from three different races.  相似文献   

10.
A comparison of type I thyroplasty and arytenoid adduction   总被引:1,自引:0,他引:1  
Glottal incompetence is a common laryngeal disorder causing impaired swallowing and phonation. The resultant voice has been characterized as weak and breathy with a restricted pitch range. Currently, medialization thyroplasty and arytenoid adduction are two of the surgical treatments for patients with glottal incompetence. However, few studies have evaluated the changes in objective measures of speech with type I thyroplasty and arytenoid adduction. In this study, 59 patients with glottal incompetence underwent either type I thyroplasty or arytenoid adduction. Acoustic (jitter, shimmer, and harmonics-to-noise ratio) and aerodynamic (airflow, subglottic pressure, and glottal resistance) measures were obtained both pre- and postoperatively. No significant differences were found among acoustic or aerodynamic measures for operation type. However, a significant pre/postsurgery effect was observed for translaryngeal airflow. In addition, no significant differences were found among the measures for patients with traditional compared with nontraditional operative indications. Patients who developed glottal insufficiency due to previous laryngeal surgery (e.g., vocal fold stripping) demonstrated no statistically significant improvement in acoustic or aerodynamic measures following thyroplasty or arytenoid adduction.  相似文献   

11.
Vocal perturbation, harmonics-to-noise, and intensity measures were obtained for 10 subjects during three experimental tasks: (a) prolonged /a/, (b) /pa/ with vowel prolonged, and (c) same as (b) with subjects wearing a pneumotachographic mask and oral pressure tube inserted between the lips. There were no statistically significant differences among the experimental conditions for any of the measures. The findings suggest that a single task may be used to obtain airflow, oral pressure, and acoustic measures of vocal performance. Observed differences in jitter and harmonic-to-noise means for the male and female speakers are discussed.  相似文献   

12.
The purpose of this retrospective study is to describe results of acoustic, aerodynamic, and videostroboscopic measures in patients complaining of laryngeal fatigue. Data were collected from 88 patients whose primary complaint was chronic laryngeal fatigue in the absence of visible laryngeal pathologies. The results revealed an abnormally high airflow rate and decreased maximum phonation time. An anterior glottal chink, anterior and posterior glottal chinks, or spindle-shaped glottal closure were found in 61% of the subjects.  相似文献   

13.
This study was primarily motivated by the need to establish the correspondence between auditory abilities and laryngeal function. Just noticeable differences (JNDs) were obtained for the open quotient and speed quotient of the glottal flow waveform. The quotients were synthesized for both the glottal flow alone, and for the output pressure signal after the glottal flow signal was applied to the synthesis vocal tract for the vowel /a/. Six adult men and five adult women, all teachers of singing, participated as listeners. An adaptive auditory listening procedure was used to estimate JNDs for the four types of stimuli. The group average JND values were as follows. For the standard open quotient value of .6000, JND = 0.0264 (SD = .010) for the glottal flow and JND = 0.0344 (SD = .020) for the output pressure. For the open quotient, there was no statistically significant difference between genders or between the types of signals. For the standard speed quotient value of 2.000, JND = 0.154 (SD = .043) for the glottal flow and JND = 0.319 (SD = .167) for the output pressure. For the speed quotient, there was no statistically significant difference between genders, but the difference between types of stimulus (glottal flow versus output pressure) was significant (p <.006). The variance among the JND values was significantly larger for the output pressure stimuli compared to the glottal flow stimuli for both the open quotient and the speed quotient.  相似文献   

14.
The attainment of a feminine-sounding voice is a highly desirable goal among male-to-female transgender (MFT) persons, but this goal may be difficult for many to accomplish. The characteristics associated with a feminine vocal quality include increases in fundamental frequency and in vocal breathiness. In this study, we used inverse-filtering of the airflow signal to indirectly assess vocal fold function in 13 MFT persons. Each participant was asked to sustain the vowel /a/ first in her biological male voice and then again in her female voice. In addition, these vowel productions were compared with vowels produced by age-matched biologic women and men. The results of the study revealed a significant increase in maximum flow declination rate during female voice production. Perceptual ratings of a feminine voice were associated with a fundamental frequency (F0) of 180 Hz or greater, although F0 did not differ significantly between male and female voice production. These results are discussed relative to the mechanisms that obtained a feminine-sounding voice.  相似文献   

15.
Five professional operatic baritone singers' voice-source characteristics were analyzed by means of inverse filtering of the flow signal as captured by a flow mask. The subjects sang a long sustained diminuendo, from loudest to softest, three times on the vowels [a:] and [ae:] at fundamental frequencies representing 25%, 50%, and 75% of their total pitch range as measured in semitones. During the diminuendos, they repeatedly inserted the consonant [p] so that associated subglottal pressures could be estimated from the oral pressure during the p-occlusions. Pooling the three takes of each condition, ten subglottal pressures, equidistantly spaced between highest and lowest, were selected for analysis. Sound-pressure levels (SPL), peak-to-peak glottal airflow, maximum flow declination rate, closed quotient, glottal dc flow, and the level difference between the two lowest partials of the source spectrum (H1-H2) were determined. All parameters except the glottal dc flow showed a systematic variation with subglottal pressure or the fractional excess pressure over threshold. The results are given in terms of equations representing the average across subjects for the relation between subglottal pressure and each of the mentioned voice-source parameters.  相似文献   

16.
Photoglottographic measures in parkinson''s disease   总被引:1,自引:0,他引:1  
This study examines the usefulness of photoglottographic measures in reflecting the phonatory effect of Parkinson's disease. In the first experiment, data obtained by photoglottography were compared between 15 male patients with Parkinson's disease and 15 normal male speakers of similar age. Six photoglottographic parameters, mean open quotient (OQ), mean speed quotient (SQ), perturbation of open quotient (POQ), perturbation of speed quotient (PSQ), frequency perturbation ratio (FPR), and amplitude perturbation ratio (APR), in sustained vowel phonation were investigated. Increased SQ (t = -2.731, df = 28, P = 0.011) and POQ (t = -2.584, df = 28, P = 0.015) were significantly associated with data from patients in comparison to normal speakers. The FPR, APR, and OQ were not significantly different between normal subjects and patients. A follow-up experiment, including 12 female and 19 male patients with Parkinson's disease, was designed to evaluate the sensitivity of SQ and POQ in detecting vocal dysfunction. The sensitivity of SQ was found to be relatively high (93.5%), while that of POQ was low (45.2%). Methodological issues regarding the effects of gender, age, stage of the disease, and treatment on photoglottographic measures in Parkinson's disease were discussed.  相似文献   

17.
Vocal fold vibratory asymmetry is often associated with inefficient sound production through its impact on source spectral tilt. This association is investigated in both a computational voice production model and a group of 47 human subjects. The model provides indirect control over the degree of left-right phase asymmetry within a nonlinear source-filter framework, and high-speed videoendoscopy provides in vivo measures of vocal fold vibratory asymmetry. Source spectral tilt measures are estimated from the inverse-filtered spectrum of the simulated and recorded radiated acoustic pressure. As expected, model simulations indicate that increasing left-right phase asymmetry induces steeper spectral tilt. Subject data, however, reveal that none of the vibratory asymmetry measures correlates with spectral tilt measures. Probing further into physiological correlates of spectral tilt that might be affected by asymmetry, the glottal area waveform is parameterized to obtain measures of the open phase (open/plateau quotient) and closing phase (speed/closing quotient). Subjects' left-right phase asymmetry exhibits low, but statistically significant, correlations with speed quotient (r=0.45) and closing quotient (r=-0.39). Results call for future studies into the effect of asymmetric vocal fold vibration on glottal airflow and the associated impact on voice source spectral properties and vocal efficiency.  相似文献   

18.
Simultaneous measurements of mean airflow rate, vocal intensityand fundamental frequency were made during flexible video endoscopic recording of the vowel /i/ sustained in two vocal registers, modal and falsetto. The glottal closure patterns of four males and four females were evaluated by visually inspecting the video images. Acoustic signals were recorded and analyzed to verify the frequency and intensity criteria. Aerodynamic analysis of mean airflow rate was done via Rothenberg mask and commercial software. Incomplete glottic closure was common in both males and females. The degree of closure was significantly higher for modal samples than for falsetto samples with frequency and intensity held constant. The shape of the glottal closure did not vary with changes in the mode of phonation. As expected, the mean airflow rate increased with decreased glottal closure. The results suggest that incomplete glottic closure should be considered as a normal glottal configuration in high frequency modal and falsetto phonation. Moreover, hourglass and spindle glottal configurations may also be found in both the modal and falsetto registers of normal subjects. These results also confirm the positive relationships between degree of glottal gap and mean airflow rate. Thus, mean airflow rate may be regarded as a criterion for judging degree of glottal closure.  相似文献   

19.
The effects of age, sex, and vocal tract configuration on the glottal excitation signal in speech are only partially understood, yet understanding these effects is important for both recognition and synthesis of speech as well as for medical purposes. In this paper, three acoustic measures related to the voice source are analyzed for five vowels from 3145 CVC utterances spoken by 335 talkers (8-39 years old) from the CID database [Miller et al., Proceedings of ICASSP, 1996, Vol. 2, pp. 849-852]. The measures are: the fundamental frequency (F0), the difference between the "corrected" (denoted by an asterisk) first two spectral harmonic magnitudes, H1* - H2* (related to the open quotient), and the difference between the "corrected" magnitudes of the first spectral harmonic and that of the third formant peak, H1* - A3* (related to source spectral tilt). The correction refers to compensating for the influence of formant frequencies on spectral magnitude estimation. Experimental results show that the three acoustic measures are dependent to varying degrees on age and vowel. Age dependencies are more prominent for male talkers, while vowel dependencies are more prominent for female talkers suggesting a greater vocal tract-source interaction. All talkers show a dependency of F0 on sex and on F3, and of H1* - A3* on vowel type. For low-pitched talkers (F0 < or = 175 Hz), H1* - H2* is positively correlated with F0 while for high-pitched talkers, H1* - H2* is dependent on F1 or vowel height. For high-pitched talkers there were no significant sex dependencies of H1* - H2* and H1* - A3*. The statistical significance of these results is shown.  相似文献   

20.
Vocal intensity is studied as a function of fundamental frequency and lung pressure. A combination of analytical and empirical models is used to predict sound pressure levels from glottal waveforms of five professional tenors and twenty five normal control subjects. The glottal waveforms were obtained by inverse filtering the mouth flow. Empirical models describe features of the glottal flow waveform (peak flow, peak flow derivative, open quotient, and speed quotient) in terms of lung pressure and phonation threshold pressure, a key variable that incorporates the Fo dependence of many of the features of the glottal flow. The analytical model describes the contributions to sound pressure levels SPL by the vocal tract. Results show that SPL increases with Fo at a rate of 8-9 dB/octave provided that lung pressure is raised proportional to phonation threshold pressure. The SPL also increases at a rate of 8-9 dB per doubling of excess pressure over threshold, a new quantity that assumes considerable importance in vocal intensity calculations. For the same excess pressure over threshold, the professional tenors produced 10-12 dB greater intensity than the male nonsingers, primarily because their peak airflow was much higher for the same pressure. A simple set of rules is devised for predicting SPL from source waveforms.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号