首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 78 毫秒
1.
Noninvasive measures of vocal fold activity are useful for describingnormal and disordered voice production. Measures of open and speed quotient from glottal airflow and electroglottographic (EGG) waveforms have been used to describe timing events associated with vocal fold vibration. To date, there has been little consistency in the measurement criteria used to calculate quotient values. In this study, criteria of 20% and 50% were applied to the AC amplitude of glottal airflow and inverted EGG waveforms for measurement of open quotient. Criteria of 20%, 50%, and 80%, and a midslope criterion that segmented the waveform between 20% and 80% of the waveform amplitude, were used for the calculation of speed quotient. Subjects produced waveforms at sound pressure levels (SPL) of 70, 75, 80 and 85 dB. Results indicated that approximations of open quotient obtained from the glottal airflow waveform significantly decreased using both the 20% and 50% criteria as SPL increased from 80 to 85 dB. No significant changes were found in open quotient from the EGG waveform as a function of SPL. Results of speed quotient measures from the glottal airflow and EGG waveforms showed a generally increasing trend as SPL increased, although the differences were not statistically significant. The data suggest that the signal type, measurement criterion and SPL must be considered in interpreting quotient measures.  相似文献   

2.
The present investigation was designed to examine the effect of change in vocal fold mass and stiffness on vocal fold vibration. To do this, the effect of variation in superior laryngeal nerve stimulation (SLNS) and recurrent laryngeal nerve stimulation (RLNS) was studied Photoglottography (PGG), electroglottography (EGG), and subglottic pressure (Psub) were measured in seven mongrel dogs using an in vivo canine model of phonation. The PGG, EGG, and Psub signals were examined at three frequencies (100, 130, and 160 Hz) for SLNS and RLNS, using a constant rate of air flow. Increasing SLNS, which caused a contraction of the cricothyroid muscle, produced a marked increase in F0, little change in Psub, an increase in open quotient (OQ), and a decrease in the closed quotient (CQ) of the glottal cycle. Increasing RLNS, which caused activation of the intrinsic laryngeal muscles, produced a modest increase in F0, a marked increase in Psub, no change in the OQ, and an increase in CQ. Phase quotient (Qp), which describes the interval between opening of the lower and upper fold margins, decreased with increasing RLNS and did not change significantly with increasing SLNS. Based upon changes in F0, Psub, OQ, CQ, and Qp, SLNS provides a physiologic correlate of the tension parameter Q, and RLNS provides a physiologic correlate of the parameter Psub in the Ishizaka and Flanagan two-mass model.  相似文献   

3.
Interpretation of electroglottography (EGG) as an index of glottal contact area has been complicated by difficulty obtaining independent validation measures. The purpose of this research was to implement a new simultaneous EGG/videostroboscopic technique for the evaluation of the relationship between a discontinuity in the opening phase of the EGG waveform with the onset of glottal opening viewed via videostroboscopy. The results support previous suggestions that this EGG discontinuity, when observed in nonpathologic individuals, usually marks the onset of glottal opening along the superior surface of the vocal folds.  相似文献   

4.
Vocal fold contact behavior was examined in separate groups of boys and girls through application of an electroglottograph(EGG). In general, a contact quotient (EGG duty cycle) showed minimal differences within and between boys and girls during sustained production of the vowels /i/, /u/, and /a/. The findings are discussed with respect to the laryngeal behavior of prepubescent children as well as the clinical utility and applicability of the EGG for examining phonatory behavior among young children.  相似文献   

5.
This study was primarily motivated by the need to establish the correspondence between auditory abilities and laryngeal function. Just noticeable differences (JNDs) were obtained for the open quotient and speed quotient of the glottal flow waveform. The quotients were synthesized for both the glottal flow alone, and for the output pressure signal after the glottal flow signal was applied to the synthesis vocal tract for the vowel /a/. Six adult men and five adult women, all teachers of singing, participated as listeners. An adaptive auditory listening procedure was used to estimate JNDs for the four types of stimuli. The group average JND values were as follows. For the standard open quotient value of .6000, JND = 0.0264 (SD = .010) for the glottal flow and JND = 0.0344 (SD = .020) for the output pressure. For the open quotient, there was no statistically significant difference between genders or between the types of signals. For the standard speed quotient value of 2.000, JND = 0.154 (SD = .043) for the glottal flow and JND = 0.319 (SD = .167) for the output pressure. For the speed quotient, there was no statistically significant difference between genders, but the difference between types of stimulus (glottal flow versus output pressure) was significant (p <.006). The variance among the JND values was significantly larger for the output pressure stimuli compared to the glottal flow stimuli for both the open quotient and the speed quotient.  相似文献   

6.
Five commonly used methods for determining the onset of voicing of syllable-initial stop consonants were compared. The speech and glottal activity of 16 native speakers of Cantonese with normal voice quality were investigated during the production of consonant vowel (CV) syllables in Cantonese. Syllables consisted of the initial consonants /ph/, /th/, /kh/, /p/, /t/, and /k/ followed by the vowel /a/. All syllables had a high level tone, and were all real words in Cantonese. Measurements of voicing onset were made based on the onset of periodicity in the acoustic waveform, and on spectrographic measures of the onset of a voicing bar (f0), the onset of the first formant (F1), second formant (F2), and third formant (F3). These measurements were then compared against the onset of glottal opening as determined by electroglottography. Both accuracy and variability of each measure were calculated. Results suggest that the presence of aspiration in a syllable decreased the accuracy and increased the variability of spectrogram-based measurements, but did not strongly affect measurements made from the acoustic waveform. Overall, the acoustic waveform provided the most accurate estimate of voicing onset; measurements made from the amplitude waveform were also the least variable of the five measures. These results can be explained as a consequence of differences in spectral tilt of the voicing source in breathy versus modal phonation.  相似文献   

7.
Vocal warm-up was studied in terms of changes in voice parameters during a 45-minute vocal loading session in the morning. The voices of a randomly chosen group of 40 female and 40 male young students were loaded by having them read a novel aloud. The exposure groups (5 females and 5 males per cell) consisted of eight combinations of the following factors: (1) low (25 +/- 5%) or high (65 +/- 5%) relative humidity of ambient air; (2) low [< 65 dB(SPL)] or high [> 65 dB(SPL)] speech output level during vocal loading; (3) sitting or standing posture during vocal loading. Two sets of voice samples were recorded: a resting sample before the loading session and a loading sample after the loading session. The material recorded consisted of /pa:ppa/ words produced normally, as softly and as loudly as possible in this order by all subjects. The long /a/ vowel of the test word was inverse-filtered to obtain the glottal flow waveform. Time domain parameters of the glottal flow [open quotient (OQ), closing quotient (CQ), speed quotient (SQ), fundamental frequency (F0)], amplitude domain parameters of the glottal flow [glottal flow (fAC) and its logarithm, minimum of the first derivative of the glottal flow (dpeak) and its logarithm, amplitude quotient (AQ), and a new parameter, CQAQ], intraoral pressure (p), and sound pressure level (SPL) values of the phonations were analyzed. Voice range profiles (VRP) and the singer's formant (g/G, a/A, cl/c, e1/e, g1/g for females/males) of the loud phonation were also measured. Statistically significant differences between the preloading and postloading samples could be seen in many parameters, but the differences depended on gender and the type of phonation. In females the values of CQ, AQ, and CQAQ decreased and the values of SQ and p increased in normal phonations; the values of fAC, dpeak, and SPL increased in soft phonations; the values of AQ and CQAQ decreased in loud phonations; the harmonic energy in the singer's formant region increased significantly at every pitch. In males the values of OQ and AQ decreased and the values of dpeak, F0, p, and SPL increased in normal phonations; the values of fAC and p increased in soft phonations. The changes could be interpreted as signs of a shift toward hyperfunctional voice production. Low humidity was associated with more hyperfunctional changes than high humidity. High output was associated with more hyperfunctional changes than low output. Sitting position was associated with an increasing trend at both margins of male VRP, whereas the case was the opposite for standing position.  相似文献   

8.
Singing requires exquisite coordination between the respiratory and phonatory systems to efficiently control glottal airflow. Asymptomatic singing students underwent pulmonary function testing (PFT), videostrobolaryngoscopic examination, and measures of glottal efficiency (maximum phonation time [MPT], glottal flow rate [GFR], and phonation quotient [PQ]) performed in both a sung and spoken tone. Pulmonary function and glottal efficiency values were within reported normative data for professional singers. However, sung tones were made with significantly higher GFR and PQ and lower PQ than spoken tones. The mean GFR was not related to the degree of glottal closure (by videostrobolaryngoscopy) or underlying pulmonary support.  相似文献   

9.
This study was designed to compare information on laryngeal vibrations obtained by high-speed filming, photoglottography (PGG), and electroglottography (ECG). Simultaneous glottographic signals and high-speed films were obtained from two subjects producing steady phonation. Measurements of glottal width were made at three points along the glottis in the anterior--posterior dimension and aligned with the other records. Results indicate that PGG and film measurements give essentially the same information for peak glottal opening and glottal closure. The EGG signal appears to reliably indicate vocal-fold contact. Together, PGG and EGG may provide much of the information obtained from high-speed filming as well as potentially detect horizontal phase differences during opening and closing.  相似文献   

10.
HearFones (HF) have been designed to enhance auditory feedback during phonation. This study investigated the effects of HF (1) on sound perceivable by the subject, (2) on voice quality in reading and singing, and (3) on voice production in speech and singing at the same pitch and sound level.

Test 1: Text reading was recorded with two identical microphones in the ears of a subject. One ear was covered with HF, and the other was free. Four subjects attended this test. Tests 2 and 3: A reading sample was recorded from 13 subjects and a song from 12 subjects without and with HF on. Test 4: Six females repeated [pa:p:a] in speaking and singing modes without and with HF on same pitch and sound level.

Long-term average spectra were made (Tests 1–3), and formant frequencies, fundamental frequency, and sound level were measured (Tests 2 and 3). Subglottic pressure was estimated from oral pressure in [p], and simultaneously electroglottography (EGG) was registered during voicing on [a:] (Test 4). Voice quality in speech and singing was evaluated by three professional voice trainers (Tests 2–4).

HF seemed to enhance sound perceivable at the whole range studied (0–8 kHz), with the greatest enhancement (up to ca 25 dB) being at 1–3 kHz and at 4–7 kHz. The subjects tended to decrease loudness with HF (when sound level was not being monitored). In more than half of the cases, voice quality was evaluated “less strained” and “better controlled” with HF. When pitch and loudness were constant, no clear differences were heard but closed quotient of the EGG signal was higher and the signal more skewed, suggesting a better glottal closure and/or diminished activity of the thyroarytenoid muscle.  相似文献   


11.
12.
The purpose of this exploratory study was to determine if laryngeal transillumination in combination with stroboscopy (strobophotoglottography; SPGG) is useful for (1) the visualization of vocal fold vibration (VFV) opening patterns, (2) the localization of initial vocal fold opening in horizontal glottal thirds (anterior, midmembranous, and posterior), (3) determination of the temporal correspondence of the so-called electroglottography (EGG)-knee and initial vocal fold separation, and, finally, (4) automatized quantitative measurements of glottal area function within endoscopic images. With stroboscopic transillumination, initial inferior vocal fold separation was detectable during the "closed" phase, where the vocal folds were still closed in the upper portion and therefore initial inferior vocal fold separation could not be visualized with usual laryngoscopy techniques. In the horizontal plane within similar fundamental frequencies in modal voice registers in two male subjects, localization of initial glottal opening depended on the voice types used (soft, normal, or pressed phonation). We found zipperlike posterior-to-anterior openings, initial midmembranous openings, initial anterior openings, as well as simultaneous initial opening of all three portions in the two healthy male adults examined. This technique proved to add temporal and spatial information to vocal fold opening patterns and extends our examination techniques to the very beginning of vocal fold opening at the inferior portion. Simultaneous electroglottogram tracking and comparison with bidirectionally illuminated stroboscopic images revealed a time-locked correspondence of the EGG-knee with the aforementioned initial inferior vocal fold separation. Bidirectional illumination combined with digital color extraction techniques allowed for image separation of subglottally and supraglottally illuminated structures. This facilitated vocal fold contour detection and automatized image processing, for example, for determination of glottal area function, and is considered to be a further step to objective automatized quantitative measurements within endoscopic images.  相似文献   

13.
This study examined the amount of jaw opening used by two groups of singers, those with less than 4 years of training (novice) and those with more than 8 years of training (experienced) in the Western tradition of opera and art song. Movement of the jaw in the superior-inferior plane was measured with the use of a lightweight head-mounted cephalostat with a strain gauge. The subjects spoke and then sung a carrier phrase "I say b(v)p," where (v) was each of three vowels, [a], [i], and [u]. The phrase was first spoken with a natural inflection and then sung on a repeated pitch at three notes from the low, medium, and high singing voice range. There was no statistically significant difference in jaw opening between the two groups of singers. Vowel was significant for jaw opening in both groups, with [a] being produced with more jaw opening than [i] or [u]. The voicing condition was also significant for jaw opening with greater jaw opening being used as pitch increased. In general the amount of jaw opening was smallest for the low singing voice condition and greatest for the high singing voice condition. The jaw opening most typically was less in the low voice condition than in the speech condition and then increased for both the medium and high voice tasks. All but two singers used more jaw opening on the [a] vowel than the other two vowels at all voicing conditions.  相似文献   

14.
This study explores resonance strategies used for the belting style and associated vocal fold vibratory patterns, for the vowels /e/, /a/, /i/, and /u/ on G4 and B4-flat. Acoustic spectra of belted vowels and their unoptimized, "speech-like" equivalents were compared. Vocal fold vibratory patterns were quantified using electroglottography. Results show that /a/ is inherently suitable for belting and requires no adjustment. For /e/, F2-H5 tuning was observed. For /i/, F1 was detuned from H1, enhancing also H2. For /u/, both F1 and F2 were raised to accomplish F2-H3 tuning. These results show that the loud, bright sound of the belting style is achieved by the implementation of resonance strategies that enhance higher harmonics. Electroglottography revealed that resonance strategies also result in raising the closed quotient (CQ) above 52%, an apparent threshold value for belting.  相似文献   

15.
Normative measures of open quotient, speed quotient, maximum flow declination rate (MFDR), and subglottal pressure were determined for 75 children between the ages of 6 years 0 months and 10 years 11 months. The participants produced a sustained /a/ at low, comfort, and high pitches for a minimum of 5 seconds, and five to seven repetitions of /pa/ at low, comfort, and high pitches. No statistically significant differences were found in the mean measures of any aerodynamic variables (open quotient, speed quotient, maximum flow declination rate, subglottal pressure) between the frequency levels (low, comfort, high pitches). Also, no strong evidence (P > .05) exists that age or sex effect differed between the frequency levels (low, comfort, high) for any of the aerodynamic measures. For /a/ response tasks, mean open quotient measures increased slightly from low to comfort frequency and from comfort to high frequency. Mean speed quotient measures showed minimal differences between low and comfort frequency, with decreased mean measures for high frequency. Mean MFDR measures increased from low to comfort frequency and from comfort to high frequency. Mean subglottal pressure measures increased slightly from low to comfort frequency and from comfort to high frequency.  相似文献   

16.
Vocal fold vibratory asymmetry is often associated with inefficient sound production through its impact on source spectral tilt. This association is investigated in both a computational voice production model and a group of 47 human subjects. The model provides indirect control over the degree of left-right phase asymmetry within a nonlinear source-filter framework, and high-speed videoendoscopy provides in vivo measures of vocal fold vibratory asymmetry. Source spectral tilt measures are estimated from the inverse-filtered spectrum of the simulated and recorded radiated acoustic pressure. As expected, model simulations indicate that increasing left-right phase asymmetry induces steeper spectral tilt. Subject data, however, reveal that none of the vibratory asymmetry measures correlates with spectral tilt measures. Probing further into physiological correlates of spectral tilt that might be affected by asymmetry, the glottal area waveform is parameterized to obtain measures of the open phase (open/plateau quotient) and closing phase (speed/closing quotient). Subjects' left-right phase asymmetry exhibits low, but statistically significant, correlations with speed quotient (r=0.45) and closing quotient (r=-0.39). Results call for future studies into the effect of asymmetric vocal fold vibration on glottal airflow and the associated impact on voice source spectral properties and vocal efficiency.  相似文献   

17.
Electroglottography is a common method for providing noninvasive measurements of glottal activity. The derivative of the electroglottographic signal, however, has not attracted much attention, although it yields reliable indicators of glottal closing instants. The purpose of this paper is to provide a guide to the usefulness of this signal. The main features that are to be found in this signal are presented on the basis of an extensive analysis of a database of items sung by 18 trained singers. Glottal opening and closing instants are related to peaks in the signal; the latter can be used to measure glottal parameters such as fundamental frequency and open quotient. In some cases, peaks are doubled or imprecise, which points to special (but by no means uncommon) glottal configurations. A correlation-based algorithm for the automatic measurement of fundamental frequency and open quotient using the derivative of electroglottographic signals is proposed. It is compared to three other electroglottographic-based methods with regard to the measurement of open quotient in inverse-filtered derived glottal flow. It is shown that agreement with the glottal-flow measurements is much better than most threshold-based measurements in the case of sustained sounds.  相似文献   

18.
A new method "simultaneous inverse filtering and model matching" (SIM) is proposed that allows one to calculate voice source measures without any user interaction. It is based on the discrete all-pole modeling (DAP) technique for inverse filtering (IF), which is modified to include a model of the glottal flow as integral part [LF model, Fant et al., STL-QPSR (Stockholm) 4/1985, 1-13 (1986)]. As the correct LF parameters are initially unknown, they are estimated in an iterative procedure using multi-dimensional optimization techniques that are initialized according to the results of an exhaustive search. The error criteria applied reflect how well the IF is performed after the spectral contribution of the glottal flow has been removed. The resulting optimal LF parameter constellation serves as the basis to calculate 11 voice source measures. The performance was evaluated using synthesized signals and recordings of natural utterances. For the synthesized signals, the accuracy to reproduce the original parameters was high (correlations exceeding 0.88) for measures where the starting point of the glottal cycle did not enter explicitly. Errors were smaller compared to conventional estimation methods where the measures were estimated from the IF signal. The analysis of natural utterances indicates that problems still exist with regard to robustness, but that under advantageous conditions the open quotient, the speed quotient, the closing quotient, the parabolic spectral parameter, and the negative peak amplitude of the glottal flow derivative can indeed be determined automatically by the SIM method.  相似文献   

19.
The effects of age, sex, and vocal tract configuration on the glottal excitation signal in speech are only partially understood, yet understanding these effects is important for both recognition and synthesis of speech as well as for medical purposes. In this paper, three acoustic measures related to the voice source are analyzed for five vowels from 3145 CVC utterances spoken by 335 talkers (8-39 years old) from the CID database [Miller et al., Proceedings of ICASSP, 1996, Vol. 2, pp. 849-852]. The measures are: the fundamental frequency (F0), the difference between the "corrected" (denoted by an asterisk) first two spectral harmonic magnitudes, H1* - H2* (related to the open quotient), and the difference between the "corrected" magnitudes of the first spectral harmonic and that of the third formant peak, H1* - A3* (related to source spectral tilt). The correction refers to compensating for the influence of formant frequencies on spectral magnitude estimation. Experimental results show that the three acoustic measures are dependent to varying degrees on age and vowel. Age dependencies are more prominent for male talkers, while vowel dependencies are more prominent for female talkers suggesting a greater vocal tract-source interaction. All talkers show a dependency of F0 on sex and on F3, and of H1* - A3* on vowel type. For low-pitched talkers (F0 < or = 175 Hz), H1* - H2* is positively correlated with F0 while for high-pitched talkers, H1* - H2* is dependent on F1 or vowel height. For high-pitched talkers there were no significant sex dependencies of H1* - H2* and H1* - A3*. The statistical significance of these results is shown.  相似文献   

20.
Tam  s Hacki 《Journal of voice》1996,10(4):342-347
Crescendo phonation (swelltone) was used to evaluate the laryngeal tensioning behavior of seven normal speakers and of 12 dysphonic patients. EGG quasi-open quotient (qOq), stroboscopic open quotient, and vocal sound pressure level (SPL) were measured, and EGG amplitude and the mucosal wave were assessed qualitatively. For normal speakers, the qOq decreased greatly as vocal intensity increased. The same tendency was observed, but to a lesser extent, among hyperfunctional dysphonics. In contrast, qOq increased with vocal intensity among the hypofunctional dysphonics. The crescendo task combined with EGG assessment appears to offer a valid approach to the classification of laryngeal dysfunctions.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号