Similar articles
Found 20 similar articles; search time 31 ms
1.
This study presents an approach to visualizing intensity regulation in speech. The method expresses a voice sample in a two-dimensional space using amplitude-domain values extracted from the glottal flow estimated by inverse filtering. The two-dimensional presentation is obtained by expressing a time-domain measure of the glottal pulse, the amplitude quotient (AQ), as a function of the negative peak amplitude of the flow derivative (d(peak)). The regulation of vocal intensity was analyzed with the proposed method from voices varying from extremely soft to very loud with a SPL range of approximately 55 dB. When vocal intensity was increased, the speech samples first showed a rapidly decreasing trend as expressed on the proposed AQ-d(peak) graph. When intensity was further raised, the location of the samples converged toward a horizontal line, the asymptote of a hypothetical hyperbola. This behavior of the AQ-d(peak) graph indicates that the intensity regulation strategy changes from laryngeal to respiratory mechanisms, and the method makes it possible to quantify how the control mechanisms underlying the regulation of vocal intensity change gradually between the two. The proposed presentation constitutes an easy-to-implement method to visualize the function of voice production in intensity regulation because the only information needed is the glottal flow waveform estimated by inverse filtering the acoustic speech pressure signal.
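As a rough illustration of the two coordinates of the AQ-d(peak) graph, the sketch below computes both from one cycle of a sampled glottal flow. It assumes the commonly used definition AQ = f_ac / d(peak), where f_ac is the peak-to-peak (AC) flow amplitude; the abstract does not spell out this formula, and the function name and the synthetic pulse are hypothetical.

```python
def amplitude_quotient(flow, fs):
    """Return (AQ in seconds, d_peak) for one glottal flow cycle.

    Assumption: AQ = f_ac / d_peak, with f_ac the peak-to-peak flow
    amplitude and d_peak the magnitude of the negative peak of the
    flow derivative (approximated here by a first difference).
    """
    f_ac = max(flow) - min(flow)
    deriv = [(b - a) * fs for a, b in zip(flow, flow[1:])]
    d_peak = -min(deriv)
    if d_peak <= 0:
        raise ValueError("flow never decreases; no negative derivative peak")
    return f_ac / d_peak, d_peak


# Hypothetical synthetic pulse sampled at 10 kHz:
# linear rise over 6 ms, linear fall over 2 ms.
fs = 10000
rise = [i / 60 for i in range(61)]         # 0 -> 1 in 60 steps
fall = [1 - i / 20 for i in range(1, 21)]  # 1 -> 0 in 20 steps
aq, d_peak = amplitude_quotient(rise + fall, fs)
```

For this idealized triangular pulse, AQ comes out equal to the duration of the closing phase (about 2 ms), which is one way to see why AQ is treated as a time-domain measure even though it is built from two amplitude values.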

2.
The effects of prolonged (5 × 45 minute) reading (vocal loading) on fundamental frequency (F0), sound pressure level (SPL), subglottal (intraoral) pressure (p), and two glottal flow waveform parameters (AC amplitude of glottal flow, f, and negative peak amplitude of differentiated flow, d) of normal female and male subjects (N = 80) were studied. Two rest (morning and noon) and three loading (two in the morning and one in the afternoon) samples were recorded and analyzed. The glottal waveforms were obtained by inverse filtering of the acoustic pressure waveforms of speaking voice samples. The analyses were based on measurement and inverse filtering of the first stressed syllable of "paappa" words repeated 3 × 5 times for normal, as soft as possible, and as loud as possible phonation. In normal phonation the parameter values changed statistically significantly due to loading. In many cases the values obtained in the morning samples changed after the first loading session. This is interpreted as a vocal "warming-up effect." Especially in soft phonation, p, d, and f were sensitive indicators of vocal loading. In both normal and soft phonation, the SPL, p, d, and f values tended to rise due to prolonged reading in the morning and afternoon samples, indicating increased effort (normal phonation) and a rise in the phonatory threshold (soft phonation). The lunch break vocal rest ("rest effect") considerably affected the parameter values in many cases.

3.
Noninvasive measures of vocal fold activity are useful for describing normal and disordered voice production. Measures of open and speed quotient from glottal airflow and electroglottographic (EGG) waveforms have been used to describe timing events associated with vocal fold vibration. To date, there has been little consistency in the measurement criteria used to calculate quotient values. In this study, criteria of 20% and 50% were applied to the AC amplitude of glottal airflow and inverted EGG waveforms for measurement of open quotient. Criteria of 20%, 50%, and 80%, and a midslope criterion that segmented the waveform between 20% and 80% of the waveform amplitude, were used for the calculation of speed quotient. Subjects produced waveforms at sound pressure levels (SPL) of 70, 75, 80, and 85 dB. Results indicated that approximations of open quotient obtained from the glottal airflow waveform significantly decreased using both the 20% and 50% criteria as SPL increased from 80 to 85 dB. No significant changes were found in open quotient from the EGG waveform as a function of SPL. Results of speed quotient measures from the glottal airflow and EGG waveforms showed a generally increasing trend as SPL increased, although the differences were not statistically significant. The data suggest that the signal type, measurement criterion, and SPL must be considered in interpreting quotient measures.
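To make the role of the measurement criterion concrete, the following minimal sketch estimates open quotient from a single waveform cycle at a chosen amplitude criterion. The function name, the sample-counting approach, and the synthetic cycle are assumptions for illustration only, not the procedure used in the study.

```python
def open_quotient(cycle, criterion):
    """Fraction of the cycle during which the waveform exceeds a given
    fraction (for example 0.2 or 0.5) of its AC amplitude above the minimum."""
    lo, hi = min(cycle), max(cycle)
    threshold = lo + criterion * (hi - lo)
    above = sum(1 for x in cycle if x > threshold)
    return above / len(cycle)


# Hypothetical 100-sample cycle: a symmetric triangular pulse occupying
# the first 60 samples, followed by a 40-sample closed phase.
cycle = [min(i, 60 - i) / 30 for i in range(60)] + [0.0] * 40
oq20 = open_quotient(cycle, 0.2)
oq50 = open_quotient(cycle, 0.5)
```

The same cycle yields a smaller open quotient at the 50% criterion than at the 20% criterion, which is why values obtained with different criteria are not directly comparable.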

4.
Vocal intensity is studied as a function of fundamental frequency and lung pressure. A combination of analytical and empirical models is used to predict sound pressure levels from glottal waveforms of five professional tenors and twenty-five normal control subjects. The glottal waveforms were obtained by inverse filtering the mouth flow. Empirical models describe features of the glottal flow waveform (peak flow, peak flow derivative, open quotient, and speed quotient) in terms of lung pressure and phonation threshold pressure, a key variable that incorporates the F0 dependence of many of the features of the glottal flow. The analytical model describes the contributions of the vocal tract to sound pressure level (SPL). Results show that SPL increases with F0 at a rate of 8-9 dB/octave provided that lung pressure is raised in proportion to phonation threshold pressure. The SPL also increases at a rate of 8-9 dB per doubling of excess pressure over threshold, a new quantity that assumes considerable importance in vocal intensity calculations. For the same excess pressure over threshold, the professional tenors produced 10-12 dB greater intensity than the male nonsingers, primarily because their peak airflow was much higher for the same pressure. A simple set of rules is devised for predicting SPL from source waveforms.
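The "dB per doubling of excess pressure" rule can be written as a one-line calculation. The sketch below is only an illustration of that rule: the default rate of 8.5 dB is an assumed midpoint of the reported 8-9 dB range, and the function name and pressure values are hypothetical.

```python
import math


def spl_change_db(p_lung, p_lung_ref, p_threshold, db_per_doubling=8.5):
    """Predicted SPL change when lung pressure moves from p_lung_ref to
    p_lung, assuming a fixed gain per doubling of excess pressure
    (lung pressure minus phonation threshold pressure)."""
    excess = p_lung - p_threshold
    excess_ref = p_lung_ref - p_threshold
    if excess <= 0 or excess_ref <= 0:
        raise ValueError("lung pressure must exceed phonation threshold pressure")
    return db_per_doubling * math.log2(excess / excess_ref)


# Hypothetical pressures in kPa: threshold 0.25, reference 0.75, target 1.25.
# Excess pressure doubles from 0.5 to 1.0 kPa.
delta = spl_change_db(1.25, 0.75, 0.25)
```

Here the excess pressure doubles, so the predicted change equals the assumed per-doubling rate; note that the lung pressure itself increases by less than a factor of two.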

5.
A combined-modality treatment program consisting of botulinum toxin injection (Botox) and voice therapy was used to treat 17 subjects diagnosed with adductor spasmodic dysphonia (ADD SD). Ten subjects with ADD SD served as the control group and were given Botox only. Voice therapy after Botox injection was directed toward reducing the hyperfunctional vocal behaviors, primarily glottal overpressure at voice onset and anterior-posterior squeezing. The results indicated that subjects who underwent combined-modality treatment maintained significantly higher mean airflow rates for significantly longer periods. Moreover, there was a carryover effect in these patients when they received Botox only. Adductor spasmodic dysphonia is treated most effectively when intrinsic laryngeal muscle spasms are reduced or eliminated by Botox injection and extrinsic hyperfunctional vocal behaviors are treated with voice therapy.

6.
This study was aimed at identifying acoustic and physiological measures useful for monitoring voice changes in postnasopharyngeal patients with nonlaryngeal malignancies, and at providing evidence of the vocal tract's effect on voice through comparisons between individuals with and without an intact vocal tract. Simultaneous acoustic-electroglottographic signals recorded during phonation of vowels /i/ and /a/ sustained at habitual, high, and low pitch levels were compared among 10 postradiotherapy patients with nasopharyngeal carcinoma (NPC), 10 voice patients (VPs) with intact vocal tract, and 10 healthy individuals with normal voice (NORM). Results from a series of discriminant analyses revealed that the NPC group generally exhibited lower signal-to-noise ratio (SNR) and open quotient (OQ) and higher Formant 1 frequency (F(1)) and speed quotient (SQ) than the NORM group. Unlike both VP and NORM groups, the NPC group failed to show a pitch effect on all voice measures, including OQ, SQ, percent jitter, percent shimmer, and SNR, suggesting an effect of radiotherapy and/or vocal tract on laryngeal behaviors. For the vowel /i/, on the other hand, only the NPC and NORM groups showed a pattern of pitch-dependent F(1) raising, a reflection of increased pharyngeal narrowing. These findings suggested that the pitch effect on laryngeal behaviors differed not only between individuals with intact vocal tract and those without but also between those with structural and dynamic changes of vocal tract.

7.
《Journal of voice》2020,34(3):485.e33-485.e43
Purpose: The present study aimed at measuring the smoothed and non-smoothed cepstral peak prominence (CPPS and CPP) in teachers who considered themselves to have normal voice, although some of them had laryngeal pathology. The changes of CPP, CPPS, sound pressure level (SPL), and perceptual ratings with different voice tasks were investigated, and the influence of vocal pathology on these measures was studied. Method: Eighty-four Finnish female primary school teachers volunteered as participants. Laryngoscopically, 52.4% of these had laryngeal changes (39.3% mild, 13.1% disordered). Sound recordings were made for phonations of comfortable sustained vowel, comfortable speech, and speech produced at increased loudness level as used during teaching. CPP, CPPS, and SPL values were extracted using Praat software for all three voice samples. Sound samples were also perceptually evaluated by five voice experts for overall voice quality (10-point scale from poor to excellent) and vocal firmness (10-point scale from breathy to pressed, with normal in the middle). Results: The CPP, CPPS, and SPL values were significantly higher for vowels than for comfortable speech and for loud speech compared to comfortable speech (P < 0.001). Significant correlations were found between SPL and cepstral measures. The loud speech was perceived to be firmer and to have a better voice quality than comfortable speech. No significant relationships of the laryngeal pathology status with cepstral values, perceptual ratings, or voice SPLs were found (P > 0.05). Conclusion: Neither the acoustic measures (CPP, CPPS, and SPL) nor the perceptual evaluations could clearly distinguish teachers with laryngeal changes from laryngeally healthy teachers. Considering that the subjects had no vocal complaints, the data could be considered representative of teachers with functionally healthy voice.

8.
The present investigation was designed to examine the effect of change in vocal fold mass and stiffness on vocal fold vibration. To do this, the effect of variation in superior laryngeal nerve stimulation (SLNS) and recurrent laryngeal nerve stimulation (RLNS) was studied. Photoglottography (PGG), electroglottography (EGG), and subglottic pressure (Psub) were measured in seven mongrel dogs using an in vivo canine model of phonation. The PGG, EGG, and Psub signals were examined at three frequencies (100, 130, and 160 Hz) for SLNS and RLNS, using a constant rate of air flow. Increasing SLNS, which caused a contraction of the cricothyroid muscle, produced a marked increase in F0, little change in Psub, an increase in open quotient (OQ), and a decrease in the closed quotient (CQ) of the glottal cycle. Increasing RLNS, which caused activation of the intrinsic laryngeal muscles, produced a modest increase in F0, a marked increase in Psub, no change in the OQ, and an increase in CQ. Phase quotient (Qp), which describes the interval between opening of the lower and upper fold margins, decreased with increasing RLNS and did not change significantly with increasing SLNS. Based upon changes in F0, Psub, OQ, CQ, and Qp, SLNS provides a physiologic correlate of the tension parameter Q, and RLNS provides a physiologic correlate of the parameter Psub in the Ishizaka and Flanagan two-mass model.

9.
A single female professional vocal artist and pedagogue sang examples of “twang” and neutral voice quality, which a panel of experts classified, in almost complete agreement with the singer's intentions. Subglottal pressure was measured as the oral pressure during the /p/ occlusion of the syllable /pae/. This pressure tended to be higher in “twang,” whereas the sound pressure level (SPL) was invariably higher. Voice source properties and formant frequencies were analyzed by inverse filtering. In “twang,” as compared with neutral, the closed quotient was greater, the pulse amplitude and the fundamental were weaker, and the normalized amplitude tended to be lower, whereas formants 1 and 2 were higher and 3 and 5 were lower. The formant differences, which appeared to be the main cause of the SPL differences, were more important than the source differences for the perception of “twanginess.” As resonatory effects occur independently of the voice source, the formant frequencies in “twang” may reflect a vocal strategy that is advantageous from the point of view of vocal hygiene.

10.
This study was primarily motivated by the need to establish the correspondence between auditory abilities and laryngeal function. Just noticeable differences (JNDs) were obtained for the open quotient and speed quotient of the glottal flow waveform. The quotients were synthesized for both the glottal flow alone, and for the output pressure signal after the glottal flow signal was applied to the synthesis vocal tract for the vowel /a/. Six adult men and five adult women, all teachers of singing, participated as listeners. An adaptive auditory listening procedure was used to estimate JNDs for the four types of stimuli. The group average JND values were as follows. For the standard open quotient value of 0.6000, JND = 0.0264 (SD = 0.010) for the glottal flow and JND = 0.0344 (SD = 0.020) for the output pressure. For the open quotient, there was no statistically significant difference between genders or between the types of signals. For the standard speed quotient value of 2.000, JND = 0.154 (SD = 0.043) for the glottal flow and JND = 0.319 (SD = 0.167) for the output pressure. For the speed quotient, there was no statistically significant difference between genders, but the difference between types of stimulus (glottal flow versus output pressure) was significant (p < 0.006). The variance among the JND values was significantly larger for the output pressure stimuli compared to the glottal flow stimuli for both the open quotient and the speed quotient.

11.
Tamás Hacki 《Journal of voice》1996,10(4):342-347
Crescendo phonation (swelltone) was used to evaluate the laryngeal tensioning behavior of seven normal speakers and of 12 dysphonic patients. EGG quasi-open quotient (qOq), stroboscopic open quotient, and vocal sound pressure level (SPL) were measured, and EGG amplitude and the mucosal wave were assessed qualitatively. For normal speakers, the qOq decreased greatly as vocal intensity increased. The same tendency was observed, but to a lesser extent, among hyperfunctional dysphonics. In contrast, qOq increased with vocal intensity among the hypofunctional dysphonics. The crescendo task combined with EGG assessment appears to offer a valid approach to the classification of laryngeal dysfunctions.

12.
The abduction quotient, a measure of effective glottal width, was obtained for electroglottographic recordings from a professional operatic baritone singer. The subject produced repeated tokens of the voice qualities breathy, normal, and pressed (or constricted) in both a speech and a singing manner. In the singing manner, the subject produced the three vocal qualities at three pitch levels and three loudness levels. The abduction quotient decreased from breathy to pressed voice, suggesting that the measure corresponds to effective glottal width. The measure was found to be consistently low during all conditions of singing, suggesting that the subject produced all singing tokens with relatively strong laryngeal adduction at the vocal process level. Although the results of this study support the validity and usefulness of the abduction quotient, further verification is needed.

13.
《Journal of voice》2023,37(2):298.e11-298.e29
Introduction: Typical singing registers are the chest and falsetto; however, trained singers have an additional register, namely, the mixed register. The mixed register, which is also called “mixed voice” or “mix,” is an important technique for singers, as it can help bridge from the chest voice to falsetto without noticeable voice breaks. Objective: The present study aims to reveal the nature of the voice-production mechanism of the different registers (chest, mix, and falsetto) using high-speed digital imaging (HSDI), electroglottography (EGG), and acoustic and aerodynamic measurements. Study Design: Cross-sectional study. Methods: Aerodynamic measurements were acquired for twelve healthy singers (six men and six women) during the phonation of a variety of pitches using three registers. HSDI and EGG devices were simultaneously used on three healthy singers (two men and one woman), from which an open quotient (OQ) and speed quotient (SQ) were detected. Audio signals were recorded for five sustained vowels, and a spectral analysis was conducted to determine the amplitude of each harmonic component. Furthermore, the absolute (not relative) value of the glottal volume flow was estimated by integrating data obtained from the HSDI and aerodynamic studies. Results: For all singers, the subglottal pressure (PSub) was the highest for the chest among the three registers, and the mean flow rate (MFR) was the highest for the falsetto. Conversely, the PSub of the mix was as low as that of the falsetto, and the MFR of the mix was as low as that of the chest. The HSDI analysis showed that the OQ differed significantly among the registers, even when the fundamental frequency was the same; the OQ of the mix was higher than that of the chest but lower than that of the falsetto. The acoustic analysis showed that, for the mix, the harmonic structure was intermediate between the chest and falsetto. The results of the glottal volume-flow analysis revealed that the maximum volume velocity was the least for the mix register at every fundamental frequency. The first and second harmonic (H1-H2) difference of the voice source spectrum was the greatest for the falsetto, then the mix, and finally, the chest. Conclusions: We found differences in the registers in terms of the aeromechanical mechanisms and vibration patterns of the vocal folds. The mixed register proved to have a distinct voice-production mechanism, which can be differentiated from those of the chest or falsetto registers.

14.
Correlations among parameters indicating air usage during phonation were investigated in 60 normal subjects and 1,545 voice patients. The parameters examined were maximum phonation time (MPT), mean air flow rate for maximum sustained phonation (MFRm), mean air flow rate for comfortable phonation (MFRc), and phonation quotient (PQ). In normal subjects, correlations among MPT, MFRm, and PQ were high, but those between MFRc and the others were moderate. In cases of paralysis and hypofunctional dysphonia, all correlations between the four parameters were high. In the cases of polyp, nodule, epithelial hyperplasia, glottic carcinoma, and hyperfunctional dysphonia, the correlation between MPT and MFRc was moderate or not significant. The low correlation between these parameters was associated with the variations in flow rate differences between maximum and comfortable phonations. The results suggest that measurement of all four parameters is not necessary in routine tests and that MPT and MFRc should be measured in most voice patients.

15.
Vocal fold vibratory asymmetry is often associated with inefficient sound production through its impact on source spectral tilt. This association is investigated in both a computational voice production model and a group of 47 human subjects. The model provides indirect control over the degree of left-right phase asymmetry within a nonlinear source-filter framework, and high-speed videoendoscopy provides in vivo measures of vocal fold vibratory asymmetry. Source spectral tilt measures are estimated from the inverse-filtered spectrum of the simulated and recorded radiated acoustic pressure. As expected, model simulations indicate that increasing left-right phase asymmetry induces steeper spectral tilt. Subject data, however, reveal that none of the vibratory asymmetry measures correlates with spectral tilt measures. Probing further into physiological correlates of spectral tilt that might be affected by asymmetry, the glottal area waveform is parameterized to obtain measures of the open phase (open/plateau quotient) and closing phase (speed/closing quotient). Subjects' left-right phase asymmetry exhibits low, but statistically significant, correlations with speed quotient (r=0.45) and closing quotient (r=-0.39). Results call for future studies into the effect of asymmetric vocal fold vibration on glottal airflow and the associated impact on voice source spectral properties and vocal efficiency.

16.
An important clinical issue concerns the efficacy of current voice therapy approaches in treating voice disorders, such as vocal nodules. Much research focuses on finding reliable methods for documentation of treatment results. In this second treatment study of ten patients with vocal nodules, who participated in a behaviorally based voice therapy program, 11 aerodynamic (transglottal air pressure and glottal waveform) and acoustic (SPL, F0, and spectrum slope) measures were used. Three pretherapy baseline assessments were carried out, followed by one assessment after each of five therapy phases. Measurements were made of two types of speech materials, strings of repeated /pae/ syllables and sustained /ae/ phonations, in two loudness conditions: comfortable loudness and loud voice. The data were normalized using z-scores, which were based on data from 22 normal subjects. The results showed that the aerodynamic measures reflected the presence of vocal pathology to a higher degree than did the acoustic spectral measures, and they should be useful in studies comparing nodule and normal voice production. Large individual session-to-session variation was found for all measures across pretherapy baseline recordings, which contributed to nonsignificant differences between baseline and therapy data.
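The z-score normalization against a normal reference group can be sketched as follows. This is a minimal illustration, assuming the sample standard deviation of the reference group is used (the abstract does not say which estimator was applied); the function name and the numbers are hypothetical.

```python
import statistics


def z_scores(patient_values, normal_values):
    """Express patient measurements in units of the normal group's
    standard deviation, relative to the normal group's mean.

    Assumption: the sample (n - 1) standard deviation is used.
    """
    mean = statistics.mean(normal_values)
    sd = statistics.stdev(normal_values)
    return [(v - mean) / sd for v in patient_values]


# Hypothetical values for one measure (arbitrary units).
normal_group = [4.0, 5.0, 6.0]  # mean 5.0, sample SD 1.0
patient_sessions = [7.0, 5.0]
normalized = z_scores(patient_sessions, normal_group)
```

Normalizing each measure this way puts pressures, flows, and spectral measures on a common scale, so a value of 2.0 means "two normal-group standard deviations above the normal mean" regardless of the original unit.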

17.
The aim of the study was to identify the acoustic correlates of female teachers' subjective voice complaints by recording their voices in their working environment. The subjects made recordings during lessons (N = 10) and breaks (N = 11). The subjects were divided into 2 groups: those with few voice complaints (FC group) and those with many voice complaints (MC group). The speech sample made in the breaks was maximally sustained /a/, from which fundamental frequency (F0), jitter, and shimmer were analyzed. The classroom samples were analyzed for F0, sound pressure level (SPL), and F0 time (the active vibration time of the vocal folds). Additionally, an index for assessing voice loading is presented. The results revealed a tendency of the MC group to have higher F0 and lower SPL and perturbation values than the FC group. The index values correlated moderately with the subjective vocal complaints.

18.
Posterior closure insufficiency of the glottis is often mentioned in connection with permanent voice disorders. Recently published studies have revealed that an incomplete closure of the glottis can be found also in normal-speaking voices, especially in women. However, the effect of glottal closure configuration on vocal efficacy is not sufficiently clarified. The purpose of this study was to determine the effect of glottal closure configuration on singing and speaking voice characteristics. Overall, 520 young female normal-speaking subjects were examined by videostroboscopy for different phonation conditions in the combination of soft, loud, low, and/or high phonation and by voice range profile measurements. According to the videostroboscopic analysis, the subjects were subdivided into four groups: complete closure of the vocal folds already in soft phonation (group 1), closure of the vocal fold with increasing intensity (group 2), persistent closure insufficiencies despite increasing intensity (group 3), and hourglass-shaped closure in subjects with vocal nodules (group 4). Subjects in which the glottal closure could not be evaluated sufficiently were subclassified into group 5 (missing values).

Selected criteria of the singing and speaking voice were evaluated and statistically processed according to the mentioned subclassification. Group 1 reached significantly the highest sound pressure levels (SPLmax) for the singing voice as well as for the shouting voice. Group 3 showed a limited capacity to increase the intensity of the singing and speaking voice. The results gathered in this study objectify the relationship of insufficient glottal closure and reduced vocal capabilities. As long as no conclusive data on long-term consequences of insufficient glottal closure are available, a prophylactic improvement of the laryngeal situation especially in female professional voice users by voice therapy should be recommended.


19.
This study investigated the relation of symptoms of vocal fatigue to acoustic variables reflecting type of voice production and the effects of vocal loading. Seventy-nine female primary school teachers volunteered as subjects. Before and after a working day, (1) a 1-minute text reading sample was recorded at habitual loudness and loudly (as in large classroom), (2) a prolonged phonation on [a:] was recorded at habitual speaking pitch and loudness, and (3) a questionnaire about voice quality, ease, or difficulty of phonation and tiredness of throat was completed. The samples were analyzed for average fundamental frequency (F0), sound pressure level (SPL), and phonation type reflecting alpha ratio (SPL [1-5 kHz]-SPL [50 Hz-1 kHz]). The vowel samples were additionally analyzed for perturbation (jitter and shimmer). After a working day, F0, SPL, and alpha ratio were higher, jitter and shimmer values were lower, and more tiredness of throat was reported. The average levels of the acoustic parameters did not correlate with the symptoms. Increase in jitter and mean F0 in loud reading correlated with tiredness of throat. The results seem to suggest that, at least among experienced vocal professionals, voice production type had little relevance from the point of view of vocal fatigue reported. Differences in the acoustic parameters after a vocally loading working day mainly seem to reflect increased muscle activity as a consequence of vocal loading.
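The alpha ratio defined above (level in the 1-5 kHz band minus level in the 50 Hz-1 kHz band) can be illustrated with a small, deliberately naive sketch. It uses a direct DFT without windowing or averaging, so it is only practical for short test signals; the function names and the two-tone test signal are hypothetical, and real analyses would normally use a long-term average spectrum.

```python
import cmath
import math


def band_level_db(signal, fs, f_lo, f_hi):
    """Relative level (dB) of the signal energy in [f_lo, f_hi),
    computed with a direct DFT over the positive-frequency bins."""
    n = len(signal)
    power = 0.0
    for k in range(1, n // 2):
        freq = k * fs / n
        if f_lo <= freq < f_hi:
            coeff = sum(x * cmath.exp(-2j * math.pi * k * i / n)
                        for i, x in enumerate(signal))
            power += abs(coeff) ** 2
    return 10 * math.log10(power) if power > 0 else float("-inf")


def alpha_ratio_db(signal, fs):
    """Alpha ratio: level in 1-5 kHz minus level in 50 Hz-1 kHz."""
    return (band_level_db(signal, fs, 1000, 5000)
            - band_level_db(signal, fs, 50, 1000))


# Hypothetical test signal: a 200 Hz tone plus a 2 kHz tone at one tenth
# of its amplitude, 240 samples at 12 kHz (50 Hz bin spacing).
fs = 12000
signal = [math.sin(2 * math.pi * 200 * i / fs)
          + 0.1 * math.sin(2 * math.pi * 2000 * i / fs)
          for i in range(240)]
alpha = alpha_ratio_db(signal, fs)
```

Because the high-band tone has one tenth the amplitude of the low-band tone, the alpha ratio of this test signal is about -20 dB. A less steeply falling spectrum, as expected with louder or more pressed phonation, moves the value toward zero.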

20.
Seventeen healthy women, 45 to 61 years old, were examined using videofiberstroboscopy during phonation at three loudness levels. Two phoniatricians evaluated glottal closure using category and ratio scales. Transglottal airflow was studied by inverse filtering of the oral airflow signal recorded in a flow mask (Glottal Enterprises System) during the spoken phrase /ba:pa:pa:pa:p/ at three loudness levels. Subglottal pressure was estimated from the intraoral pressure during the /p/ occlusion. Running speech and the repeated /pa:/ syllables were perceptually evaluated by three speech pathologists regarding breathiness, hypofunction, and hyperfunction, using continuous scales. Incomplete glottal closure was found in 35 of 46 phonations (76%). The degree of glottal closure increased significantly with raised loudness. Half of the women closed the glottis completely during loud phonation. Posterior glottal chink (PGC) was the most common gap configuration and was found in 28 of 46 phonations (61%). One third of the PGCs were in the cartilaginous glottis (PGCc) only. Two thirds extended into the membranous portion (PGCm); most of these occurred during soft phonation. Peak flow, peak-to-peak (AC) flow, and the maximum rate of change for the flow in the closing phase increased significantly with raised loudness. Minimum flow decreased significantly from normal to loud voice. Breathiness decreased with increased loudness. The results suggest that the incomplete closure patterns PGCc and PGCm during soft phonation ought primarily to be regarded as normal for Swedish women in this age group.
