Similar literature
 Found 20 similar documents; search time: 15 ms
1.
Fundamental frequency (F0) and voice onset time (VOT) were measured in utterances containing voiceless aspirated [pʰ, tʰ, kʰ], voiceless unaspirated [sp, st, sk], and voiced [b, d, g] stop consonants produced in the context of [i, e, u, o, a] by 8- to 9-year-old subjects. The results revealed that VOT reliably differentiated voiceless aspirated from voiceless unaspirated and voiced stops, whereas F0 significantly contrasted voiced with voiceless aspirated and unaspirated stops, except for the first glottal period, where voiceless unaspirated stops contrasted with the other two categories. Fundamental frequency consistently differentiated vowel height in alveolar and velar stop consonant environments only. In comparing the results of these children and of adults, it was observed that the acoustic correlates of stop consonant voicing and vowel quality were different not only in absolute values, but also in terms of variability. Further analyses suggested that children were more variable in production due to inconsistency in achieving specific targets. The findings also suggest that, of the acoustic correlates of the voicing feature, the primary distinction of VOT is strongly developed by 8-9 years of age, whereas the secondary distinction of F0 is still in an emerging state.

2.
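An acoustic analysis was performed on glottal stop compensation for alveolar stops in cleft palate speech. Multi-order statistics of the spectral distribution, the spectral moments, were computed, and the compensatory stops were compared with normal stops. The results show that the first spectral moment of the glottal stop burst segment, i.e. the frequency location of the spectral centroid, is lower than for normal stops: because the occlusion of a glottal stop is at the glottis, the vocal tract cavity is longer and its resonance frequencies are lower. The second spectral moment of glottal stops, i.e. the standard deviation, was also observed to be higher, indicating that their spectral energy is more dispersed than that of normal stops. The third spectral moment of glottal stops, i.e. the skewness, is mostly positive, reflecting the asymmetry of the glottal stop power spectrum, with its bulk toward the low-frequency region and its long tail toward the high-frequency region. A logistic regression model was used to classify the samples, with the optimal four spectral moments selected by cross-validation as the model's independent variables; the classification accuracy was 89.7%. Combined with automatic detection of the stop burst instant, objective identification of glottal stops in the syllable /di/ was achieved.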
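
The spectral moments named in this abstract can be illustrated with a minimal sketch. It treats a power spectrum as a probability distribution over frequency and returns the centroid, standard deviation, skewness, and kurtosis. The function name `spectral_moments` and the toy spectra are assumptions made for illustration; the abstract does not describe the authors' windowing, frequency range, or normalization, so this is not their implementation.

```python
import math


def spectral_moments(freqs, power):
    """Return (centroid, std, skewness, kurtosis) of a power spectrum.

    The spectrum is normalized to sum to 1 and treated as a distribution
    over frequency (an illustrative convention, not taken from the paper).
    """
    total = sum(power)
    if total <= 0:
        raise ValueError("power spectrum must have positive total energy")
    weights = [v / total for v in power]

    # First moment: spectral centroid.
    centroid = sum(f * w for f, w in zip(freqs, weights))
    # Second central moment: variance, reported as standard deviation.
    variance = sum((f - centroid) ** 2 * w for f, w in zip(freqs, weights))
    std = math.sqrt(variance)
    if std == 0:
        raise ValueError("spectrum has zero spread; higher moments undefined")
    # Third and fourth standardized moments: skewness and kurtosis.
    skewness = sum((f - centroid) ** 3 * w for f, w in zip(freqs, weights)) / std ** 3
    kurtosis = sum((f - centroid) ** 4 * w for f, w in zip(freqs, weights)) / std ** 4
    return centroid, std, skewness, kurtosis


# Toy spectrum with most energy at low frequencies and a tail toward
# high frequencies, which yields a positive skewness.
low_heavy = spectral_moments([1000, 2000, 3000, 4000], [4, 3, 2, 1])
# Toy symmetric spectrum, which yields zero skewness.
symmetric = spectral_moments([1000, 2000, 3000], [1, 2, 1])
```

In this sketch a spectrum whose energy is concentrated at low frequencies gives a positive skewness, which matches the direction of the asymmetry the abstract reports for glottal stops.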

3.

Objectives/Hypotheses

Singers learn to produce well-controlled tone onsets by accurate synchronization of glottal adduction and buildup of subglottal pressure. Spectrographic analyses have shown that the higher spectrum partials are present also at the vowel onset in classically trained singers’ performances. Such partials are produced by a sharp discontinuity in the waveform of the transglottal airflow, presumably produced by vocal fold collision.

Study Design

After hearing a prompt series of a triad pattern, six singer subjects sang the same triad pattern on the vowel /i/ (1) preceded by an aspirated /p/, (2) preceded by an unaspirated /p/, and (3) without any preceding consonant in staccato.

Methods

Using high-speed imaging we examined the initiation of vocal fold vibration in aspirated and unaspirated productions of the consonant /p/ as well as in the staccato tones.

Results

The number of vibrations failing to produce vocal fold collision was significantly higher in the aspirated /p/ than in the unaspirated /p/ and in the staccato tones. High-frequency ripple in the audio waveform was significantly delayed in the aspirated /p/.

Conclusions

Initiation of vocal fold collision and the appearance of high-frequency ripple in the vowel /i/ are slightly delayed in aspirated productions of a preceding consonant /p/.

4.
Occurrences of period-doubling are found in human phonation, in particular for pathological and some singing phonations such as Sardinian A Tenore Bassu vocal performance. The combined vibration of the vocal folds and the ventricular folds has been observed during the production of such low pitch bass-type sound. The present study aims to characterize the physiological correlates of this acoustical production and to provide a better understanding of the physical interaction between ventricular fold vibration and vocal fold self-sustained oscillation. The vibratory properties of the vocal folds and the ventricular folds during phonation produced by a professional singer are analyzed by means of acoustical and electroglottographic signals and by synchronized glottal images obtained by high-speed cinematography. The periodic variation in glottal cycle duration and the effect of ventricular fold closing on glottal closing time are demonstrated. Using the detected glottal and ventricular areas, the aerodynamic behavior of the laryngeal system is simulated using a simplified physical model previously validated in vitro using a larynx replica. An estimate of the ventricular aperture extracted from the in vivo data allows a theoretical prediction of the glottal aperture. The in vivo measurements of the glottal aperture are then compared to the simulated estimations.

5.
Laryngeal framework surgery can change the position and tension of the vocal folds safely without direct surgical intervention in the vocal fold proper. Some 23 years of experience with phonosurgery have proved its usefulness in treating dysphonia related to unilateral vocal fold paralysis, vocal fold atrophy, and pitch-related dysphonias. Meanwhile, much information about the mechanism of voice production has been obtained through intraoperative findings of voice and fiberscopic examination of the larynx. Based on such knowledge together with information obtained through model experiments, the human vocal organ was reconsidered mainly from the mechanical viewpoint, and the roles of voice therapy and singing pedagogy were discussed in relation to phonosurgery. The vocal organ may not be an ideal musical organ and is rather vulnerable, but its potential is enormous.

6.
7.
Journal of Voice, 2023, 37(3): 444-451

Objective

A single injection of basic fibroblast growth factor (bFGF) into the vocal folds of patients with glottal insufficiency has been shown to be effective for a few years. However, the long-term therapeutic effect of a single injection of bFGF into the vocal folds has yet to be demonstrated. In this study, the therapeutic effect of a single injection of bFGF into the vocal folds was investigated over several years by monitoring patients for 36 months following this treatment.

Methods

Nineteen patients with glottal insufficiency received injections of bFGF diluted to 20 μg/mL in the superficial layer of the lamina propria of the bilateral vocal folds. The following parameters were evaluated at preinjection baseline and 6, 12, 18, 24, and 36 months later, and statistical comparisons were performed. The parameters evaluated were: the Grade, Rough, Breathy, Asthenic, and Strained (GRBAS) scale score; maximum phonation time; acoustic analysis; and glottal wave analysis (GWA) and kymograph edge analysis (KEA) using high-speed digital imaging (HSDI). The amplitude perturbation quotient (APQ) and period perturbation quotient (PPQ) were measured by acoustic analysis. The mean minimum glottal area during vocalization and mean minimum distance between the vocal folds were measured by GWA. The amplitudes of the bilateral vocal folds were measured by KEA.

Results

Postinjection, the GRBAS scale score decreased from 6 months after injection, and maximum phonation time was prolonged. The mean minimum glottal area during vocalization and the mean minimum distance between the vocal folds calculated by GWA of HSDI decreased significantly after 6 months. These effects persisted until 36 months postinjection. APQ and PPQ derived from acoustic analysis tended to decrease, but not significantly. There was no clear change in the amplitudes of the bilateral vocal folds calculated by KEA of HSDI before and after injection.

Conclusions

These results suggest that the effects of a single injection of bFGF into the vocal folds persist for 36 months.
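
The perturbation quotients mentioned in this abstract (APQ for amplitude, PPQ for period) are commonly computed as a smoothed cycle-to-cycle deviation relative to the overall mean. The sketch below shows that general form. The function name `perturbation_quotient`, the `window` parameter, and the sample values are assumptions made for illustration; the abstract does not state which smoothing window or software the authors used.

```python
def perturbation_quotient(values, window=5):
    """Return a perturbation quotient in percent.

    `values` is a sequence of per-cycle measurements (periods for a PPQ-style
    measure, peak amplitudes for an APQ-style measure). Each value is compared
    with the mean of the `window` cycles centered on it, and the average
    absolute deviation is divided by the overall mean.
    The window length is an assumption, not taken from the abstract.
    """
    if window < 1 or window % 2 == 0:
        raise ValueError("window must be a positive odd integer")
    n = len(values)
    if n < window:
        raise ValueError("need at least `window` cycles")

    half = window // 2
    deviations = []
    for i in range(half, n - half):
        local_mean = sum(values[i - half:i + half + 1]) / window
        deviations.append(abs(values[i] - local_mean))

    mean_deviation = sum(deviations) / len(deviations)
    overall_mean = sum(values) / n
    return 100.0 * mean_deviation / overall_mean


# Hypothetical glottal periods in milliseconds.
steady = perturbation_quotient([10.0] * 9, window=5)
alternating = perturbation_quotient([10, 11, 10, 11, 10, 11, 10], window=3)
```

A perfectly regular voice gives a quotient of zero in this sketch, and larger cycle-to-cycle irregularity gives a larger percentage.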

8.
Mongolian "throat singing" can be performed in different modes. In Mongolia, the bass-type is called Kargyraa. The voice source in bass-type throat singing was studied in one male singer. The subject alternated between modal voice and the throat singing mode. Vocal fold vibrations were observed with high-speed photography, using a computerized recording system. The spectral characteristics of the sound signal were analyzed. Kymographic image data were compared to the sound signal and flow inverse filtering data from the same singer were obtained on a separate occasion. It was found that the vocal folds vibrated at the same frequency throughout both modes of singing. During throat singing the ventricular folds vibrated with complete but short closures at half the frequency of the true vocal folds, covering every second vocal fold closure. Kymographic data confirmed the findings. The spectrum contained added subharmonics compared to modal voice. In the inverse filtered signal the amplitude of every second airflow pulse was considerably lowered. The ventricular folds appeared to modulate the sound by reducing the glottal flow of every other vocal fold vibratory cycle.

9.
Measurements were made of intraoral air pressure and oral flow of ten native speakers uttering word pairs contrasting Korean fortis and lenis voiceless stop consonants in initial position. The production of fortis stops was found to be characterized by a higher intraoral pressure before release, yet a lower oral flow after release, than corresponding lenis stops. Possible reasons for this difference were explored with the use of a computer-implemented aerodynamic model, giving an output of air pressure and flow. Input parameters were adjusted in accordance with known or hypothesized variations in glottal area function, vocal tract wall tension, respiratory muscle force, and supraglottal cavity volume, as given in the literature. In addition to the previously known differences in glottal area, it is inferred from the results of the modeling experiment that fortis stops are produced with greater vocal tract wall tension than lenis stops. Speaker-specific production strategies such as larynx lowering and heightened subglottal pressure during fortis stops and differences noted between word pairs are also discussed.

10.
Mongolian “throat singing” can be performed in different modes. In Mongolia, the bass-type is called Kargyraa. The voice source in bass-type throat singing was studied in one male singer. The subject alternated between modal voice and the throat singing mode. Vocal fold vibrations were observed with high-speed photography, using a computerized recording system. The spectral characteristics of the sound signal were analyzed. Kymographic image data were compared to the sound signal and flow inverse filtering data from the same singer were obtained on a separate occasion. It was found that the vocal folds vibrated at the same frequency throughout both modes of singing. During throat singing the ventricular folds vibrated with complete but short closures at half the frequency of the true vocal folds, covering every second vocal fold closure. Kymographic data confirmed the findings. The spectrum contained added subharmonics compared to modal voice. In the inverse filtered signal the amplitude of every second airflow pulse was considerably lowered. The ventricular folds appeared to modulate the sound by reducing the glottal flow of every other vocal fold vibratory cycle.

11.
Geometry of the human vocal folds strongly influences their oscillatory motion. While the effect of intraglottal geometry on phonation has been widely investigated, the study of the geometry of the inferior surface of the vocal folds has been limited. In this study the way in which the inferior vocal fold surface angle affects vocal fold vibration was explored using a two-dimensional, self-oscillating finite element vocal fold model. The geometry was parameterized to create models with five different inferior surface angles. Four of the five models exhibited self-sustained oscillations. Comparisons of model motion showed increased vertical displacement and decreased glottal width amplitude with decreasing inferior surface angle. In addition, glottal width and air flow rate waveforms changed as the inferior surface angle was varied. Structural, rather than aerodynamic, effects are shown to be the cause of the changes in model response as the inferior surface angle was varied. Supporting data including glottal pressure distribution, average intraglottal pressure, energy transfer, and flow separation point locations are discussed, and suggestions for future research are given.

12.
This paper presents a Hilbert transform-based approach to analyze vocal fold vibrations in human subjects exhibiting normal and abnormal voice productions. This new approach is applied to the analysis of glottal area waveform (GAW) and is capable of providing useful information on the vocal fold vibration. The GAW is extracted from high-speed laryngeal images by delineating the glottal edge for each image frame. An analytic signal is generated through the Hilbert transform of the GAW, which yields a recognizable pattern of the vocal fold vibration in the analytic phase plane. The vibratory pattern is comprehensive and can be correlated with specific voice conditions. Quantitative measures of the glottal perturbation are introduced using the analytic amplitude and instantaneous frequency obtained from the analysis. Examples of clinical voice recordings are used to evaluate and test the effectiveness of this approach in providing qualitative representation and quantitative characteristics of vocal fold vibratory behavior. The results demonstrate the potential of using this new analytical tool incorporated with the high-speed laryngeal imaging modality for clinical voice assessment.
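
The analytic-signal idea in this abstract can be sketched with the standard library alone. The code below builds an analytic signal by zeroing the negative-frequency half of a discrete Fourier transform, then derives the analytic amplitude and instantaneous frequency. The function names, the naive O(n²) transform, and the synthetic cosine standing in for a glottal area waveform are all assumptions made for illustration; the abstract does not describe the authors' preprocessing or their perturbation measures.

```python
import cmath
import math


def analytic_signal(x):
    """Return the analytic signal of a real sequence via a naive DFT.

    Negative-frequency bins are zeroed and positive-frequency bins doubled,
    so the real part is the input and the imaginary part is its Hilbert
    transform. Suitable only for short illustrative sequences.
    """
    n = len(x)
    spectrum = [
        sum(x[t] * cmath.exp(-2j * math.pi * k * t / n) for t in range(n))
        for k in range(n)
    ]
    for k in range(n):
        if k == 0 or 2 * k == n:
            continue              # keep DC and (for even n) Nyquist unchanged
        if 2 * k < n:
            spectrum[k] *= 2      # positive frequencies are doubled
        else:
            spectrum[k] = 0       # negative frequencies are removed
    return [
        sum(spectrum[k] * cmath.exp(2j * math.pi * k * t / n) for k in range(n)) / n
        for t in range(n)
    ]


def instantaneous_frequency(z, sample_rate):
    """Return instantaneous frequency in Hz from successive phase steps."""
    return [
        cmath.phase(z[t + 1] * z[t].conjugate()) * sample_rate / (2 * math.pi)
        for t in range(len(z) - 1)
    ]


# Synthetic stand-in for a glottal area waveform: a 5 Hz cosine sampled
# at 64 Hz for exactly one second (hypothetical values).
n, fs = 64, 64.0
gaw = [math.cos(2 * math.pi * 5 * t / n) for t in range(n)]
z = analytic_signal(gaw)
amplitude = [abs(v) for v in z]
frequency = instantaneous_frequency(z, fs)
```

For a perfectly periodic input such as this one, the analytic amplitude and instantaneous frequency are constant; cycle-to-cycle fluctuation in either quantity is the kind of information a perturbation measure would summarize.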

13.
The membranous contact quotient (MCQ) is introduced as a measure of dynamic glottal competence. It is defined as the ratio of the membranous contact glottis (the anterior-posterior length of contact between the two membranous vocal folds) and the membranous vocal fold length. An elliptical approximation to the vocal fold contour during phonation was used to predict MCQ values as a function of vocal process gap (adduction), maximum glottal width, and membranous glottal length. MCQ is highly dependent on the vocal process gap and the maximum glottal width, but not on vocal fold length. Five excised larynges were used to obtain MCQ data for a wide range of vocal process gaps and maximum glottal widths. Predicted and measured MCQ values had a correlation of 0.93, with an average absolute difference of 9.6% (SD = 10.5%). The model is better at higher values of MCQ. The theory for MCQ is also expressed as a function of vocal process gap and subglottal pressure to suggest production control potential. The MCQ measure is obtainable with the use of stroboscopy and appears to be a potentially useful clinical measure.

14.
This study was designed to examine the temporal acoustic differences between male trained singers and nonsingers during speaking and singing across voiced and voiceless English stop consonants. Recordings were made of 5 trained singers and 5 nonsingers, and acoustically analyzed for voice onset time (VOT). A mixed analysis of variance showed that the male trained singers had significantly longer mean VOT than did the nonsingers during voiceless stop production. Sung productions of voiceless stops had significantly longer mean VOTs than did the spoken productions. No significant differences were observed for the voiced stops, nor were any interactions observed. These results indicated that vocal training and phonatory task have a significant influence on VOT.

15.
In obstruent consonants, a major constriction in the upper vocal tract yields an increase in intraoral pressure (P(io)). Phonation requires that subglottal pressure (P(sub)) exceed P(io) by a threshold value, so as the transglottal pressure reaches the threshold, phonation will cease. This work investigates how P(io) levels at phonation offset and onset vary before and after different German voiceless obstruents (stop, fricative, affricates, clusters), and with following high vs low vowels. Articulatory contacts, measured using electropalatography, were recorded simultaneously with P(io) to clarify how supraglottal constrictions affect P(io). Effects of consonant type on phonation thresholds could be explained mainly in terms of the magnitude and timing of vocal-fold abduction. Phonation offset occurred at lower values of P(io) before fricative-initial sequences than stop-initial sequences, and onset occurred at higher levels of P(io) following the unaspirated stops of clusters compared to fricatives, affricates, and aspirated stops. The vowel effects were somewhat surprising: High vowels had an inhibitory effect at voicing offset (phonation ceasing at lower values of P(io)) in short-duration consonant sequences, but a facilitating effect on phonation onset that was consistent across consonantal contexts. The vowel influences appear to reflect a combination of vocal-fold characteristics and vocal-tract impedance.

16.
This paper examines an updated version of a lumped mucosal wave model of the vocal fold oscillation during phonation. Threshold values of the subglottal pressure and the mean (DC) glottal airflow for the oscillation onset are determined. Depending on the nonlinear characteristics of the model, an oscillation hysteresis phenomenon may occur, with different values for the oscillation onset and offset threshold. The threshold values depend on the oscillation frequency, but the occurrence of the hysteresis is independent of it. The results are tested against pressure data collected from a mechanical replica of the vocal folds, and oral airflow data collected from speakers producing intervocalic /h/. In the human speech data, observed differences between voice onset and offset may be attributed to variations in voice pitch, with a very small or nonexistent hysteresis phenomenon.

17.
18.
Negative damping and eigenmode synchronization as two different mechanisms of phonation onset are distinguished. Although both mechanisms lead to a favorable phase relationship between the flow pressure and the vocal fold motion as required for a net energy transfer into the vocal folds, the underlying mechanisms for this favorable phase relationship are different. The negative damping mechanism relies on glottal aerodynamics or acoustics to establish before onset and maintain after onset the favorable phase relationship, and therefore has minimum requirements on vocal fold geometry and biomechanics. A single degree-of-freedom vocal fold model is all that is needed for self-oscillation in the presence of a negative damping mechanism. In contrast, the mechanism of eigenmode synchronization critically depends on the geometrical and biomechanical properties of the vocal folds (at least 2-degrees-of-freedom are required), and has little requirement on the glottal aerodynamics other than flow separation. The favorable phase relation is established once synchronization occurs, regardless of the phase relationship imposed by glottal aerodynamics before onset. Unlike that of the negative damping mechanism, initiation of eigenmode synchronization requires neither a velocity-dependent flow pressure nor an alternating convergent-divergent glottis. The clinical implications of the distinctions between these two mechanisms are discussed.

19.
Glottal adduction is a primary laryngeal variable that helps to determine glottal configuration and phonatory output. Greater adduction of the vocal folds can be produced by narrowing the gap between the vocal processes or by bulging the medial surface of the vocal folds. This study examined phonatory effects due to changing the degree of bulging using a computational model. Bulging was modeled as a quadratic surface and was related to active muscle stress. Results indicated that bulging had a significant effect on glottal flow resistance, maximum glottal width and area, and mean glottal volume velocity. The results are discussed relative to clinical issues of hyperfunction.

20.
High-speed filming is one of the most informative methods for assessing voice physiology data. Tracing high-speed images of the glottis provides quantitative parameters such as the glottal area and the glottal width function. By way of example, a number of studies are discussed which extract quantitative data from high-speed images showing voice onsets. Furthermore, a new computer system (MVAS; multi-dimensional voice analysis system) is presented that synchronously displays a laryngoscopic high-speed film, the electroglottographical signal, and several acoustic analyses of the recorded voice sample. The automatic measurement of glottal width and glottal area from the laryngoscopic images is also provided. Looking at former studies and our analyses of voice onsets reveals a tremendous intersubject and even intrasubject variability (different prephonatory closure, different time span until full amplitude is reached, different open quotient).
