首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 15 毫秒
1.
Mongolian "throat singing" can be performed in different modes. In Mongolia, the bass-type is called Kargyraa. The voice source in bass-type throat singing was studied in one male singer. The subject alternated between modal voice and the throat singing mode. Vocal fold vibrations were observed with high-speed photography, using a computerized recording system. The spectral characteristics of the sound signal were analyzed. Kymographic image data were compared to the sound signal and flow inverse filtering data from the same singer were obtained on a separate occasion. It was found that the vocal folds vibrated at the same frequency throughout both modes of singing. During throat singing the ventricular folds vibrated with complete but short closures at half the frequency of the true vocal folds, covering every second vocal fold closure. Kymographic data confirmed the findings. The spectrum contained added subharmonics compared to modal voice. In the inverse filtered signal the amplitude of every second airflow pulse was considerably lowered. The ventricular folds appeared to modulate the sound by reducing the glottal flow of every other vocal fold vibratory cycle.  相似文献   

2.
3.
The purpose of this study is to explore the possibility for physically based mathematical models of the voice source to accurately reproduce inverse filtered glottal volume-velocity waveforms. A low-dimensional, self-oscillating model of the glottal source with waveform-matching properties is proposed. The model relies on a lumped mechano-aerodynamic scheme loosely inspired by the one- and multimass lumped models. The vocal folds are represented by a single mechanical resonator and a propagation line which takes into account the vertical phase differences. The vocal-fold displacement is coupled to the glottal flow by means of an aerodynamic driving block which includes a general parametric nonlinear component. The principal characteristics of the flow-induced oscillations are retained, and the overall model is able to match inverse-filtered glottal flow signals. The method offers in principle the possibility of performing transformations of the glottal flow by acting on the physiologically based parameters of the model. This is a desirable property, e.g., for speech synthesis applications. The model was tested on a data set which included inverse-filtered glottal flow waveforms of different characteristics. The results demonstrate the possibility of reproducing natural speech waveforms with high accuracy, and of controlling important characteristics of the synthesis such as pitch.  相似文献   

4.
Synchronized videostroboscopy and electroglottography were applied to the measurement of anterior-to-posterior open glottal length in four groups of patients; two with no clinically significant voice disorder, one with vocal fold polyps, and one with vocal fold nodules. The data showed that the groups did not differ significantly when open glottal length was measured at the time of minimum glottal opening. The pathological groups had significantly lower open glottal length measurements, however, when measurements were obtained at the time that vocal fold contact was initiated during the glottal cycle. The findings are preliminary evidence that vocal fold neoplasms may not have the effect of reducing glottal closure, as previously suggested in the literature. The data also highlight the importance of examining differential effects of vocal fold neoplasms at various points throughout the glottal cycle.  相似文献   

5.
Mongolian “throat singing” can be performed in different modes. In Mongolia, the bass-type is called Kargyraa. The voice source in bass-type throat singing was studied in one male singer. The subject alternated between modal voice and the throat singing mode. Vocal fold vibrations were observed with high-speed photography, using a computerized recording system. The spectral characteristics of the sound signal were analyzed. Kymographic image data were compared to the sound signal and flow inverse filtering data from the same singer were obtained on a separate occasion. It was found that the vocal folds vibrated at the same frequency throughout both modes of singing. During throat singing the ventricular folds vibrated with complete but short closures at half the frequency of the true vocal folds, covering every second vocal fold closure. Kymographic data confirmed the findings. The spectrum contained added subharmonics compared to modal voice. In the inverse filtered signal the amplitude of every second airflow pulse was considerably lowered. The ventricular folds appeared to modulate the sound by reducing the glottal flow of every other vocal fold vibratory cycle.  相似文献   

6.
Values for acoustic voice measurements were obtained from 88 normal individuals and 98 pathological cases of mass lesions of vocal fold and 50 cases of unilateral vocal fold paralysis. Overall, all items reflecting perturbations of pitch and amplitude as well as glottal noise were significantly higher in the groups of patients compared with the normal group. The measurement of normalized noise energy (NNE) was found to be an optimum parameter for discrimination of normal/abnormal voices. The voices of patients with vocal fold nodules and vocal fold polyps were analyzed before endolaryngeal phonomicrosurgery (EPM) and 2 weeks after. Statistically significant (p < 0.01) improvement was achieved both in perceptual and acoustic analysis. EPM resulted in a significant decrease of mean jitter, shimmer, and NNE. Clinically, these measures provided documentable and measurable evidence of vocal function and were helpful for comparing patients with normal speakers. They also were useful for a thorough documentation of patient's voice pathology and for evaluation of the presurgical and postsurgical voice status.  相似文献   

7.
提出一种声带动力学模型参数反演方法,从发声机理角度对声带病变嗓音进行有效区分。依据声带生理组织和伯努利定律构建声带动力学模型,确定模型优化参数向量,耦合声门气流获取模型声门波;利用迭代自适应逆滤波算法获得实际嗓音声门波作为目标声门波;采用遗传优化算法提出通过匹配目标和模型声门波特征参数实现模型参数反演。实验结果表明,表征声门波的各时频域参数匹配相对误差不超过2%;依据反演所获模型参数提出去除声门下压影响的平均归一化缩放系数,克服声带非对称性特征在区分病变嗓音方面的不足,实现病理嗓音的全面有效区分。   相似文献   

8.
A nonlinear model is applied to study pathologic vocal vibratory characteristics and voice treatments of Parkinson's disease. We find that a number of pathologic vocal characteristics commonly observed in Parkinson's disease, including reduced vibratory intensity, incomplete vocal closure, increased phonation threshold pressure, glottal tremor, subharmonics, and chaotic vocal fold vibrations, can be studied with this nonlinear model. We also find that two kinds of clinical voice treatments for Parkinson's disease, including respiratory effort treatment and Lee Silverman voice treatment can be studied with this computer model. Results suggest that respiratory effort treatment, in which subglottal pressure is increased, might aid in enhancing vibratory intensity, improving glottal closure, and avoiding vibratory irregularity. However, the Lee Silverman voice treatment, in which both subglottal pressure and vocal fold adduction are increased, might be better than respiratory effort treatment. Increasing vocal fold thickness would be further helpful to improve these pathologic characteristics. The model studies show consistencies with clinical observations. Computer models may be of value in understanding the dynamic mechanism of disordered voices and studying voice treatment effects in Parkinson's disease.  相似文献   

9.
Spectral measures of the glottal source were investigated using an excised canine larynx (CL) model for various aerodynamic and phonatory conditions. These measures included spectral harmonic difference H1-H2 and spectral slope that are highly correlated with voice quality but not reported in a systematic manner using an excised larynx model. It was hypothesized that the acoustic spectra of the glottal source were significantly influenced by the subglottal pressure, glottal adduction, and vocal fold elongation, as well as the resulting vibration pattern. CLs were prepared, mounted on the bench with and without false vocal folds, and made to oscillate with a flow of heated and humidified air. Major control parameters were subglottal pressure, adduction, and elongation. Electroglottograph, subglottal pressure, flow rate, and audio signals were analyzed using custom software. Results suggest that an increase in subglottal pressure and glottal adduction may change the energy balance between harmonics by increasing the spectral energy of the first few harmonics in an unpredictable manner. It is suggested that changes in the dynamics of vocal fold motion may be responsible for different spectral patterns. The finding that the spectral harmonics do not conform to previous findings was demonstrated through various cases. Results of this study may shed light on phonatory spectral control when the larynx is part of a complete vocal tract system.  相似文献   

10.
The purpose of this exploratory study was to determine if laryngeal transillumination in combination with stroboscopy (strobophotoglottography; SPGG) is useful for (1) the visualization of vocal fold vibration (VFV) opening patterns, (2) the localization of initial vocal fold opening in horizontal glottal thirds (anterior, midmembranous, and posterior), (3) determination of the temporal correspondence of the so-called electroglottography (EGG)-knee and initial vocal fold separation, and, finally, (4) automatized quantitative measurements of glottal area function within endoscopic images. With stroboscopic transillumination, initial inferior vocal fold separation was detectable during the "closed" phase, where the vocal folds were still closed in the upper portion and therefore initial inferior vocal fold separation could not be visualized with usual laryngoscopy techniques. In the horizontal plane within similar fundamental frequencies in modal voice registers in two male subjects, localization of initial glottal opening depended on the voice types used (soft, normal, or pressed phonation). We found zipperlike posterior-to-anterior openings, initial midmembranous openings, initial anterior openings, as well as simultaneous initial opening of all three portions in the two healthy male adults examined. This technique proved to add temporal and spatial information to vocal fold opening patterns and extends our examination techniques to the very beginning of vocal fold opening at the inferior portion. Simultaneous electroglottogram tracking and comparison with bidirectionally illuminated stroboscopic images revealed a time-locked correspondence of the EGG-knee with the aforementioned initial inferior vocal fold separation. Bidirectional illumination combined with digital color extraction techniques allowed for image separation of subglottally and supraglottally illuminated structures. This facilitated vocal fold contour detection and automatized image processing, for example, for determination of glottal area function, and is considered to be a further step to objective automatized quantitative measurements within endoscopic images.  相似文献   

11.
The harmonics-to-noise ratio (HNR) of the voiced speech signal has implicitly been used to infer information regarding the turbulent noise level at the glottis. However, two problems exist for inferring glottal noise attributes from the HNR of the speech wave form: (i) the measure is fundamental frequency (f0) dependent for equal levels of glottal noise, and (ii) any deviation from signal periodicity affects the ratio, not just turbulent noise. An alternative harmonics-to-noise ratio formulation [glottal related HNR (GHNR')] is proposed to overcome the former problem. In GHNR' a mean over the spectral range of interest of the HNRs at specific harmonic/between-harmonic frequencies (expressed in linear scale) is calculated. For the latter issue [(ii)] two spectral tilt measures are shown, using synthesis data, to be sensitive to glottal noise while at the same time being comparatively insensitive to other glottal aperiodicities. The theoretical development predicts that the spectral tilt measures reduce as noise levels increase. A conventional HNR estimator, GHNR' and two spectral tilt measures are applied to a data set of 13 pathological and 12 normal voice samples. One of the tilt measures and GHNR' are shown to provide statistically significant differentiating power over a conventional HNR estimator.  相似文献   

12.
Measurements on the inverse filtered airflow waveform and of estimated average transglottal pressure and glottal airflow were made from syllable sequences in low, normal, and high pitch for 25 male and 20 female speakers. Correlation analyses indicated that several of the airflow measurements were more directly related to voice intensity than to fundamental frequency (F0). Results suggested that pressure may have different influences in low and high pitch in this speech task. It is suggested that unexpected results of increased pressure in low pitch were related to maintaining voice quality, that is, avoiding vocal fry. In high pitch, the increased pressure may serve to maintain vocal fold vibration. The findings suggested different underlying laryngeal mechanisms and vocal adjustments for increasing and decreasing F0 from normal pitch.  相似文献   

13.
Vocal fold vibratory asymmetry is often associated with inefficient sound production through its impact on source spectral tilt. This association is investigated in both a computational voice production model and a group of 47 human subjects. The model provides indirect control over the degree of left-right phase asymmetry within a nonlinear source-filter framework, and high-speed videoendoscopy provides in vivo measures of vocal fold vibratory asymmetry. Source spectral tilt measures are estimated from the inverse-filtered spectrum of the simulated and recorded radiated acoustic pressure. As expected, model simulations indicate that increasing left-right phase asymmetry induces steeper spectral tilt. Subject data, however, reveal that none of the vibratory asymmetry measures correlates with spectral tilt measures. Probing further into physiological correlates of spectral tilt that might be affected by asymmetry, the glottal area waveform is parameterized to obtain measures of the open phase (open/plateau quotient) and closing phase (speed/closing quotient). Subjects' left-right phase asymmetry exhibits low, but statistically significant, correlations with speed quotient (r=0.45) and closing quotient (r=-0.39). Results call for future studies into the effect of asymmetric vocal fold vibration on glottal airflow and the associated impact on voice source spectral properties and vocal efficiency.  相似文献   

14.
Geometry of the human vocal folds strongly influences their oscillatory motion. While the effect of intraglottal geometry on phonation has been widely investigated, the study of the geometry of the inferior surface of the vocal folds has been limited. In this study the way in which the inferior vocal fold surface angle affects vocal fold vibration was explored using a two-dimensional, self-oscillating finite element vocal fold model. The geometry was parameterized to create models with five different inferior surface angles. Four of the five models exhibited self-sustained oscillations. Comparisons of model motion showed increased vertical displacement and decreased glottal width amplitude with decreasing inferior surface angle. In addition, glottal width and air flow rate waveforms changed as the inferior surface angle was varied. Structural, rather than aerodynamic, effects are shown to be the cause of the changes in model response as the inferior surface angle was varied. Supporting data including glottal pressure distribution, average intraglottal pressure, energy transfer, and flow separation point locations are discussed, and suggestions for future research are given.  相似文献   

15.
An algorithm for estimating the vocal pulse positions and durations in an actual speech signal is described. Testing of the algorithm shows that it outperforms the best of the competitor algorithms in accuracy on the average by a factor of two. The algorithm is less sensitive to spectrum distortions in telephone channels, to various types of noise, and to instability in duration and amplitude of pulses produced by the voice source. The accuracy of the pulse position estimate is sufficient for a synchronous speech signal analysis, while the speed of signal processing makes the algorithm suitable for real-time operation.  相似文献   

16.
This paper presents a Hilbert transform-based approach to analyze vocal fold vibrations in human subjects exhibiting normal and abnormal voice productions. This new approach is applied to the analysis of glottal area waveform (GAW) and is capable of providing useful information on the vocal fold vibration. The GAW is extracted from high-speed laryngeal images by delineating the glottal edge for each image frame. An analytic signal is generated through the Hilbert transform of the GAW, which yields a recognizable pattern of the vocal fold vibration in the analytic phase plane. The vibratory pattern is comprehensive and can be correlated with specific voice conditions. Quantitative measures of the glottal perturbation are introduced using the analytic amplitude and instantaneous frequency obtained from the analysis. Examples of clinical voice recordings are used to evaluate and test the effectiveness of this approach in providing qualitative representation and quantitative characteristics of vocal fold vibratory behavior. The results demonstrate the potential of using this new analytical tool incorporated with the high-speed laryngeal imaging modality for clinical voice assessment.  相似文献   

17.
Peter Murphy   《Journal of voice》2008,22(2):125-137
SUMMARY: An investigation of the effect of glottal source aperiodicities (jitter, shimmer, and aspiration noise) on the estimation of fundamental frequency (f0) perturbation and amplitude perturbation, of synthesized, glottal source and voiced speech waveforms, is considered. Firstly, 4, cycle-event f0 estimators are examined: (1) waveform matching of the low-pass filtered waveform, (2) positive peaks (PPs) from the speech waveform, (3) PPs from the low-pass filtered waveform, and (4) positive zero crossings from the low-pass filtered waveform. The analysis shows that f0 perturbation measures taken from the low-pass filtered waveform are affected by both amplitude perturbation and random glottal noise, whereas, f0 perturbation measures taken from the PPs of the original waveform are affected by noise but not by amplitude perturbation. It is shown for the low-pass filter methods that the effects of amplitude perturbation and noise lead to increased errors in the measurement of f0 perturbation for the synthesized speech waveforms when compared with the synthesized glottal waveforms. Shimmer of the synthesized speech waveform is approximately equal to shimmer of the synthesized glottal source. However, noise and jitter affect measures of amplitude perturbation. The estimation of f0 perturbation from the synthesized speech waveform is shown to be nonlinearly related to f0 perturbation estimation from the synthesized glottal waveform as a consequence of the filtering action of the vocal tract. Low-pass filtering the voiced speech waveform is shown to provide a partial solution to this problem.  相似文献   

18.
A theoretical flow solution is presented for predicting the pressure distribution along the vocal fold walls arising from asymmetric flow that forms during the closing phases of speech. The resultant wall jet was analyzed using boundary layer methods in a non-inertial reference frame attached to the moving wall. A solution for the near-wall velocity profiles on the flow wall was developed based on a Falkner-Skan similarity solution and it was demonstrated that the pressure distribution along the flow wall is imposed by the velocity in the inviscid core of the wall jet. The method was validated with experimental velocity data from 7.5 times life-size vocal fold models, acquired for varying flow rates and glottal divergence angles. The solution for the asymmetric pressures was incorporated into a widely used two-mass model of vocal fold oscillation with a coupled acoustical model of sound propagation. Asymmetric pressure loading was found to facilitate glottal closure, which yielded only slightly higher values of maximum flow declination rate and radiated sound, and a small decrease in the slope of the spectral tilt. While the impact on symmetrically tensioned vocal folds was small, results indicate the effect becomes more significant for asymmetrically tensioned vocal folds.  相似文献   

19.
This study presents an approach to visualizing intensity regulation in speech. The method expresses a voice sample in a two-dimensional space using amplitude-domain values extracted from the glottal flow estimated by inverse filtering. The two-dimensional presentation is obtained by expressing a time-domain measure of the glottal pulse, the amplitude quotient (AQ), as a function of the negative peak amplitude of the flow derivative (d(peak)). The regulation of vocal intensity was analyzed with the proposed method from voices varying from extremely soft to very loud with a SPL range of approximately 55 dB. When vocal intensity was increased, the speech samples first showed a rapidly decreasing trend as expressed on the proposed AQ-d(peak) graph. When intensity was further raised, the location of the samples converged toward a horizontal line, the asymptote of a hypothetical hyperbola. This behavior of the AQ-d(peak) graph indicates that the intensity regulation strategy changes from laryngeal to respiratory mechanisms and the method chosen makes it possible to quantify how control mechanisms underlying the regulation of vocal intensity change gradually between the two means. The proposed presentation constitutes an easy-to-implement method to visualize the function of voice production in intensity regulation because the only information needed is the glottal flow wave form estimated by inverse filtering the acoustic speech pressure signal.  相似文献   

20.
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号