首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 46 毫秒
1.
This correspondence demonstrates that the line spectral frequencies (LSFs) are the pole and zero frequencies of the glottal driving-point impedance of a discrete matched-impedance vocal-tract model. Several well-known characteristics of the LSFs, including the interlacing of pole and zero frequencies, are shown to follow naturally from this proof.  相似文献   

2.
A technique for modifying vocal tract area functions is developed by using sum and difference combinations of acoustic sensitivity functions to perturb an initial vocal tract configuration. First, sensitivity functions [e.g., Fant and Pauli, Proc. Speech Comm. Sem. 74, 1975] are calculated for a given area function, at its specific formant frequencies. The sensitivity functions are then multiplied by scaling coefficients that are determined from the difference between a desired set of formant frequencies and those supported by the current area function. The scaled sensitivity functions are then summed together to generate a perturbation of the area function. This produces a new area function whose associated formant frequencies are closer to the desired values than the previous one. This process is repeated iteratively until the coefficients are equal to zero or are below a threshold value.  相似文献   

3.
Resonances of a bent vocal tract   总被引:1,自引:0,他引:1  
Analyses of wave propagation in a vocal tract invariably assume the tract to be a variable area tube with a straight axis. No estimates appear to be available of the effect of curvature of the tract on its resonances (or, equivalently, on its transfer function). For arbitrarily varying cross section, the computation is difficult. However, one can choose an idealized bent vocal tract with a uniform cross section for which the wave equation is separable. The resonance frequencies for such a bent tract are computed and compared to those of the corresponding straight tract. The comparison shows that for typical dimensions of the tract the shift in the resonance frequencies below 4 kHz is in the range of 2%-8%.  相似文献   

4.
According to recent model investigations, vocal tract resonance is relevant to vocal registers. However, no experimental corroboration of this claim has been published so far. In the present investigation, ten professional tenors' vocal tract configurations were analyzed using MRI volumetry. All subjects produced a sustained tone on the pitch F4 (349 Hz) on the vowel /a/ (1) in modal and (2) in falsetto register. The area functions were estimated from the MRI data and their associated formant frequencies were calculated. In a second condition the same subjects repeated the same tasks in a sound treated room and their formant frequencies were estimated by means of inverse filtering. In both recordings similar formant frequencies were observed. Vocal tract shapes differed between modal and falsetto register. In modal as compared to falsetto the lip opening and the oral cavity were wider and the first formant frequency was higher. In this sense the presented results are in agreement with the claim that the formant frequencies differ between registers.  相似文献   

5.
A synthetic two-layer, self-oscillating, life-size vocal fold model was used to study the influence of the vocal tract and false folds on the glottal jet. The model vibrated at frequencies, pressures, flow rates, and amplitudes consistent with human phonation, although some differences in behavior between the model and the human vocal folds are noted. High-speed images of model motion and flow visualization were acquired. Phase-locked ensemble-averaged glottal jet velocity measurements using particle image velocimetry (PIV) were acquired with and without an idealized vocal tract, with and without false folds. PIV data were obtained with varying degrees of lateral asymmetric model positioning. Glottal jet velocity magnitudes were consistent with those measured using excised larynges. A starting vortex was observed in all test cases. The false folds interfered with the starting vortex, and in some cases vortex shedding from the false folds was observed. In asymmetric cases without false folds, the glottal jet tended to skew toward the nearest wall; with the false folds, the opposite trend was observed. rms velocity calculations showed the jet shear layer and laminar core. The rms velocities were higher in the vocal tract cases compared to the open jet and false fold cases.  相似文献   

6.
A methodological study is presented to examine the acoustic role of the vocal tract in playing the trumpet. Preliminary results obtained for one professional player are also shown to demonstrate the effectiveness of the method. Images of the vocal tract with a resolution of 0.5 mm (2 mm in thickness) were recorded with magnetic resonance imaging to observe the tongue posture and estimate the vocal-tract area function during actual performance. The input impedance was then calculated for the player's air column including both the supra- and subglottal tracts using an acoustic tube model including the effect of wall losses. Finally, a time-domain blowing simulation by Adachi and Sato [J. Acoust. Soc. Am. 99, 1200-1209 (1996)] was performed with a model of the lips. In this simulation, the oscillating frequency of the lips was slightly affected by using different shapes of the vocal tract measured for the player. In particular, when the natural frequency of the lips was gradually increased, the transition to the higher mode occurred at different frequencies for different vocal-tract shapes. Furthermore, simulation results showed that the minimum blowing pressure required to attain the lip oscillation can be reduced by adjusting the vocal-tract shape properly.  相似文献   

7.
Cavities branching off the main vocal tract are ubiquitous in nonhumans. Mammalian air sacs exist in human relatives, including all four great apes, but only a substantially reduced version exists in humans. The present paper focuses on acoustical functions of the air sacs. The hypotheses are investigated on whether the air sacs affect amplitude of utterances and/or position of formants. A multilayer synthetic model of the vocal folds coupled with a vocal tract model was utilized. As an air sac model, four configurations were considered: open and closed uniform tube-like side branches, a rigid cavity, and an inflatable cavity. Results suggest that some air sac configurations can enhance the sound level. Furthermore, an air sac model introduces one or more additional resonance frequencies, shifting formants of the main vocal tract to some extent but not as strongly as previously suggested. In addition, dynamic range of vocalization can be extended by the air sacs. A new finding is also an increased variability of the vocal tract impedance, leading to strong nonlinear source-filter interaction effects. The experiments demonstrated that air-sac-like structures can destabilize the sound source. The results were validated by a transmission line computational model.  相似文献   

8.
The relation between the spatial configuration of the vocal tract as determined by magnetic resonance imaging (MRI) and the acoustical signal produced was investigated. A male subject carried out a set of phonatory tasks, comprising the utterance of the sustained vowels /i/ and /a/, each in a single articulation, and the vowel /epsilon/ with his larynx positioned variously on a vertical axis. Two- and three-dimensional measurements of the vocal tract were performed. The results of these measurements were used to calculate resonance frequencies, according to predictions from acoustical theory. Finally, calculated frequencies were compared with actually measured resonance frequencies in the audio signal. We found a strong relation between the acoustical signal produced and the spatial configuration for the first resonance frequencies of the articulations of the vowel /epsilon/, and first two resonance frequencies of the vowels /a/ and /i/. The capability to determine accurately vocal tract dimensions is a major advantage of this imaging technique.  相似文献   

9.
Vocal tract area functions may contain quite abrupt changes in cross-sectional area. In formant frequency calculations for such area functions, an inner length correction (ILC) should be applied. The relevance of this correction was investigated by comparing acoustic measurements obtained from a physical model of the vocal tract with data gathered by means of computer simulations. Calculating formant frequencies without applying internal length corrections caused substantial errors, particularly for area functions representing apical stops just anterior to occlusion. Decentering and axial symmetry in the arrangement of the area elements of the physical model were briefly studied and found to have effects on the formant frequency values.  相似文献   

10.
We describe an arrangement for simultaneous recording of speech and vocal tract geometry in patients undergoing surgery involving this area. Experimental design is considered from an articulatory phonetic point of view. The speech signals are recorded with an acoustic-electrical arrangement. The vocal tract is simultaneously imaged with MRI. A MATLAB-based system controls the timing of speech recording and MR image acquisition. The speech signals are cleaned from acoustic MRI noise by an adaptive signal processing algorithm. Finally, a vowel data set from pilot experiments is qualitatively compared both with validation data from the anechoic chamber and with Helmholtz resonances of the vocal tract volume, obtained using FEM.  相似文献   

11.
《Journal of voice》2023,37(1):1-8
The novel stochastic model to produce voiced sounds proposed in this paper uses the source-filter Fant theory to generate voice signals and, consequently, it does not consider the coupling between the vocal tract and the vocal folds. Two novelties are proposed in the paper. The first one is the new model obtained from the unification of two other deterministic one mass-spring-damper models obtained from the literature and the second one is to build a stochastic model which can generate and control the level of jitter resulting even in hoarse voice signals or with pathological characteristics but using a simpler model than those ones discussed in the literature. An inverse stochastic problem is then solved for two cases, considering a normal voice and other obtained from a case of paralysis on the vocal folds. The parameters of the model are identified in the two cases allowing the validation of the model.  相似文献   

12.
This letter analyzes the oscillation onset-offset conditions of the vocal folds as a function of laryngeal size. A version of the two-mass model of the vocal folds is used, coupled to a two-tube approximation of the vocal tract in configuration for the vowel /a/. The standard male configurations of the laryngeal and vocal tract models are used as reference, and their dimensions are scaled using a single factor. Simulations of the vocal fold oscillation and oral output are produced for varying values of the scaling factor. The results show that the oscillation threshold conditions become more restricted for smaller laryngeal sizes, such as those appropriate for females and children.  相似文献   

13.
The calculation of the resonance frequencies from experimental cross-sectional areas of a vocal tract under the assumption that its walls are perfectly rigid provides values that noticeably differ from the measured resonance frequencies. The compliance of the walls affects the first resonance and almost does not affect the higher-order resonances. The presence of branching in the tract at the level of the larynx affects the second and third resonances stronger than the first resonance. The parameters of the wall impedance (the loss, mass, and elasticity) and the length and cross-sectional area of the branchings are determined by minimizing the rms discrepancy between the measured and calculated resonance frequencies. The error in the frequency calculation with allowance for the wall compliance and branching in the tract proves to be within the accuracy of the formant estimation.  相似文献   

14.
An alternative and complete derivation of the vocal tract length sensitivity function, which is an equation for finding a change in formant frequency due to perturbation of the vocal tract length [Fant, Quarterly Progress and Status Rep. No. 4, Speech Transmission Laboratory, Kungliga Teknisha Hogskolan, Stockholm, 1975, pp. 1-14] is presented. It is based on the adiabatic invariance of the vocal tract as an acoustic resonator and on the radiation pressure on the wall and at the exit of the vocal tract. An algorithm for tuning the vocal tract shape to match the formant frequencies to target values, such as those of a recorded speech signal, which was proposed in Story [J. Acoust. Soc. Am. 119, 715-718 (2006)], is extended so that the vocal tract length can also be changed. Numerical simulation of this extended algorithm shows that it can successfully convert between the vocal tract shapes of a male and a female for each of five Japanese vowels.  相似文献   

15.
The purpose of this study was to investigate the relation between vocal tract deformation patterns obtained from statistical analyses of a set of area functions representative of a vowel repertoire, and the acoustic properties of a neutral vocal tract shape. Acoustic sensitivity functions were calculated for a mean area function based on seven different speakers. Specific linear combinations of the sensitivity functions corresponding to the first two formant frequencies were shown to possess essentially the same amplitude variation along the vocal tract length as the statistically derived deformation patterns reported in previous studies.  相似文献   

16.
Analytical and computer simulation studies have shown that the acoustic impedance of the vocal tract as well as the viscoelastic properties of vocal fold tissues are critical for determining the dynamics and the energy transfer mechanism of vocal fold oscillation. In the present study, a linear, small-amplitude oscillation theory was revised by taking into account the propagation of a mucosal wave and the inertive reactance (inertance) of the supraglottal vocal tract as the major energy transfer mechanisms for flow-induced self-oscillation of the vocal fold. Specifically, analytical results predicted that phonation threshold pressure (Pth) increases with the viscous shear properties of the vocal fold, but decreases with vocal tract inertance. This theory was empirically tested using a physical model of the larynx, where biological materials (fat, hyaluronic acid, and fibronectin) were implanted into the vocal fold cover to investigate the effect of vocal fold tissue viscoelasticity on Pth. A uniform-tube supraglottal vocal tract was also introduced to examine the effect of vocal tract inertance on Pth. Results showed that Pth decreased with the inertive impedance of the vocal tract and increased with the viscous shear modulus (G") or dynamic viscosity (eta') of the vocal fold cover, consistent with theoretical predictions. These findings supported the potential biomechanical benefits of hyaluronic acid as a surgical bioimplant for repairing voice disorders involving the superficial layer of the lamina propria, such as scarring, sulcus vocalis, atrophy, and Reinke's edema.  相似文献   

17.
The skilled use of nonperiodic phonation techniques in combination with spectrum analysis has been proposed here as a practical method for locating formant frequencies in the singing voice. The study addresses the question of the degree of similarity between sung phonations and their nonperiodic imitations, with respect to both frequency of the first two formants as well as posture of the vocal tract. Using magnetic resonance imaging (MRI), linear predictive coding (LPC), and spectrum analysis, two types of nonperiodic phonation (ingressive and vocal fry) are compared with singing phonations to determine the degree of similarity/difference in acoustic and spatial dimensions of the vocal tract when these phonation types are used to approximate the postures of singing. In comparing phonation types, the close similarity in acoustic data in combination with the relative dissimilarity in spatial data indicates that the accurate imitations are not primarily the result of imitating the singing postures, but have instead an aural basis.  相似文献   

18.
The inverse filter is a serial cascade of filter elements with a transfer function that cancels the effect of the poles of the vocal tract transfer function on the acoustic waveform to reveal the underlying glottal volume velocity waveform. Inaccuracies in the glottal wave reconstruction derived from an all-zero inverse filter can be attributed to deviations of the vocal tract transfer function from an all-pole model. Presented is an analysis of the error stemming from the effect of the yielding vocal tract sidewalls on the vocal tract transfer function. Predictions about the resulting artifacts in the estimated glottal volume velocity are derived from an acoustic model. These predictions are confirmed by applying a linear predictive coding (LPC) inverse filter analysis method to vowels synthesized using a transmission line model of the vocal tract containing yielding sidewall parameters as well as natural productions of nonnasalized vowels.  相似文献   

19.
An auxiliary vector particle filter was proposed to present the vocal tract resonances (VTRs) tracking.It uses particle filter based on a version of state-space model that describes the characteristics of speech signal.The speech model consists of a target-guided dynamic function and a non-linear prediction mapping from resonance frequencies and bandwidths to LPC cepstra(LPCC).There are two characteristics in the proposed method.First,particle filtering technique is put forth to solve the non-linear problem of speech model.Second,an auxiliary vector,embedded in the state function of speech model,is applied to incorporate the most current observations and to generate the proposal distribution of particle filter.The experimental results show that this method is able to track the VTRs of continuous speech utterance efficiently with a small number of particles and able to solve the problem of spurious peaks and merging peaks.  相似文献   

20.
Although advances in techniques for image acquisition and analysis have facilitated the direct measurement of three-dimensional vocal tract air space shapes associated with specific speech phonemes, little information is available with regard to changes in three-dimensional (3-D) vocal tract shape as a function of vocal register, pitch, and loudness. In this study, 3-D images of the vocal tract during falsetto and chest register phonations at various pitch and loudness conditions were obtained using electron beam computed tomography (EBCT). Detailed measurements and differences in vocal tract configuration and formant characteristics derived from the eight measured vocal tract shapes are reported.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号