首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 110 毫秒
1.
A theory of interaction between the source of sound in phonation and the vocal tract filter is developed. The degree of interaction is controlled by the cross-sectional area of the laryngeal vestibule (epilarynx tube), which raises the inertive reactance of the supraglottal vocal tract. Both subglottal and supraglottal reactances can enhance the driving pressures of the vocal folds and the glottal flow, thereby increasing the energy level at the source. The theory predicts that instabilities in vibration modes may occur when harmonics pass through formants during pitch or vowel changes. Unlike in most musical instruments (e.g., woodwinds and brasses), a stable harmonic source spectrum is not obtained by tuning harmonics to vocal tract resonances, but rather by placing harmonics into favorable reactance regions. This allows for positive reinforcement of the harmonics by supraglottal inertive reactance (and to a lesser degree by subglottal compliant reactance) without the risk of instability. The traditional linear source-filter theory is encumbered with possible inconsistencies in the glottal flow spectrum, which is shown to be influenced by interaction. In addition, the linear theory does not predict bifurcations in the dynamical behavior of vocal fold vibration due to acoustic loading by the vocal tract.  相似文献   

2.
Analytical and computer simulation studies have shown that the acoustic impedance of the vocal tract as well as the viscoelastic properties of vocal fold tissues are critical for determining the dynamics and the energy transfer mechanism of vocal fold oscillation. In the present study, a linear, small-amplitude oscillation theory was revised by taking into account the propagation of a mucosal wave and the inertive reactance (inertance) of the supraglottal vocal tract as the major energy transfer mechanisms for flow-induced self-oscillation of the vocal fold. Specifically, analytical results predicted that phonation threshold pressure (Pth) increases with the viscous shear properties of the vocal fold, but decreases with vocal tract inertance. This theory was empirically tested using a physical model of the larynx, where biological materials (fat, hyaluronic acid, and fibronectin) were implanted into the vocal fold cover to investigate the effect of vocal fold tissue viscoelasticity on Pth. A uniform-tube supraglottal vocal tract was also introduced to examine the effect of vocal tract inertance on Pth. Results showed that Pth decreased with the inertive impedance of the vocal tract and increased with the viscous shear modulus (G") or dynamic viscosity (eta') of the vocal fold cover, consistent with theoretical predictions. These findings supported the potential biomechanical benefits of hyaluronic acid as a surgical bioimplant for repairing voice disorders involving the superficial layer of the lamina propria, such as scarring, sulcus vocalis, atrophy, and Reinke's edema.  相似文献   

3.
Cavities branching off the main vocal tract are ubiquitous in nonhumans. Mammalian air sacs exist in human relatives, including all four great apes, but only a substantially reduced version exists in humans. The present paper focuses on acoustical functions of the air sacs. The hypotheses are investigated on whether the air sacs affect amplitude of utterances and/or position of formants. A multilayer synthetic model of the vocal folds coupled with a vocal tract model was utilized. As an air sac model, four configurations were considered: open and closed uniform tube-like side branches, a rigid cavity, and an inflatable cavity. Results suggest that some air sac configurations can enhance the sound level. Furthermore, an air sac model introduces one or more additional resonance frequencies, shifting formants of the main vocal tract to some extent but not as strongly as previously suggested. In addition, dynamic range of vocalization can be extended by the air sacs. A new finding is also an increased variability of the vocal tract impedance, leading to strong nonlinear source-filter interaction effects. The experiments demonstrated that air-sac-like structures can destabilize the sound source. The results were validated by a transmission line computational model.  相似文献   

4.
A voice production model is created in this work by considering essential aerodynamic and acoustic phenomena in human voice production. A precise flow analysis is performed based on a boundary-layer approximation and the viscous-inviscid interaction between the boundary layer and the core flow. This flow analysis can supply information on the separation point of the glottal flow and the thickness of the boundary layer, both of which strongly depend on the glottal configuration and yield an effective prediction of the flow behavior. When the flow analysis is combined with the modified two-mass model of the vocal fold [Pelorson et al. (1994). J. Acoust. Soc. Am. 96, 3416-3431], the resulting acoustic wave travels through the vocal tract and a pressure change develops in the vicinity of the glottis. This change can affect the glottal flow and the motion of the vocal folds, causing source-filter coupling. The property of the acoustic feedback is explicitly expressed in the frequency domain by using an acoustic tube model, allowing a clear interpretation of the coupling. Numerical experiments show that the vocal-tract input impedance and frequency responses representing the source-filter coupling have dominant peaks corresponding to the fourth and fifth formants. Results of time-domain simulations also suggest the importance of these high-frequency peaks in voice production.  相似文献   

5.
《Journal of voice》2023,37(1):1-8
The novel stochastic model to produce voiced sounds proposed in this paper uses the source-filter Fant theory to generate voice signals and, consequently, it does not consider the coupling between the vocal tract and the vocal folds. Two novelties are proposed in the paper. The first one is the new model obtained from the unification of two other deterministic one mass-spring-damper models obtained from the literature and the second one is to build a stochastic model which can generate and control the level of jitter resulting even in hoarse voice signals or with pathological characteristics but using a simpler model than those ones discussed in the literature. An inverse stochastic problem is then solved for two cases, considering a normal voice and other obtained from a case of paralysis on the vocal folds. The parameters of the model are identified in the two cases allowing the validation of the model.  相似文献   

6.
Nonlinear source-filter coupling has been demonstrated in computer simulations, in excised larynx experiments, and in physical models, but not in a consistent and unequivocal way in natural human phonations. Eighteen subjects (nine adult males and nine adult females) performed three vocal exercises that represented a combination of various fundamental frequency and formant glides. The goal of this study was to pinpoint the proportion of source instabilities that are due to nonlinear source-tract coupling. It was hypothesized that vocal fold vibration is maximally destabilized when F(0) crosses F(1), where the acoustic load changes dramatically. A companion paper provides the theoretical underpinnings. Expected manifestations of a source-filter interaction were sudden frequency jumps, subharmonic generation, or chaotic vocal fold vibrations that coincide with F(0)-F(1) crossovers. Results indicated that the bifurcations occur more often in phonations with F(0)-F(1) crossovers, suggesting that nonlinear source-filter coupling is partly responsible for source instabilities. Furthermore it was observed that male subjects show more bifurcations in phonations with F(0)-F(1) crossovers, presumably because in normal speech they are less likely to encounter these crossovers as much as females and hence have less practice in suppressing unwanted instabilities.  相似文献   

7.
The relation between subglottal pressure (Ps) and fundamental frequency (F0) in phonation was investigated with an in vivo canine model. Direct muscle stimulation was used in addition to brain stimulation. This allowed the Ps-F0 slope to be quantified in terms of cricothyroid muscle activity. Results showed that, for ranges of 0-2 mA constant current stimulation of the cricothyroid muscle, the Ps-F0 slope ranged from 10 Hz/kPa to 60 Hz/kPa. These results were compared to similar slopes obtained in a previous study on excised larynges in which the vocal fold length was varied instead of cricothyroid activation. A physical interpretation of the Ps-F0 slope is that the amplitude-to-length ratio of the vocal folds decreases with CT activity, resulting in a smaller time-varying stiffness. In other words, there is less dependence of F0 on amplitude of vibration when the vocal folds are long instead of short.  相似文献   

8.
The didjeridu (didgeridoo) or yidaki of the Australian Aboriginal people consists of the narrow trunk of a small Eucalypt tree that has been hollowed out by the action of termites, cut to a length of about 1.5 m, smoothed, and decorated. It is lip-blown like a trumpet and produces a simple drone in the frequency range 55 to 80 Hz. Interest arises from the fact that a skilled player can make a very wide variety of sounds with formants rather like those of human vowels, and can also produce additional complex sounds by adding vocalization. An outline is given of the way in which the whole system can be analyzed using the harmonic-balance technique, but a simpler approach with lip motion assumed shows easily that upper harmonics of the drone with frequencies lying close to impedance maxima of the vocal tract are suppressed, so that formant bands appear near impedance minima of the vocal tract. This agrees with experimental findings. Simultaneous vibration of the player's lips and vocal folds is shown to generate multiple sum and difference tones, and can be used to produce subharmonics of the drone. A brief discussion is given of player preference of particular bore profiles.  相似文献   

9.
10.
The vocal folds and glottis are analyzed as a single system rather than as two separate but interacting systems, i.e., an aerodynamic one (the glottis) and a mechanical one (the vocal folds). Simplified steady flow calculations based on the two-mass model, and similar to those of Ishizaka and Matsudaira [SCRL Monograph No. 8, Santa Barbara, CA (1972)], are made except that flexible walls are assumed for both dc and ac flows. A negative differential resistance is found for steady flow when the coupling spring is weak compared to that of the lower mass. Dynamic transverse motion of the masses is represented by two transverse series resonant circuits in parallel within the glottis. The vocal tract is represented by a lumped resistance and inertance in series. Sustained, self-excited, small-amplitude oscillations can be obtained when the magnitude of the negative differential resistance is equal to the real part of the impedance of the rest of the circuit. The oscillation frequency depends only on the elasticity and mass of the vocal folds. The present analysis differs from Ishizaka and Matsudaira's analysis because their oscillation frequency decreases as dc volume velocity increases.  相似文献   

11.
Although listeners routinely perceive both the sex and individual identity of talkers from their speech, explanations of these abilities are incomplete. Here, variation in vocal production-related anatomy was assumed to affect vowel acoustics thought to be critical for indexical cueing. Integrating this approach with source-filter theory, patterns of acoustic parameters that should represent sex and identity were identified. Due to sexual dimorphism, the combination of fundamental frequency (F0, reflecting larynx size) and vocal tract length cues (VTL, reflecting body size) was predicted to provide the strongest acoustic correlates of talker sex. Acoustic measures associated with presumed variations in supralaryngeal vocal tract-related anatomy occurring within sex were expected to be prominent in individual talker identity. These predictions were supported by results of analyses of 2500 tokens of the /epsilon/ phoneme, extracted from the naturally produced speech of 125 subjects. Classification by talker sex was virtually perfect when F0 and VTL were used together, whereas talker classification depended primarily on the various acoustic parameters associated with vocal-tract filtering.  相似文献   

12.
Voice training techniques often make use of exercises involving partial occlusion of the vocal tract, typically at the anterior part of the oral cavity or at the lips. In this study two techniques are investigated: a bilabial fricative and a small diameter hard-walled tube placed between the lips. Because the input acoustic impedance of the vocal tract is known to affect both the shaping of the glottal flow pulse and the vibrational pattern of the vocal folds, a study of the input impedance is an essential step in understanding the benefits of these two techniques. The input acoustic impedance of the vocal tract was investigated theoretically for cases of a vowel, bilabial occlusion (fully closed lips), a bilabial fricative, and artificially lengthening the tract with small diameter tubes. The results indicate that the tubes increase the input impedance in the range of the fundamental frequency of phonation by lowering the first formant frequency to nearly that of the bilabial occlusion (the lower bound on the first formant) while still allowing a continuous airflow. The bilabial fricative also has the effect of lowering the first formant frequency and increasing the low-frequency impedance, but not as effectively as the extension tubes.  相似文献   

13.
Acoustic radiation impedance of the mouth is an important parameter when the vocal tract is modelled by the equivalent electrical circuit. If the vocal tract is closed by a cavity, as when the speaker wears some kind of mask, total impedance acoustically loading the vocal tract becomes serial connection of the mouth radiation impedance and the mask impedance. In that case the mouth radiation impedance has to be changed compared to free field conditions. This paper introduces a simplified approach to the modelling of that change by an appropriate reduction coefficient. The analysis based on an experiment preformed by measurement in the vocal tract physical model accompanied with analytical estimation has shown that the value of such reduction coefficient is 0.5. The results reveal that for a vocal tract closed with mask cavity the change in mouth radiation impedance introduced in an equivalent electrical circuit can be approximated by the value for free field radiation decreased by about 50%.  相似文献   

14.
A synthetic two-layer, self-oscillating, life-size vocal fold model was used to study the influence of the vocal tract and false folds on the glottal jet. The model vibrated at frequencies, pressures, flow rates, and amplitudes consistent with human phonation, although some differences in behavior between the model and the human vocal folds are noted. High-speed images of model motion and flow visualization were acquired. Phase-locked ensemble-averaged glottal jet velocity measurements using particle image velocimetry (PIV) were acquired with and without an idealized vocal tract, with and without false folds. PIV data were obtained with varying degrees of lateral asymmetric model positioning. Glottal jet velocity magnitudes were consistent with those measured using excised larynges. A starting vortex was observed in all test cases. The false folds interfered with the starting vortex, and in some cases vortex shedding from the false folds was observed. In asymmetric cases without false folds, the glottal jet tended to skew toward the nearest wall; with the false folds, the opposite trend was observed. rms velocity calculations showed the jet shear layer and laminar core. The rms velocities were higher in the vocal tract cases compared to the open jet and false fold cases.  相似文献   

15.
The fundamental frequency of vocal fold oscillation (F(0)) is controlled by laryngeal mechanics and aerodynamic properties. F(0) change per unit change of transglottal pressure (dF/dP) using a shutter valve has been studied and found to have nonlinear, V-shaped relationship with F(0). On the other hand, the vocal tract is also known to affect vocal fold oscillation. This study examined the effect of artificially lengthened vocal tract length on dF/dP. dF/dP was measured in six men using two mouthpieces of different lengths. Results: The dF/dP graph for the longer vocal tract was shifted leftward relative to the shorter one. Conclusion: Using the one-mass model, the nadir of the "V" on the dF/dP graph was strongly influenced by the resonance around the first formant frequency. However, a more precise model is needed to account for the effects of viscosity and turbulence.  相似文献   

16.
The aim of the study was to identify the acoustic correlates of female teachers' subjective voice complaints by recording their voices in their working environment. The subjects made recordings during lessons (N = 10) and breaks (N = 11). The subjects were divided into 2 groups: those with few voice complaints (FC group) and those with many voice complaints (MC group). The speech sample made in the breaks was maximally sustained /a/, from which fundamental frequency (F0), jitter, and shimmer were analyzed. The classroom samples were analyzed for F0, sound pressure level (SPL), and F0 time (the active vibration time of the vocal folds). Additionally, an index for assessing voice loading is presented. The results revealed a tendency of the MC group to have higher F0 and lower SPL and perturbation values than the FC group. The index values correlated moderately with the subjective vocal complaints.  相似文献   

17.
Well-known multimass models of vocal folds are useful to describe main behavior observed in human voicing but their principle of functioning, based on harmonic oscillation, may appear complex. This work is designed to show that a simple one-mass model ruled by laws of relaxation oscillation can also depict main behavior of glottis dynamic. Theory of relaxation oscillation is detailed. A relaxation oscillation model is assessed through a numerical simulation using conventional values for tissue characteristics and subglottal pressure. As expected, raising the mass decreases the fundamental frequency and increases the amplitude of vocal fold vibration: for a mass ranging from 0.01 to 0.4 g, F0 decreased from 297.5 to 42.5 Hz and vibrational amplitude increased from 1.26 to 3.25 mm (for stiffness k=10Nm(-1), damping r=0.015 N s m(-1), and subglottal pressure=1 kPa). Stiffness value has the opposite effect. The subglottal pressure controls the fundamental frequency with a rate ranging from 20 to 50 Hz/kPa. The vibrational amplitude is also controlled linearly by subglottal pressure from 0.22 to 0.26 mm/kPa. The range of phonation threshold pressure (PTP) is close to the values currently proposed, that is, 0.1 to 1 kPa and varies with the fundamental frequency. The relaxation oscillator is a simple and useful tool for modeling vocal fold vibration.  相似文献   

18.
This letter analyzes the oscillation onset-offset conditions of the vocal folds as a function of laryngeal size. A version of the two-mass model of the vocal folds is used, coupled to a two-tube approximation of the vocal tract in configuration for the vowel /a/. The standard male configurations of the laryngeal and vocal tract models are used as reference, and their dimensions are scaled using a single factor. Simulations of the vocal fold oscillation and oral output are produced for varying values of the scaling factor. The results show that the oscillation threshold conditions become more restricted for smaller laryngeal sizes, such as those appropriate for females and children.  相似文献   

19.
20.
The purpose of this investigation was to study voice changes during a working day. The subjects consisted of 33 female primary and secondary schoolteachers who recorded their first and last lessons during one school day. The subjects were studied both as one group and two subgroups (those with many and those with few voice complaints). Estimates of fundamental frequency (F0), sound pressure level (SPL), the standard deviations of these values (F0 SD; SPL SD) and F0 time (vibration time of vocal folds) were made. The most obvious change due to loading was the rise of F0 that was 9.7 Hz between the first and last lesson (P = 0.00). F0 increased more (12.8. Hz, P = 0.006) in the subgroup with few complaints.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号