首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 21 毫秒
1.
Although listeners routinely perceive both the sex and individual identity of talkers from their speech, explanations of these abilities are incomplete. Here, variation in vocal production-related anatomy was assumed to affect vowel acoustics thought to be critical for indexical cueing. Integrating this approach with source-filter theory, patterns of acoustic parameters that should represent sex and identity were identified. Due to sexual dimorphism, the combination of fundamental frequency (F0, reflecting larynx size) and vocal tract length cues (VTL, reflecting body size) was predicted to provide the strongest acoustic correlates of talker sex. Acoustic measures associated with presumed variations in supralaryngeal vocal tract-related anatomy occurring within sex were expected to be prominent in individual talker identity. These predictions were supported by results of analyses of 2500 tokens of the /epsilon/ phoneme, extracted from the naturally produced speech of 125 subjects. Classification by talker sex was virtually perfect when F0 and VTL were used together, whereas talker classification depended primarily on the various acoustic parameters associated with vocal-tract filtering.  相似文献   

2.
This study was aimed at identifying acoustic and physiological measures useful for monitoring voice changes in postnasopharyngeal patients with nonlaryngeal malignancies, and providing evidences of vocal tract effect on voice through comparisons between individuals with and without intact vocal tract. Simultaneous acoustic-electroglottographic signals recorded during phonation of vowels /i/ and /a/ sustained at habitual, high, and low pitch levels were compared among 10 postradiotherapy patients with nasopharyngeal carcinoma (NPC), 10 voice patients (VPs) with intact vocal tract, and 10 healthy individuals with normal voice (NORM). Results from a series of discriminant analyses revealed that the NPC group generally exhibited lower signal-to-noise (SNR) and open quotient (OQ) and higher Formant 1 frequency (F(1)) and speed quotient (SQ) than the NORM group. Unlike both VP and NORM groups, the NPC group failed to show a pitch effect on all voice measures, including OQ, SQ, percent jitter, percent shimmer, and SNR, suggesting an effect of radiotherapy and/or vocal tract on laryngeal behaviors. For the vowel /i/, on the other hand, only the NPC and NORM groups showed a pattern of pitch-dependent F(1) raising, a reflection of increased pharyngeal narrowing. These findings suggested that the pitch effect on laryngeal behaviors differed not only between individuals with intact vocal tract and those without but also between those with structural and dynamic changes of vocal tract.  相似文献   

3.
Although there has been continuing interest in voice quality, much of this research has focused on the vocal folds rather than the supraglottal structures. This paper reports the use of videoendoscopy for studying supraglottal participation in various singing tasks. In a preliminary study presented last year by the present authors, CT scanning was used to corroborate videoendoscopic observation. Vocal tract activities observed included variation of laryngeal height with pitch, variation of pharyngeal wall dimension with pitch and vowel, and marked supraglottic constriction with certain vocal imitations. In order to gain a better understanding of vocal training, and its effect upon vocal tract physiology, a study was designed using videoendoscopy to observe singers with significant experience and training while performing various vocal tasks. The tasks focused on the following: (1) vocal tract activity associated with pitch changes; (2) the physiology involved in the production of “cover”; (3) the structures involved in the production of vibrato; and (4) the physiology of the singer's “ring.” It would appear that videoendoscopy will become increasingly more valuable to the voice community as our understanding of vocal tract physiology improves.  相似文献   

4.
Voice training techniques often make use of exercises involving partial occlusion of the vocal tract, typically at the anterior part of the oral cavity or at the lips. In this study two techniques are investigated: a bilabial fricative and a small diameter hard-walled tube placed between the lips. Because the input acoustic impedance of the vocal tract is known to affect both the shaping of the glottal flow pulse and the vibrational pattern of the vocal folds, a study of the input impedance is an essential step in understanding the benefits of these two techniques. The input acoustic impedance of the vocal tract was investigated theoretically for cases of a vowel, bilabial occlusion (fully closed lips), a bilabial fricative, and artificially lengthening the tract with small diameter tubes. The results indicate that the tubes increase the input impedance in the range of the fundamental frequency of phonation by lowering the first formant frequency to nearly that of the bilabial occlusion (the lower bound on the first formant) while still allowing a continuous airflow. The bilabial fricative also has the effect of lowering the first formant frequency and increasing the low-frequency impedance, but not as effectively as the extension tubes.  相似文献   

5.
The biomechanics of medialization laryngoplasty are not well understood. An excised canine larynx model was used to test the effects of various sized silicon implants. The vocal fold length, position, and tension were measured. Medialization laryngoplasty did not affect vocal fold length. At the mid-membranous vocal fold, larger shims resulted in greater medialization and tension. Medialization laryngoplasty neither medialized nor stiffened the vocal process to resist lateralizing forces. We conclude that medialization laryngoplasty provides bulk and support for defects of the membranous region of the vocal fold, but does not appear to close a posterior glottal gap. The selection of a surgical procedure to treat glottal incompetence should take into account the unique biomechanical properties of the anterior (membranous vocal folds) and posterior (cartilaginous portion) glottis.  相似文献   

6.
Three-dimensional vocal tract shapes and consequent area functions representing the vowels [i, ae, a, u] have been obtained from one male and one female speaker using magnetic resonance imaging (MRI). The two speakers were trained vocal performers and both were adept at manipulation of vocal tract shape to alter voice quality. Each vowel was performed three times, each with one of the three voice qualities: normal, yawny, and twangy. The purpose of the study was to determine some ways in which the vocal tract shape can be manipulated to alter voice quality while retaining a desired phonetic quality. To summarize any overall tract shaping tendencies mean area functions were subsequently computed across the four vowels produced within each specific voice quality. Relative to normal speech, both the vowel area functions and mean area functions showed, in general, that the oral cavity is widened and tract length increased for the yawny productions. The twangy vowels were characterized by shortened tract length, widened lip opening, and a slightly constricted oral cavity. The resulting acoustic characteristics of these articulatory alterations consisted of the first two formants (F1 and F2) being close together for all yawny vowels and far apart for all the twangy vowels.  相似文献   

7.
SUMMARY: Acoustic pharyngometry evaluates the geometry of the vocal tract with acoustic reflections and provides information about vocal tract cross-sectional area and volume from lip to the glottis. Variations in vocal tract diameters are needed for speech scientists to validate various acoustic models and for medical professionals since the advent of endoscopic surgical techniques. Race is known to be one of the most important factors affecting the oral and nasal structures. This study compared vocal tract dimensions of White American, African American, and Chinese male and female speakers. One hundred and twenty healthy adult subjects with equal numbers of men and women were divided among three races. Subjects were controlled for age, gender, height, and weight. Six dimensional parameters of the speakers' vocal tract cavities were measured with acoustic reflection technology (AR). Significant gender and race main effects were found in certain vocal tract dimensions. The findings of this study now provide speech scientists, speech-language pathologists, and other health professionals with a new anatomical database of vocal tract variations for adult speakers from three different races.  相似文献   

8.
In this study, vocal tract area functions for one American English speaker, recorded using magnetic resonance imaging, were used to simulate and analyze the acoustics of vowel nasalization. Computer vocal tract models and susceptance plots were used to study the three most important sources of acoustic variability involved in the production of nasalized vowels: velar coupling area, asymmetry of nasal passages, and the sinus cavities. Analysis of the susceptance plots of the pharyngeal and oral cavities, -(B(p)+B(o)), and the nasal cavity, B(n), helped in understanding the movement of poles and zeros with varying coupling areas. Simulations using two nasal passages clearly showed the introduction of extra pole-zero pairs due to the asymmetry between the passages. Simulations with the inclusion of maxillary and sphenoidal sinuses showed that each sinus can potentially introduce one pole-zero pair in the spectrum. Further, the right maxillary sinus introduced a pole-zero pair at the lowest frequency. The effective frequencies of these poles and zeros due to the sinuses in the sum of the oral and nasal cavity outputs changes with a change in the configuration of the oral cavity, which may happen due to a change in the coupling area, or in the vowel being articulated.  相似文献   

9.
Glottal-pulse rate (GPR) and vocal-tract length (VTL) are related to the size, sex, and age of the speaker but it is not clear how the two factors combine to influence our perception of speaker size, sex, and age. This paper describes experiments designed to measure the effect of the interaction of GPR and VTL upon judgements of speaker size, sex, and age. Vowels were scaled to represent people with a wide range of GPRs and VTLs, including many well beyond the normal range of the population, and listeners were asked to judge the size and sex/age of the speaker. The judgements of speaker size show that VTL has a strong influence upon perceived speaker size. The results for the sex and age categorization (man, woman, boy, or girl) show that, for vowels with GPR and VTL values in the normal range, judgements of speaker sex and age are influenced about equally by GPR and VTL. For vowels with abnormal combinations of low GPRs and short VTLs, the VTL information appears to decide the sex/age judgement.  相似文献   

10.
Developmental characteristics of formant I (FI) and formant 2 (F2) are reported for spontaneous vocalizations produced by four young children. Each child was systematically sampled at between 15 and 36 months of age. Results indicated that both F1 and F2 remained relatively unchanged prior to 24 months of age. Significant decreases in average F1 and F2 occurred between 24 and 36 months. When F1 and F2 values were categorized according to tongue elevation and tongue advancement, the most significant changes were associated with high/back articulations. The pattern of formant frequencies noted in the present group of children appears to reflect developmental changes in vocal tract growth and reconfiguration.  相似文献   

11.
12.
A 53-year-old man with severe vocal fold atrophy underwent bilateral type 1 thyroplasty and anterior commissure advancement. Postoperatively, he developed a strained voice with less projection and volume than prior to surgery. This was verified by objective assessment of vocal function. Videoendoscopy revealed bilateral false vocal fold fullness and blunting of the anterior commissure. Magnetic resonance imaging demonstrated cephalic migration of the posterior ends of the implants and retrusion of the anterior commissure segment. Surgical exploration revealed that the type 1 implants had rotated and buckled. The anterior commissure segment was rotated and displaced inferiorly, and its inferior surface was tethered to the cricoid by scar tissue. The implants were removed, the anterior segment was repositioned and rigidly fixed, and bilateral lipoinjection performed. Vocal function was significantly improved, and endoscopy revealed normal tension and length of the vocal folds and restoration of the anterior commissure. This case demonstrates the importance of stable fixation during laryngeal framework surgery  相似文献   

13.

Objective

To analyze the vocal tract morphometry of women with vocal nodules (VN) compared with normal subjects by means of magnetic resonance imaging (MRI) at rest position.

Study Design

Prospective study.

Methods

The present research included 20 young adult women, aged 18–40 years: 10 dysphonic patients with VN and 10 normal subjects. All participants were tested using MRI; 12 measurements of the vocal tract were performed: nine in median sagittal section and three in axial section.

Results

The 12 measurements were smaller in the dysphonic group; statistical significance was obtained for three parameters: in the sagittal plane, the laryngeal vestibule area was significantly smaller in the dysphonic group, with P = 0.012∗ (∗ = statistical significance); in the axial section, the distance between the right and left vocal processes of the arytenoids’ cartilages and the distance between the anterior commissure of the glottis and the laryngeal posterior wall were also significantly lower in the dysphonic group, with P = 0.036∗ and 0.010∗, respectively. Significant differences in the vocal tract morphometry of individuals with VN were observed compared with normal subjects, at rest position.

Conclusions

Results obtained from this study suggest that patients with VN may present a constantly increased tension of the laryngeal muscles, even at rest; moreover, reduced anterior-posterior dimension of the larynx may be a morphological characteristic of patients with VN.  相似文献   

14.
Vocal tract area functions may contain quite abrupt changes in cross-sectional area. In formant frequency calculations for such area functions, an inner length correction (ILC) should be applied. The relevance of this correction was investigated by comparing acoustic measurements obtained from a physical model of the vocal tract with data gathered by means of computer simulations. Calculating formant frequencies without applying internal length corrections caused substantial errors, particularly for area functions representing apical stops just anterior to occlusion. Decentering and axial symmetry in the arrangement of the area elements of the physical model were briefly studied and found to have effects on the formant frequency values.  相似文献   

15.
The fundamental frequency of vocal fold oscillation (F(0)) is controlled by laryngeal mechanics and aerodynamic properties. F(0) change per unit change of transglottal pressure (dF/dP) using a shutter valve has been studied and found to have nonlinear, V-shaped relationship with F(0). On the other hand, the vocal tract is also known to affect vocal fold oscillation. This study examined the effect of artificially lengthened vocal tract length on dF/dP. dF/dP was measured in six men using two mouthpieces of different lengths. Results: The dF/dP graph for the longer vocal tract was shifted leftward relative to the shorter one. Conclusion: Using the one-mass model, the nadir of the "V" on the dF/dP graph was strongly influenced by the resonance around the first formant frequency. However, a more precise model is needed to account for the effects of viscosity and turbulence.  相似文献   

16.
In a re-analysis of x rays of speakers producing Swedish vowels, midsagittal pharyngeal dimensions were predicted from anterior tongue positions using procedures based on estimated tongue pellet positions. Principal component analysis was used to reduce the number of pellet degrees of freedom from eight to three prior to applying linear regression from these three independent variables to dependent vocal tract midsagittal cross distances. Except for the regions around the laryngopharynx and uvula, the pharynx dimensions were predictable from linear regressions and were significant at the p<0.05 level. Numerical experiments show that it is crucial to reduce the number of independent variables in tests of statistical significance.  相似文献   

17.
18.
An alternative and complete derivation of the vocal tract length sensitivity function, which is an equation for finding a change in formant frequency due to perturbation of the vocal tract length [Fant, Quarterly Progress and Status Rep. No. 4, Speech Transmission Laboratory, Kungliga Teknisha Hogskolan, Stockholm, 1975, pp. 1-14] is presented. It is based on the adiabatic invariance of the vocal tract as an acoustic resonator and on the radiation pressure on the wall and at the exit of the vocal tract. An algorithm for tuning the vocal tract shape to match the formant frequencies to target values, such as those of a recorded speech signal, which was proposed in Story [J. Acoust. Soc. Am. 119, 715-718 (2006)], is extended so that the vocal tract length can also be changed. Numerical simulation of this extended algorithm shows that it can successfully convert between the vocal tract shapes of a male and a female for each of five Japanese vowels.  相似文献   

19.
The purpose of this exploratory study was to determine if laryngeal transillumination in combination with stroboscopy (strobophotoglottography; SPGG) is useful for (1) the visualization of vocal fold vibration (VFV) opening patterns, (2) the localization of initial vocal fold opening in horizontal glottal thirds (anterior, midmembranous, and posterior), (3) determination of the temporal correspondence of the so-called electroglottography (EGG)-knee and initial vocal fold separation, and, finally, (4) automatized quantitative measurements of glottal area function within endoscopic images. With stroboscopic transillumination, initial inferior vocal fold separation was detectable during the "closed" phase, where the vocal folds were still closed in the upper portion and therefore initial inferior vocal fold separation could not be visualized with usual laryngoscopy techniques. In the horizontal plane within similar fundamental frequencies in modal voice registers in two male subjects, localization of initial glottal opening depended on the voice types used (soft, normal, or pressed phonation). We found zipperlike posterior-to-anterior openings, initial midmembranous openings, initial anterior openings, as well as simultaneous initial opening of all three portions in the two healthy male adults examined. This technique proved to add temporal and spatial information to vocal fold opening patterns and extends our examination techniques to the very beginning of vocal fold opening at the inferior portion. Simultaneous electroglottogram tracking and comparison with bidirectionally illuminated stroboscopic images revealed a time-locked correspondence of the EGG-knee with the aforementioned initial inferior vocal fold separation. Bidirectional illumination combined with digital color extraction techniques allowed for image separation of subglottally and supraglottally illuminated structures. This facilitated vocal fold contour detection and automatized image processing, for example, for determination of glottal area function, and is considered to be a further step to objective automatized quantitative measurements within endoscopic images.  相似文献   

20.
Computer models of the process of speech articulation require a detailed knowledge of the vocal tract configurations employed in speech and the application of acoustic theory to calculate the sound waveform. Almost all currently available data on vocal tract dimensions come from x-ray films and are severely limited in quantity and coherence due to restrictions on radiation dosage and intersubject differences. We are using MRI techniques to obtain the pharyngeal dimensions of speakers producing sustained vowels. The fact that MRI does not employ ionizing radiation provides speech research with the opportunity to obtain comprehensive bodies of much-needed data on the articulatory characteristics of single subjects.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号