Similar articles
Found 20 similar articles (search time: 765 ms)
1.
Strained, strangled, and tremulous vocal qualities that are typically seen in adductor spasmodic dysphonia (ADSD), voice tremor (Tremor), and the spastic dysarthria of amyotrophic lateral sclerosis (ALS) may sound similar and be difficult to differentiate. The purpose of this study was to determine if these vocal qualities of neurologic origin could be differentiated on the basis of acoustic and motor speech parameters. Three groups of subjects (ADSD, ALS, and Tremor) were analyzed by the Motor Speech Profile System (Kay Elemetrics, Lincoln Park, NJ) for fundamental frequency (Fo), standard deviation of Fo, diadochokinetic rate (ddk), standard deviation of ddk, mean intensity and standard deviation of ddk, frequency and amplitude variability in connected speech, and speaking rate in connected speech. Profiles of the three groups are presented with the significant features that differentiated one from the other.

2.
In this paper, the acoustic-phonetic characteristics of steady apical trills (trill sounds produced by the periodic vibration of the apex of the tongue) are studied. Signal processing methods, namely, zero-frequency filtering and zero-time liftering of speech signals, are used to analyze the excitation source and the resonance characteristics of the vocal tract system, respectively. Although it is natural to expect trilling to affect the resonances of the vocal tract system, it is interesting to note that trilling influences the glottal source of excitation as well. The excitation characteristics derived using zero-frequency filtering of speech signals are glottal epochs, strength of impulses at the glottal epochs, and instantaneous fundamental frequency of the glottal vibration. Analysis based on zero-time liftering of speech signals is used to study the dynamic resonance characteristics of the vocal tract system during the production of trill sounds. Qualitative analysis of trill sounds in different vowel contexts and the acoustic cues that may help in spotting trills in continuous speech are discussed.

3.
Acoustic measurements believed to reflect glottal characteristics were made on recordings collected from 21 male speakers. The waveforms and spectra of three nonhigh vowels (/æ, ʌ, ɛ/) were analyzed to obtain acoustic parameters related to first-formant bandwidth, open quotient, spectral tilt, and aspiration noise. Comparisons were made with previous results obtained for 22 female speakers [H. M. Hanson, J. Acoust. Soc. Am. 101, 466-481 (1997)]. While there is considerable overlap across gender, the male data show lower average values and less interspeaker variation for all measures. In particular, the amplitude of the first harmonic relative to that of the third formant is 9.6 dB lower for the male speakers than for the female speakers, suggesting that spectral tilt is an especially significant parameter for differentiating male and female speech. These findings are consistent with fiberscopic studies which have shown that males tend to have a more complete glottal closure, leading to less energy loss at the glottis and less spectral tilt. Observations of the speech waveforms and spectra suggest the presence of a second glottal excitation within a glottal period for some of the male speakers. Possible causes and acoustic consequences of these second excitations are discussed.

4.
To test the effects of different sources of tremor on the voice, tremor was simulated by external rhythmic perturbation of structures at the subglottal, glottal, and supraglottal levels in 10 healthy subjects. The acoustic and airflow signals simultaneously recorded during sustained phonation in the normal and the 3 simulated tremor conditions were analyzed and compared. Voice measures included: fundamental frequency, 2 short-term perturbation measures (jitter and shimmer), and 3 long-term tremor measures (prominence ratios of the spectral peaks of the acoustic frequency contour, acoustic amplitude contour, and airflow contour). Measures of fundamental frequency and percent shimmer were not significantly affected by the simulated tremors. Measures of percent jitter and the amplitudes of the long-term frequency and amplitude modulations were most prominently increased when respiratory drive was perturbed by simulated tremor. Spectral analysis of the acoustic amplitude contour was most useful in distinguishing the 3 sites of simulated tremor.
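
The abstract above reports percent jitter and percent shimmer but does not say how they were computed. As a rough illustration only, the sketch below uses one common "local" definition (mean absolute difference between consecutive cycles, divided by the mean, times 100). This definition is an assumption rather than the study's stated method, and the function name and sample values are hypothetical.

```python
def percent_perturbation(values):
    """Mean absolute cycle-to-cycle difference as a percentage of the mean value.

    Assumed "local" definition; the abstract does not state the study's formula.
    """
    if len(values) < 2:
        raise ValueError("need at least two cycles")
    diffs = [abs(a - b) for a, b in zip(values, values[1:])]
    mean_diff = sum(diffs) / len(diffs)
    mean_value = sum(values) / len(values)
    return 100.0 * mean_diff / mean_value


# Hypothetical measurements from four consecutive glottal cycles.
periods_ms = [5.0, 5.1, 4.9, 5.0]        # cycle lengths (jitter input)
peak_amplitudes = [1.0, 0.9, 1.1, 1.0]   # per-cycle peak amplitudes (shimmer input)

jitter_percent = percent_perturbation(periods_ms)
shimmer_percent = percent_perturbation(peak_amplitudes)
```

With these made-up values, the same function applied to cycle lengths gives a jitter of roughly 2.67% and applied to peak amplitudes gives a shimmer of roughly 13.3%. Both are short-term measures, in contrast to the long-term tremor measures, which the abstract derives from spectral peaks of the frequency, amplitude, and airflow contours.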

5.
Journal of Voice, 2023, 37(3): 444-451
Objective: A single injection of basic fibroblast growth factor (bFGF) into the vocal folds of patients with glottal insufficiency has been shown to be effective for a few years. However, the long-term therapeutic effect of a single injection of bFGF into the vocal folds has yet to be demonstrated. In this study, the therapeutic effect of a single injection of bFGF into the vocal folds was investigated over several years by monitoring patients for 36 months following this treatment. Methods: Nineteen patients with glottal insufficiency received injections of bFGF diluted to 20 μg/mL in the superficial layer of the lamina propria of the bilateral vocal folds. The following parameters were evaluated at preinjection baseline and 6, 12, 18, 24, and 36 months later, and statistical comparisons were performed. The parameters evaluated were: the Grade, Rough, Breathy, Asthenic, and Strained (GRBAS) scale score; maximum phonation time; acoustic analysis; and glottal wave analysis (GWA) and kymograph edge analysis (KEA) using high-speed digital imaging (HSDI). The amplitude perturbation quotient (APQ) and period perturbation quotient (PPQ) were measured by acoustic analysis. The mean minimum glottal area during vocalization and mean minimum distance between the vocal folds were measured by GWA. The amplitudes of the bilateral vocal folds were measured by KEA. Results: Postinjection, the GRBAS scale score decreased from 6 months after injection, and maximum phonation time was prolonged. The mean minimum glottal area during vocalization and the mean minimum distance between the vocal folds calculated by GWA of HSDI decreased significantly after 6 months. These effects persisted until 36 months postinjection. APQ and PPQ derived from acoustic analysis tended to decrease, but not significantly. There was no clear change in the amplitudes of the bilateral vocal folds calculated by KEA of HSDI before and after injection. Conclusions: These results suggest that the effects of a single injection of bFGF into the vocal folds persist for 36 months.

6.
Acoustic duration and degree of vowel reduction are known to correlate with a word's frequency of occurrence. The present study broadens the research on the role of frequency in speech production to voice assimilation. The test case was regressive voice assimilation in Dutch. Clusters from a corpus of read speech were more often perceived as unassimilated in lower-frequency words and as either completely voiced (regressive assimilation) or, unexpectedly, as completely voiceless (progressive assimilation) in higher-frequency words. Frequency did not predict the voice classifications over and above important acoustic cues to voicing, suggesting that the frequency effects on the classifications were carried exclusively by the acoustic signal. The duration of the cluster and the period of glottal vibration during the cluster decreased while the duration of the release noises increased with frequency. This indicates that speakers reduce articulatory effort for higher-frequency words, with some acoustic cues signaling more voicing and others less voicing. A higher frequency leads not only to acoustic reduction but also to more assimilation.

7.
In dynamical motor theory, skill acquisition occurs as a modification of preexisting coordination patterns or attractor states. The purpose of this study was to assess how different levels of voice onset, voice quality, and fundamental frequency (F0) combine to form the attractor states common to voice motor control. Three levels of voice onset (glottal, simultaneous, and breathy), voice quality (modal speech, mixed, and falsetto), and fundamental frequency (low, mid, and high) were manipulated by vocally untrained, female subjects. Percent correct of acquisition trials and self-report of effort were used as measures of stable phonations indicative of an attractor state. Using intensity as a covariate, the results provided support for two of the three predicted triads representing attractor states in female speakers: (1) glottal onset/modal speech quality/low F0; and (2) breathy onset/falsetto quality/high F0. The results of this study suggest that certain parameters of voice motor control, such as onset, quality, and F0, exist as part of a dynamical system that can be identified and manipulated in voice motor acquisition and learning.

8.
This study presents stochastic models of jitter. Jitter designates small, random, involuntary perturbations of the glottal cycle lengths. Jitter is a base-line phenomenon that may be observed in all voiced speech sounds. Knowledge of its properties is therefore relevant to the acoustic modeling, analysis, and synthesis of voice quality. Also, models of jitter are conceptual frameworks that enable experimenters and clinicians to distinguish jitter in particular from aperiodic cycle length patterns in general. Vocal jitter is modeled by means of the ribbon model of the glottal vibration combined with stochastic models of the disturbances of the instantaneous frequency. The disturbance model comprises correlation-free noise and vocal microtremor. Properties of jitter that are simulated are the stochasticity, stationarity, and normality of the decorrelated cycle length perturbations, the size of decorrelated jitter, the correlation between the perturbations of neighboring glottal cycles, the modulation level and modulation frequency owing to microtremor, the asynchrony between external disturbances and glottal cycles, the dependence of the size of jitter on the average glottal cycle length, and the relation between jitter and laryngeal pathologies. Modeled jitter is discussed in the light of measured jitter, as well as the physiological and statistical plausibility of the model parameters.

9.
Using acoustic analysis techniques, Waldstein [J. Acoust. Soc. Am. 88, 2099-2114 (1990)] reported abnormal speech findings in postlingual deaf speakers. She interpreted her findings to suggest that auditory feedback is important in motor speech control. However, it is argued here that Waldstein's interpretation may be unwarranted without addressing the possibility of neurologic deficits (e.g., dysarthria) as confounding (or even primary) causes of the abnormal speech in her subjects.

10.
The purpose of this study was to evaluate the effects of bilateral botulinum toxin injection into the thyroarytenoid (TA) muscles of a patient with essential voice tremor. Acoustic and aerodynamic data were collected weekly over a 16-week period. Flexible nasolaryngoscopy was performed prior to injection and 2, 6, 10, and 16 weeks postinjection. Perceptual analyses of the acoustic and nasolaryngoscopic data were performed. A reduction in frequency tremor and, to a lesser extent, amplitude tremor was observed during the 1-10 week period. Estimated laryngeal resistance decreased after injection and was accompanied in perceptual measures by a reduction in vocal effort, laryngeal tremor, and supraglottic hyperfunction. Essential voice tremor can be successfully attenuated with bilateral percutaneous injection of botulinum toxin A into the vocalis muscle.

11.
SUMMARY: The aim of this study was to investigate how different acoustic parameters, extracted both from speech pressure waveforms and glottal flows, can be used in measuring vocal loading in modern working environments and how these parameters reflect the possible changes in the vocal function during a working day. In addition, correlations between objective acoustic parameters and subjective voice symptoms were addressed. The subjects were 24 female and 8 male customer-service advisors, who mainly use the telephone during their working hours. Speech samples were recorded from continuous speech four times during a working day and voice symptom questionnaires were completed simultaneously. Among the various objective parameters, only F0 resulted in a statistically significant increase for both genders. No correlations between the changes in objective and subjective parameters appeared. However, the results encourage researchers within the field of occupational voice use to apply versatile measurement techniques in studying occupational voice loading.

12.
Journal of Voice, 2019, 33(6): 945.e19-945.e25
Three electroglottographic parameters (fundamental frequency, contact quotient, and speed quotient) were analyzed for two singers of the Young girl role in Kunqu Opera. Each singer performed three conditions: singing, stage speech, and reading lyrics. The phonation types adopted in different conditions were explored based on electroglottographic parameters. Fundamental frequency, contact quotient, and speed quotient showed different distributions among conditions. Five phonation types were used in singing and stage speech: (1) breathy voice, (2) modal voice with low degree of posterior glottal adduction, (3) modal voice, (4) falsetto, and (5) falsetto with high degree of posterior glottal adduction. The phonation strategies partly showed differences between singers. Different phonation type collocations were employed in singing and stage speech. The relationship between phonation types and pitch was complex. The phonation types actually used were different from and more complex than those in traditional Kunqu Opera singing theory.

13.
SUMMARY: Inverse filtering (IF) is a common method used to estimate the source of voiced speech, the glottal flow. This investigation aims to compare two IF methods: one manual and the other semiautomatic. Glottal flows were estimated from speech pressure waveforms of six female and seven male subjects producing sustained vowel /a/ in breathy, normal, and pressed phonation. The closing phase characteristics of the glottal pulse were parameterized using two time-based parameters: the closing quotient (ClQ) and the normalized amplitude quotient (NAQ). The information given by these two parameters indicates a strong correlation between the two IF methods. The results are encouraging in showing that the parameterization of the voice source in different speech sounds can be performed independently of the technique used for inverse filtering.
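
The abstract above names the two closing-phase parameters without defining them. The sketch below is illustrative only and assumes the definitions commonly used in the voice-source literature: the closing quotient is the closing-phase duration divided by the cycle length, and NAQ is the peak-to-peak flow amplitude divided by the product of the cycle length and the magnitude of the negative peak of the flow derivative. The function name, the sampling rate, and the triangular pulse are hypothetical and are not taken from the study.

```python
def closing_phase_parameters(flow, fs):
    """Return (NAQ, closing quotient) for one glottal cycle.

    flow: samples of an estimated glottal flow pulse covering exactly one cycle
    fs:   sampling rate in Hz
    Definitions are assumed (see the text above), not taken from the abstract.
    """
    period = len(flow) / fs
    ac_amplitude = max(flow) - min(flow)

    # First difference scaled by fs approximates the flow derivative.
    derivative = [(b - a) * fs for a, b in zip(flow, flow[1:])]
    d_peak = -min(derivative)  # magnitude of the negative derivative peak
    if d_peak <= 0:
        raise ValueError("flow never decreases; no closing phase found")
    naq = ac_amplitude / (d_peak * period)

    # Closing phase: from the flow maximum to the first return to minimum flow.
    peak_index = flow.index(max(flow))
    floor = min(flow)
    closure_index = next(i for i in range(peak_index, len(flow)) if flow[i] <= floor)
    closing_quotient = (closure_index - peak_index) / len(flow)
    return naq, closing_quotient


# Hypothetical triangular pulse: slow opening, fast linear closing, closed phase.
pulse = [0.0, 0.2, 0.4, 0.6, 0.8, 1.0, 0.5, 0.0, 0.0, 0.0]
naq_example, clq_example = closing_phase_parameters(pulse, fs=1000.0)
```

For this idealized pulse the closing phase is linear, so both values come out at 0.2. On real inverse-filtered flows the two differ, because NAQ is computed from amplitude-domain quantities and does not require locating the instant of closure, while the closing quotient depends on where that instant is marked.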

14.
Several types of measurements were made to determine the acoustic characteristics that distinguish between voiced and voiceless fricatives in various phonetic environments. The selection of measurements was based on a theoretical analysis that indicated the acoustic and aerodynamic attributes at the boundaries between fricatives and vowels. As expected, glottal vibration extended over a longer time in the obstruent interval for voiced fricatives than for voiceless fricatives, and there were more extensive transitions of the first formant adjacent to voiced fricatives than for the voiceless cognates. When two fricatives with different voicing were adjacent, there were substantial modifications of these acoustic attributes, particularly for the syllable-final fricative. In some cases, these modifications lead to complete assimilation of the voicing feature. Several perceptual studies with synthetic vowel-consonant-vowel stimuli and with edited natural stimuli examined the role of consonant duration, extent and location of glottal vibration, and extent of formant transitions on the identification of the voicing characteristics of fricatives. The perceptual results were in general consistent with the acoustic observations and with expectations based on the theoretical model. The results suggest that listeners base their voicing judgments of intervocalic fricatives on an assessment of the time interval in the fricative during which there is no glottal vibration. This time interval must exceed about 60 ms if the fricative is to be judged as voiceless, except that a small correction to this threshold is applied depending on the extent to which the first-formant transitions are truncated at the consonant boundaries.

15.
Vocal quality factors: analysis, synthesis, and perception. (Total citations: 4; self-citations: 0; citations by others: 4)
The purpose of this study was to examine several factors of vocal quality that might be affected by changes in vocal fold vibratory patterns. Four voice types were examined: modal, vocal fry, falsetto, and breathy. Three categories of analysis techniques were developed to extract source-related features from speech and electroglottographic (EGG) signals. Four factors were found to be important for characterizing the glottal excitations for the four voice types: the glottal pulse width, the glottal pulse skewness, the abruptness of glottal closure, and the turbulent noise component. The significance of these factors for voice synthesis was studied and a new voice source model that accounted for certain physiological aspects of vocal fold motion was developed and tested using speech synthesis. Perceptual listening tests were conducted to evaluate the auditory effects of the source model parameters upon synthesized speech. The effects of the spectral slope of the source excitation, the shape of the glottal excitation pulse, and the characteristics of the turbulent noise source were considered. Applications for these research results include synthesis of natural sounding speech, synthesis and modeling of vocal disorders, and the development of speaker independent (or adaptive) speech recognition systems.

16.
A single subject design was used to determine whether pressure threshold training strengthens the inspiratory muscles, diminishes dyspnea, and improves parameters of speech in a subject with a limited glottal airway. The subject was a 19-year-old woman whose glottal airway was limited due to bilateral abductor vocal fold paralysis following a thyroidectomy. A 5-week inspiratory muscle strength-training program was implemented using a pressure-threshold trainer to strengthen the inspiratory muscles with the intent of enabling the generation of higher inspiratory pressures. The pressure threshold on the trainer was set at 75% of the subject's maximum inspiratory pressure (MIP). The subject was required to generate sufficient inspiratory pressure to bring air through the trainer during an inspiratory maneuver. MIP was the dependent variable used as an indication of inspiratory muscle strength. MIP increased by 47% following the training program. Maximal minute ventilation and oxygen uptake increased posttraining. Dyspnea during exercise and speech decreased as reported by the subject. Total reading duration and pause duration demonstrated a declining trend during connected speech. The results indicated that inspiratory muscle training using a pressure threshold device improves functional tasks such as exercise and speech in a subject with upper airway limitation.

17.
This paper describes acoustic cues for classification of consonant voicing in a distinctive feature-based speech recognition system. Initial acoustic cues are selected by studying consonant production mechanisms. Spectral representations, band-limited energies, and correlation values, along with Mel-frequency cepstral coefficient features (MFCCs), are also examined. Analysis of variance is performed to assess the relative significance of features. Overall, 82.2%, 80.6%, and 78.4% classification rates are obtained on the TIMIT database for stops, fricatives, and affricates, respectively. Combining acoustic parameters with MFCCs shows performance improvement in all cases. Also, performance on NTIMIT telephone-channel speech shows that acoustic parameters are more robust than MFCCs.

18.
A comparison of type I thyroplasty and arytenoid adduction (Total citations: 1; self-citations: 0; citations by others: 1)
Glottal incompetence is a common laryngeal disorder causing impaired swallowing and phonation. The resultant voice has been characterized as weak and breathy with a restricted pitch range. Currently, medialization thyroplasty and arytenoid adduction are two of the surgical treatments for patients with glottal incompetence. However, few studies have evaluated the changes in objective measures of speech with type I thyroplasty and arytenoid adduction. In this study, 59 patients with glottal incompetence underwent either type I thyroplasty or arytenoid adduction. Acoustic (jitter, shimmer, and harmonics-to-noise ratio) and aerodynamic (airflow, subglottic pressure, and glottal resistance) measures were obtained both pre- and postoperatively. No significant differences were found among acoustic or aerodynamic measures for operation type. However, a significant pre/postsurgery effect was observed for translaryngeal airflow. In addition, no significant differences were found among the measures for patients with traditional compared with nontraditional operative indications. Patients who developed glottal insufficiency due to previous laryngeal surgery (e.g., vocal fold stripping) demonstrated no statistically significant improvement in acoustic or aerodynamic measures following thyroplasty or arytenoid adduction.

19.
20.
The voice source is dominated by aeroacoustic sources downstream of the glottis. In this paper an investigation is made of the contribution to voiced speech of secondary sources within the glottis. The acoustic waveform is ultimately determined by the volume velocity of air at the glottis, which is controlled by vocal fold vibration, pressure forcing from the lungs, and unsteady backreactions from the sound and from the supraglottal air jet. The theory of aerodynamic sound is applied to study the influence on the fine details of the acoustic waveform of "potential flow" added-mass-type glottal sources, glottis friction, and vorticity either in the glottis-wall boundary layer or in the portion of the free jet shear layer within the glottis. These sources govern predominantly the high frequency content of the sound when the glottis is near closure. A detailed analysis performed for a canonical, cylindrical glottis of rectangular cross section indicates that glottis-interior boundary/shear layer vortex sources and the surface frictional source are of comparable importance; the influence of the potential flow source is about an order of magnitude smaller.


Copyright©北京勤云科技发展有限公司  京ICP备09084417号