首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 15 毫秒
1.
In this paper, we investigated the acoustic characteristics of sustained and running vowels from normal subjects and patients with laryngeal pathologies. Perturbation methods (including jitter and shimmer), signal-to-noise ratio (SNR), and nonlinear dynamic methods (such as correlation dimension and second-order entropy) were used to analyze sustained and running vowels. We found that the sustained vowels and running voices from normal subjects and patients with laryngeal pathologies had low-dimensional dynamic characteristics. For sustained vowels, the analyses of jitter, shimmer, correlation dimension, and second-order entropy revealed significant differences between normal and pathological voices. For running voices, jitter and shimmer did not statistically discriminate between normal and pathological voices, but a significant difference was found for SNR, correlation dimension, and second-order entropy. The results suggest that nonlinear dynamic analysis and traditional SNR analysis may be valuable for the analysis of sustained and running vowels; perturbation analysis may be applicable for the analysis of sustained vowels but should be applied with caution for running voice analysis.  相似文献   

2.
3.
Most studies investigating the effect of sex hormones on the larynx and vocal folds focused on the voice quality of women either around menopause or during the menstrual cycle. To our knowledge, however, there have been no studies that investigated the effect of oral contraceptives on the female voice. In the present study five women who ingest oral contraceptives (pill group) and five women who do not (natural group) were recorded producing the vowels /i/ and /a/ repeatedly over a period of 40 days. Acoustic analyses were performed on these recordings including F0, amplitude, jitter, shimmer, and harmonic-to-noise ratio (HNR). Results indicated that jitter and shimmer values of the pill group were significantly lower than those of the natural group. No group differences were found for F0, amplitude, or HNR. In addition, the pill group demonstrated significantly smaller variance for all variables tested. The results suggest that oral contraceptives might increase voice stability associated with smaller hormonal changes. Thus the present study provides preliminary evidence of the effect of oral contraceptives on the female voice.  相似文献   

4.
The harmonics-to-noise ratio (HNR) was proposed for measurement of noise in sustained vowels. A generalized algorithm is described here for removing the dependence on the data offset and for reducing the influence of jitter and shimmer. In addition, a new algorithm is proposed that accommodates periodic perturbations. Formulations are presented that can be computed in a single pass through the data under certain assumptions, thus simplifying the implementation of an efficient analysis program. A mathematical analysis of the effect of small errors in pitch-period demarcation is presented. The analysis indicates that the sensitivity to demarcation errors depends on the preponderance of high-frequency components, and significant underestimation is predicted for /a/ and /i/ data. A method for correcting quantization and demarcation errors in pitch-period markers is described. The method is shown to be effective at reducing demarcation errors for all but the most severely perturbed waveforms. Examples of the use of this method for HNR estimation are provided. Finally, a comparison with recent work by Milenkovic [J. Speech Hear. Res. 30, 529-538 (1987)] is presented.  相似文献   

5.
The measurement of the harmonics-to-noise ratio (HNR) in speech signals gives an indication of the aperiodicity of the speech waveform. This may be due to the presence of jitter, shimmer, additive noise, waveshape change, or some unknown combination of these factors. In order to estimate the HNR as a measure of the additive noise component only, the contaminating effects of the other contributory components must first be removed. A pitch synchronous harmonic analysis is proposed to overcome this problem. The procedure takes advantage of the time scale compression-frequency expansion property of the Fourier series in order to eliminate jitter and shimmer. Successive spectra are added by harmonic number as opposed to frequency location, and perturbation is removed due to the fact that the relative heights of the harmonic components remain the same for scaled signals. The technique is examined on synthetically generated voice signals. A discussion of the results is given in terms of human voice signals, characterization of jitter, vocal tract filtering effects, perturbation mechanisms, nonlinear dynamics, and the possibility of developing the method for use with inverse filtering strategies.  相似文献   

6.
Modulation of the acoustic amplitude and fundamental frequency of a sustained vowel across the heart cycle was examined via signal-averaging. Ten normal young and ten normal elderly men prolonged phonations of the vowel /a/. Consistent with previous studies, the young men's maximal heart-beat-related frequency and amplitude variations averaged 1.0% and 8.4% of their respective means. Such modulation was estimated to account for 6.6% of the absolute jitter and 11.3% of the shimmer measured in these voice samples. The extent of this systematic variation was significantly greater in the older voices, averaging 2.4% of the mean frequency and 15.4% of the mean amplitude and was estimated to account for approximately 12.9% and 15.8% of their mean absolute jitter and shimmer, respectively. This age-related difference is thought to be a manifestation of involutional changes in laryngeal vascular and soft tissues and in ventilatory biomechanics.  相似文献   

7.
SUMMARY: Because of the aperiodicity of many tracheoesophageal voices, acoustic analysis of the tracheoesophageal voice is less straightforward than that of the normal voice. This study presents the development and testing of an acoustic signal typing system based on visual inspection of a narrow-band spectrogram that can be used by researchers for classification of voice quality in tracheoesophageal speech. In addition to this classification system, a selection of acoustic measures [median fundamental frequency, standard deviation of fundamental frequency, jitter, percentage of voiced (%Voiced), harmonics-to-noise ratio (HNR), glottal-to-noise excitation (GNE) ratio, and band energy difference (BED)] was computed to provide more insight into the acoustic components of tracheoesophageal voice quality. For clinical relevance, relationships between the acoustic signal types and an overall judgment of the voice were investigated as well. Results showed that the four acoustic signal types form a good basis for performing more acoustic analyses and give a good impression of the overall quality of the voice.  相似文献   

8.
This study investigates the relationship between rough voice and the presence of Subharmonics, which correspond to smaller yet distinct peaks located between two consecutive harmonic peaks in the power spectrum. Spectrum analysis was undertaken in 389 pathologic voices, of which 20 had subharmonics. Although all 20 voices had roughness perceptually, 8 had normal jitter and/or shimmer. The degree of roughness had a significant inverse relationship with the frequency of subharmonics. By digital signal processing, sound samples with various types of subharmonics were synthesized and perceptually analyzed. Power and frequency of subharmonics in the synthesized sound also had significant relationships with the degree of roughness. Rough voice is acoustically characterized not only by jitter and shimmer but also by the presence of subharmonics in the power spectrum. Subharmonics are important acoustic properties for objective evaluation of rough voices.  相似文献   

9.
《Journal of voice》2020,34(2):301.e7-301.e11
BackgroundAdequate phonation is self-regulated by auditory feedback. Children with bilateral profound hearing loss (PHL) lack this feedback resulting in abnormal voice. Adequate hearing aid use and auditory-verbal therapy (AVT) may improve voice quality in deaf children.ObjectiveTo study whether hearing aid use and AVT approach improve acoustic parameters of voice of children with bilateral PHL.Materials and methodsNineteen children with bilateral PHL were studied. Age range 2–5 years (X = 53.04 months; SD = 9.54). All children were fitted with hearing aids according to auditory testing and they underwent a 1-year auditory habilitation period using the AVT approach. Acoustic analysis of voice including F0, shimmer, and jitter was performed at the onset and at the end of the auditory habilitation period. Final acoustic data were compared to a matched control group of 19 children, age range 2–5 years (X = 52.85; SD = 9.74) with normal hearing.ResultsMean fundamental frequency (F0) was significantly increased after AVT intervention. Shimmer and jitter significantly (P < 0.05) improved after the intervention period. However, despite the improvements, mean F0 at the end of the intervention period was still significantly (P < 0.05) decreased as compared to controls. Also, mean shimmer and jitter at the end of the habilitation period were still significantly (P < 0.05) higher as compared to controls.ConclusionsThe results of this preliminary study suggest that hearing aid use and auditory habilitation with AVT approach improved acoustic voice parameters of children with PHL. However, acoustic parameters persisted abnormal as compared to matched normal hearing controls. AVT approach and regular hearing aid use seem to be safe and reliable clinical tools for improving voice quality of children with PFL.  相似文献   

10.
Harmonics-to-Noise Ratio: An Index of Vocal Aging   总被引:3,自引:0,他引:3  
Distinguishing between vocal changes that occur with normal aging and those that are associated with disease is an important goal of research in voice. Several acoustic measures have been used in an attempt to illuminate the integrity of the vocal mechanism, including harmonics-to-noise ratio (HNR), jitter, and fundamental frequency (F0). HNR is a measure that quantifies the amount of additive noise in the voice signal; jitter reflects the periodicity of vocal fold vibration. In this study, measures of HNR, jitter and F0 were used to compare vocal function in three groups of normally speaking women: young adults, middle-aged adults, and elderly adults. Significant differences in HNR emerged between the elderly women and the other two groups. F0 differences were also apparent between the elderly group and the two younger groups; there were no significant differences in jitter between the three groups. HNR was found to be a more sensitive index of vocal function than jitter. The significant lowering of HNR evident in the elderly speakers may be attributable in part to medications taken by the majority of these elderly subjects.  相似文献   

11.
The aim of the study was to identify the acoustic correlates of female teachers' subjective voice complaints by recording their voices in their working environment. The subjects made recordings during lessons (N = 10) and breaks (N = 11). The subjects were divided into 2 groups: those with few voice complaints (FC group) and those with many voice complaints (MC group). The speech sample made in the breaks was maximally sustained /a/, from which fundamental frequency (F0), jitter, and shimmer were analyzed. The classroom samples were analyzed for F0, sound pressure level (SPL), and F0 time (the active vibration time of the vocal folds). Additionally, an index for assessing voice loading is presented. The results revealed a tendency of the MC group to have higher F0 and lower SPL and perturbation values than the FC group. The index values correlated moderately with the subjective vocal complaints.  相似文献   

12.
The purpose was to determine the clinical value of a multiparametric objective voice evaluation protocol including acoustic and aerodynamic parameters measured mainly on a sustained /a/. This was done by comparison with perceptual analysis of continuous speech by a jury composed of 6 experienced listeners. Voice samples (continuous speech) from 63 male patients with dysphonia and 21 control subjects with normal voices were recorded and assesed by a jury of listeners. The jury was instructed to classify voice samples according to the G (overall dysphonia) component of the GRBAS score on a 4-point scale ranging from 0 for normal to 3 for severe dysphonia. Objective parameters were recorded on an EVA® workstation. As usual with this type of system, parameters were measured mainly on a sustained /a/. Measured parameters included fundamental frequency (F0), intensity, jitter, shimmer, signal-to-noise ratio, Lyapunov coefficient (LC), oral airflow (OAF), maximum phonatory time (MPT), and vocal range (range). Estimated subglottic pressure (ESGP) was determined on a series of /pa/. Discriminant analysis was performed to detect correlation between jury classification and combinations of parameters. Results showed that a nonlinear combination of only six parameters (range, LC, ESGP, MPT, signal-to-noise ratio, and F0) allowed 86% concordance with jury classification. Discussion deals with the relative importance of the different objective parameters for discriminant analysis. Special emphasis is placed on two measurements rarely made in routine clinical workup, i.e., estimated subglottic pressure and Lyapunov coefficient.  相似文献   

13.
Many persons with Parkinson's disease (PD) will eventually experience vocal impairment as their condition advances. Using standard perturbation analyses (parameters like jitter and shimmer) to measure fluctuations in phonatory signal may inhibit researchers from recognizing severely disordered patterns that seem to be present in the voices of some PD patients. Nonlinear dynamic analysis can quantify these aperiodic patterns, which indicate severe pathology that is usually characterized perceptually by hoarseness. Here, sustained vowel phonations of a heterogeneous group of PD subjects (20 women and 21 men) were compared with those of a control group (22 women and 18 men) based on results of nonlinear dynamic analyses (D(2)) and perturbation analyses. Results showed PD subjects as a whole to have significantly higher D(2) values than control subjects (P = 0.016), which indicates increased signal complexity in PD vocal pathology. Differences in the comparison of these two groups were significant in jitter (P = 0.014) but nonsignificant in shimmer (P = 0.695). Furthermore, the performance on these three measures was affected by subject sex. Nonlinear dynamic analysis showed significantly higher D(2) in the female PD group than in the female control group (P = 0.001), but jitter and shimmer did not show such a difference. The male PD group had statistically higher jitter than the male control group (P = 0.036), but these groups did not differ in D(2) or shimmer. Overall, nonlinear dynamic analysis may be a valuable method for the diagnosis of Parkinsonian laryngeal pathology.  相似文献   

14.
This study was designed to investigate objective voice quality measurements in unilateral vocal fold paralysis (UVFP) by eliminating intersubject variability. To our knowledge this is the first report objectively analyzing paralytic dysphonia as compared to the same voice before onset of UVFP. The voices of two male subjects were prospectively recorded before and after the onset of iatrogenic UVFP (thoracic surgery).The following acoustic measurements of the vowel /a/ were performed using the CSL and MDVP (Kay Elemetrics): jitter, shimmer, harmonics-to-noise ratio, cepstral peak prominence, the relative energy levels of the first harmonic, the first formant and the third formant, the spectral slope in the low-frequency zone (0-1 kHz and 0-2 kHz), and the relative level of energy above 6 kHz. Distribution of spectral energy was analyzed from a long-term average spectrum of 40 seconds of text. Laryngeal aerodynamic measurements were obtained for one patient before and after onset of paralysis using the Aerophone II (Kay Elemetrics). Pitch and amplitude perturbation increased secondary to UVFP, while the harmonics-to-noise ratio and the cepstral peak prominence decreased. A relative increase in the mid-frequency and high-frequency ranges and a decrease in the low-frequency spectral slope were observed. Mean airflow rate and intraoral pressure increased, and glottal resistance and vocal efficiency decreased secondary to UVFP. The findings of this self-paired study confirm some but not all the results of previous studies. Measures involving the fundamental and the formants did not corroborate previous findings. Further investigation with vocal tract modeling is warranted.  相似文献   

15.
Disruption of auditory feedback such as masking has been shown to influence vocal production. A reliable finding is an increase in intensity level; an increase in fundamental frequency (F0) is a less robust finding. Research is lacking concerning the effects of auditory masking on measures of phonatory stability such as jitter and harmonics-to-noise ratio (HNR). This study investigated changes in intensity, F0, jitter, and HNR in 22 normally speaking college aged women. Subjects produced the vowel /a/ under three conditions: no masking level (0-dB ML), 50-dB ML, and 80-dB ML. Significant differences between conditions emerged for intensity; means for the other measures were not significantly different. Intraindividual differences between conditions for each variable are discussed in the framework of auditory versus kinesthetic feedback.  相似文献   

16.
17.
Accuracy of acoustic voice analysis is influenced by the quality of recording. Lately, articles have suggested that soundcards perform equivalently to specialized professional-grade data acquisition (DA) systems. The purpose of this study was to investigate the influence of DA environment (DA system and microphone) on acoustic voice quality measurement (VQM) while balancing for gender, age, intersubject and intrasubject variability, and analysis software. More specifically, the relative performance of different hardware environments and the relationship between their technical characteristics and VQM performance was investigated. The discretization error and the effective dynamic range of the different DA environments were measured. We used 3 software systems to record and measure separately 2000 acoustic samples of sustained phonation for fundamental frequency, jitter, and shimmer. Analyses of variance (ANOVA) were performed with these parameters as the dependent variables. The results of the study suggested that professional-grade DA hardware is strongly recommended to provide accurate and valid voice assessment. The fundamental frequency measurement differences across DA environments were highly correlated to the discretization error (r=1.00), whereas jitter and shimmer were highly correlated to the effective dynamic range of the DA environments (r=-0.68 and r=-0.86, respectively).  相似文献   

18.
The effect of noise on computer-derived samples of voice was compared across three different hardware/software configurations. The hardware/software systems included a stand-alone A/D converter (CSL Module 4300B) coupled to a custom Pentium PC used in conjunction with the Multi-Dimensional Voice Program (MDVP) software, and a Creative Labs A/D converter coupled to the same custom PC under software control of MDVP/Multispeech and CSpeechSP. Voice samples were taken from 10 female subjects, then mixed with computer fan noise creating three different signal-to-noise (S/N) levels. Mixed signals were analyzed on the three hardware/software systems. Results revealed that fundamental frequency was most resistant to the degradation effect of noise across systems; jitter and shimmer values, however, were more variable across all configurations. Jitter and shimmer values were significantly higher under certain S/N levels for the MDVP 4300B based system as compared to MDVP for Multi-Speech and CSpeechSP. The findings punctuate the need for sensitivity to recording environments, careful selection of hardware/software equipment arrays, and the establishment of minimal recording conditions (>25 dBA S/N) for voice sampling and analysis using computer-assisted methods.  相似文献   

19.
This study investigated the relation of symptoms of vocal fatigue to acoustic variables reflecting type of voice production and the effects of vocal loading. Seventy-nine female primary school teachers volunteered as subjects. Before and after a working day, (1) a 1-minute text reading sample was recorded at habitual loudness and loudly (as in large classroom), (2) a prolonged phonation on [a:] was recorded at habitual speaking pitch and loudness, and (3) a questionnaire about voice quality, ease, or difficulty of phonation and tiredness of throat was completed. The samples were analyzed for average fundamental frequency (F0), sound pressure level (SPL), and phonation type reflecting alpha ratio (SPL [1-5 kHz]-SPL [50 Hz-1 kHz]). The vowel samples were additionally analyzed for perturbation (jitter and shimmer). After a working day, F0, SPL, and alpha ratio were higher, jitter and shimmer values were lower, and more tiredness of throat was reported. The average levels of the acoustic parameters did not correlate with the symptoms. Increase in jitter and mean F0 in loud reading correlated with tiredness of throat. The results seem to suggest that, at least among experienced vocal professionals, voice production type had little relevance from the point of view of vocal fatigue reported. Differences in the acoustic parameters after a vocally loading working day mainly seem to reflect increased muscle activity as a consequence of vocal loading.  相似文献   

20.
Values for acoustic voice measurements were obtained from 88 normal individuals and 98 pathological cases of mass lesions of vocal fold and 50 cases of unilateral vocal fold paralysis. Overall, all items reflecting perturbations of pitch and amplitude as well as glottal noise were significantly higher in the groups of patients compared with the normal group. The measurement of normalized noise energy (NNE) was found to be an optimum parameter for discrimination of normal/abnormal voices. The voices of patients with vocal fold nodules and vocal fold polyps were analyzed before endolaryngeal phonomicrosurgery (EPM) and 2 weeks after. Statistically significant (p < 0.01) improvement was achieved both in perceptual and acoustic analysis. EPM resulted in a significant decrease of mean jitter, shimmer, and NNE. Clinically, these measures provided documentable and measurable evidence of vocal function and were helpful for comparing patients with normal speakers. They also were useful for a thorough documentation of patient's voice pathology and for evaluation of the presurgical and postsurgical voice status.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号