Similar literature
 20 similar articles found
1.
Accuracy of acoustic voice analysis is influenced by the quality of recording. Lately, articles have suggested that soundcards perform equivalently to specialized professional-grade data acquisition (DA) systems. The purpose of this study was to investigate the influence of DA environment (DA system and microphone) on acoustic voice quality measurement (VQM) while balancing for gender, age, intersubject and intrasubject variability, and analysis software. More specifically, the relative performance of different hardware environments and the relationship between their technical characteristics and VQM performance was investigated. The discretization error and the effective dynamic range of the different DA environments were measured. We used 3 software systems to record and measure separately 2000 acoustic samples of sustained phonation for fundamental frequency, jitter, and shimmer. Analyses of variance (ANOVA) were performed with these parameters as the dependent variables. The results of the study suggested that professional-grade DA hardware is strongly recommended to provide accurate and valid voice assessment. The fundamental frequency measurement differences across DA environments were highly correlated to the discretization error (r=1.00), whereas jitter and shimmer were highly correlated to the effective dynamic range of the DA environments (r=-0.68 and r=-0.86, respectively).

2.
This study aimed to determine whether a correlation exists between the Grade, Roughness, Breathiness, Asthenia, Strain (GRBAS) scale (a subjective measure of voice) and the Multi-Dimensional Voice Program (MDVP) scale (an objective measure of voice). A retrospective review of 37 voice patients (12 male/25 female) was conducted. Each voice was perceptually evaluated using the GRBAS scale by an experienced speech pathologist and acoustically analyzed using the MDVP scale. Statistical analysis using a multivariate regression model identified a significant correlation between the noise-related parameters of MDVP and the components of the GRBAS scale. Grade correlated with voice turbulence index (VTI), noise harmonic ratio (NHR), and soft phonation index (SPI). Roughness correlated with NHR only. Breathiness correlated with SPI only. Asthenia also correlated with SPI only. Of the 19 acoustic variables measured by the MDVP system, only three noise parameters significantly correlated with the GRBAS perceptual voice analysis. Perhaps "noise" is the perceived acoustical quality of the dysphonic voice. A voice quantifying measure such as a "voice index score" could be proposed using the GRBAS scoring and the three clinically relevant MDVP values following further studies.

3.
OBJECTIVES: To evaluate the voice quality in patients with mild-to-moderate asthma by subjective and objective methods. STUDY DESIGN: Comparative, controlled, cross-sectional study. METHODS: Patients with mild-to-moderate asthma (n=40) and age- and sex-matched healthy controls (n=40) were included. Acoustic analyses were performed by the Multi-Dimensional Voice Program (MDVP; Kay Elemetrics Corporation, Lincoln Park, NJ) and the movements of the vocal cords were examined by videolaryngostroboscopy (VLS). In addition, the duration of illness, maximum phonation time, "s/z" values, and vital capacity were evaluated. Voice Handicap Index (VHI) and GRB scales were used for subjective evaluations. RESULTS: Maximum phonation time values were significantly shorter both in male and female asthma patients compared with controls (P<0.0001). Also, average shimmer values in MDVP were higher for both sexes in the patient group compared with controls (P=0.002 and P=0.04, respectively). There was a significant difference between female patients and sex-matched controls with regard to mean noise-to-harmonic ratio values (P=0.006). Female patients with asthma had higher average jitter values compared with sex-matched controls (P<0.0001). A significant difference was noted between asthma and control groups with regard to GRB scale (P<0.0001, P<0.001, and P<0.0001, respectively). The VHI score was above the normal limit in 16 (40%), and VLS findings were abnormal in 39 (97.5%) asthmatics. CONCLUSION: In asthmatic patients, maximum phonation time, frequency, and amplitude perturbation parameters were impaired, but the vital capacity and the duration of illness did not correlate with these findings.

4.
This study was designed to examine the relationship between the Voice Handicap Index (VHI) and acoustic measures of voice samples common in clinical practice. Fifty individuals (38 women and 12 men), ranging in age from 19 to 80 years with a mean age of 49 years, served as participants. Of these 50, 17 could be included in the acoustic analysis of voice based on measures of error calculated with the TF32 software. All participants completed the VHI and provided voice samples including three trials of the sustained vowel /A/ at a comfortable loudness level as well as a connected speech sample consisting of the Zoo Passage. Acoustic measures were made with TF32 and Cool Edit software and included fundamental frequency, jitter %, shimmer %, signal-to-noise ratio, mean root-mean-square intensity, fundamental frequency standard deviation, aphonic periods, and breath groups. Results indicate that these measures were not predictive of overall VHI score, and no cohesive or predictable pattern was identified when comparing individual measures with overall VHI or with each subscale item. Likely contributions to this lack of correlation and subsequent clinical implications are discussed, as well as the direction for further research.

5.
A study of voice activity detection in a moving-car environment   Cited by: 1 (self-citations: 0; citations by others: 1)
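Voice activity detection is an important part of speech interaction and communication systems. Its role is to distinguish speech segments from background-noise segments in the input signal. Detection relies mainly on the time-frequency characteristics of speech and noise, among which the periodicity and harmonic structure of voiced speech are widely used features. In a moving car, however, the noise is non-stationary and the signal-to-noise ratio is low, so these features are hard to detect reliably. To address this, the paper builds on the basic regularities of the harmonic structure of voiced sounds and exploits the fact that the signal-to-noise ratio differs across frequency bands in time-varying noise, proposing a relatively robust, fast harmonic detection algorithm. The algorithm takes small time-frequency blocks as its unit of analysis and uses a set of harmonic templates whose fundamental frequencies vary on a logarithmic scale to search adaptively for regions with a clear harmonic structure, which it then uses to detect voiced speech. Experiments show that the algorithm achieves fairly reliable speech/non-speech discrimination in a moving car.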
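The template idea in this abstract can be illustrated with a much simpler harmonic-comb search. The sketch below is not the authors' algorithm: it works on a single magnitude spectrum rather than small time-frequency blocks, and every name and default in it (`log_spaced_f0s`, `harmonic_score`, `detect_voiced`, the 80 to 400 Hz range, the threshold of 3.0) is an assumption made for illustration.

```python
import math


def log_spaced_f0s(f_min=80.0, f_max=400.0, steps_per_octave=12):
    """Candidate fundamental frequencies spaced evenly on a log scale (assumed range)."""
    n = int(math.log2(f_max / f_min) * steps_per_octave)
    return [f_min * 2 ** (k / steps_per_octave) for k in range(n + 1)]


def harmonic_score(spectrum, bin_hz, f0, n_harmonics=5):
    """Mean peak magnitude near the first harmonics of f0, relative to the spectrum mean."""
    mean_mag = sum(spectrum) / len(spectrum)
    picked = []
    for h in range(1, n_harmonics + 1):
        idx = round(h * f0 / bin_hz)
        if idx < len(spectrum):
            # Allow +/- 1 bin, because the log grid rarely hits the true F0 exactly.
            window = spectrum[max(0, idx - 1):idx + 2]
            picked.append(max(window))
    if not picked or mean_mag == 0:
        return 0.0
    return (sum(picked) / len(picked)) / mean_mag


def detect_voiced(spectrum, bin_hz, threshold=3.0):
    """Return (is_voiced, best_f0) for one magnitude spectrum."""
    best = max(log_spaced_f0s(), key=lambda f0: harmonic_score(spectrum, bin_hz, f0))
    return harmonic_score(spectrum, bin_hz, best) >= threshold, best
```

A frame whose spectrum has clear peaks at multiples of one candidate scores well above 1, while a flat, noise-like spectrum scores close to 1 for every candidate and is rejected by the threshold.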

6.
Voice analysis was performed on 21 “standard” laryngectomized, male patients with a Provox® voice prosthesis, along with an age- and sex-matched control group of 20 normal speakers, using acoustical analyses (MDVP and CSL, Kay Elemetrics Corp.), maximum phonation time measurements, and perceptual evaluations. Comparison between MDVP and CSL revealed that the latter was not useful for the analysis of laryngectomized prosthetic voices. In contrast, MDVP seems suitable for this purpose, and contains a large number of parameters that significantly differentiate between patient and control speakers, as did the perceptual ratings and the maximum phonation time. Fundamental frequency appeared to be comparable for patients and control speakers. A significant influence of stoma occlusion and age was found for some voice parameters. Factor analyses showed correlations between the different MDVP parameters and correlations between the MDVP parameters and the perceptual ratings.

7.

Objectives

The present study was performed to examine which factors among self-rated scales, perceptual evaluations, and acoustic parameters calculated from sustained vowels are reliable indicators of physical and mental fatigue.

Methods

A total of 73 volunteers (male:female, 52:21), aged 19–24 years, were enrolled in this study. We defined the high- and low-fatigue groups using the Chalder Fatigue Scale score. For assessment of self-rated symptoms, each subject was asked to complete Voice Handicap Index (VHI) and Voice Rating Scale (VRS). For perceptual evaluations, three clinicians assessed each subject’s vocal quality on the Grade, Roughness, Breathiness, Asthenia, Strain Scale. For acoustic analysis, each subject was asked to produce sustained vowels /a/, /e/, /i/, /o/, and /u/ for 3 seconds. Then, the habitual fundamental frequency (F0), jitter, shimmer, F0 tremor, mean F0, standard deviation of F0, maximum F0, minimum F0, normalized noise energy, harmonic-to-noise ratio (HNR), signal-to-noise ratio (SNR), amplitude tremor, and ratio within 2–4 kHz were calculated using Dr. Speech software.

Results

In men, VHI, VRS, F0 tremor, shimmer, HNR, SNR, and amplitude tremor were related to mental fatigue. In women, only VHI was related to physical fatigue, and none of the acoustic parameters was related to the fatigue score. Perceptual evaluations were not related to fatigue in men or women.

Conclusions

These findings suggest that self-rated symptoms and acoustic parameters related to voice quality are indicative of mental fatigue, and these features are prominent in men.

8.
Values for acoustic voice measurements were obtained from 88 normal individuals and 98 pathological cases of mass lesions of vocal fold and 50 cases of unilateral vocal fold paralysis. Overall, all items reflecting perturbations of pitch and amplitude as well as glottal noise were significantly higher in the groups of patients compared with the normal group. The measurement of normalized noise energy (NNE) was found to be an optimum parameter for discrimination of normal/abnormal voices. The voices of patients with vocal fold nodules and vocal fold polyps were analyzed before endolaryngeal phonomicrosurgery (EPM) and 2 weeks after. Statistically significant (p < 0.01) improvement was achieved both in perceptual and acoustic analysis. EPM resulted in a significant decrease of mean jitter, shimmer, and NNE. Clinically, these measures provided documentable and measurable evidence of vocal function and were helpful for comparing patients with normal speakers. They also were useful for a thorough documentation of patient's voice pathology and for evaluation of the presurgical and postsurgical voice status.

9.
This study aimed to verify whether the resonant voice based on Lessac's Y-Buzz can be perceived by listeners as resonant and different from habitual voice and to compare them to determine whether this sound exploration improves the vocal production. Nine newly graduated actors, six men and three women without voice complaints, were the subjects. They received a session of Lessac's Y-Buzz training from the primary investigator. Before training, they were asked to sustain the vowel /i/ at comfortable frequency and habitual loudness. After training, they were requested to sustain the Y-Buzz they had learned at a comfortable frequency and habitual loudness. Three speech-language pathologists (SLPs) trained in voice performed an auditory-perceptual analysis. The pre- and posttraining voice samples were randomly spliced together, edited, and presented in pairs to perceptual judges who were asked to identify the most resonant of the pair. The voice samples were also acoustically compared through the Hoarseness Diagram and acoustic measures using the VoxMetria Software (CTS, version 2.0s, Brazil). The Y-Buzz trials were identified as resonant voice in 74% of the comparisons. The acoustic measures showed a statistically significant decrease of irregularity (P = 0.002) and shimmer (P = 0.38). The Hoarseness Diagram demonstrated how the resonant voice moved toward normality for irregularity and noise components. The results showed that the resonant voice based on the Y-Buzz can be identified as resonant and different from normal voicing in the same subject, and it apparently implies a better vocal production demonstrating a significant decrease of shimmer and irregularity through the Hoarseness Diagram evaluation.

10.
Unilateral vocal fold paralysis (UVFP) is associated with changes in acoustic and aerodynamic voice measurements and can have a significant impact on a patient's quality of life. Few objective data regarding the efficacy of voice therapy for UVFP exist. The aim of this study was to retrospectively analyze voice modifications in a group of patients with UVFP before and after voice therapy. Forty patients with UVFP of different etiology were included in the study. Each subject had voice therapy with an experienced speech/language pathologist twice a week; the mean number of sessions was 12.6. A multidimensional assessment protocol was used; it included videoendoscopy, the maximum phonation time (MPT), the GIRBAS scale, spectrograms and a perturbation analysis, and the Voice Handicap Index (VHI). Pre- and posttreatment data were compared by means of the Wilcoxon and Student's t tests. A complete glottal closure was seen in 8 patients before voice therapy and in 14 afterward. Mean MPT increased significantly. In the perceptual assessment, the difference was significant for five out of six parameters. A significant improvement was found on spectrographic analysis; as for perturbation analysis, the differences in jitter, shimmer, and noise-to-harmonic ratio values were significant. VHI values showed a clear and significant improvement. A significant improvement of voice quality and quality of life after voice therapy is an often reached and reasonable goal in patients with UVFP.

11.
To quantify several acoustic features of the voice in patients with Parkinson's disease (PD), 41 patients and 28 age- and sex-matched controls were studied. PD severity was assessed with the Unified PD Rating Scale (UPDRS) and the Hoehn and Yahr staging. The Computerized Speech Lab 4300 program (Kay Elemetrics) was used. Two seconds of a sustained /a/ and a sentence were captured with a microphone and laryngograph equipment. Measures included fundamental frequency (F0), frequency perturbation (jitter), intensity perturbation (shimmer), and harmonic/noise ratio (H/N) of the vowel /a/, and frequency and intensity variability of a sentence, phonational range, dynamic range at the natural frequency, maximum phonational time, and s/z ratio. All subjects underwent indirect laryngoscopy and/or laryngeal fibroscopy. When compared with controls, PD patients showed higher jitter, lower H/N ratio, lower frequency and intensity variability of the sentence, and lower phonational range, and they more often reported low voice intensity, monopitch, voice arrests, and struggle. These features seem to be unaffected by the duration and severity of the disease.

12.
Predicting Mutational Change in the Speaking Voice of Boys   Cited by: 1 (self-citations: 0; citations by others: 1)
SUMMARY: The authors investigated whether acoustic speaking voice analyses can be used to predict the beginning of mutation in 21 male members of a professional boys' choir. Over a period of 3 years before mutation, children were examined every 3 months by ear, nose, and throat (ENT) and phoniatric specialists. At the same time, the voice was evaluated acoustically using analysis features of the Goettingen Hoarseness Diagram (GHD). Irregularity component and noise component, jitter, shimmer, mean waveform correlation coefficient, and fundamental frequency were determined from recordings of the speaking voice. Significant changes of acoustic features appeared 7 and 5 months before mutation onset, which indicates that vocal function is already restricted 6 months before mutation onset. This acoustic voice analysis is therefore suitable to support the care of the professional singing voice.

13.
An accurate analysis of voice quality is imperative when using acoustic measurements to diagnose vocal pathologies. It is known that noise has a significant effect on the reliability and validity of acoustic voice measurements, but the precise relationship has not been established. The purpose of this study was to investigate the influence of noise on the accuracy, reliability, and validity of acoustic voice quality measurements while balancing for gender, age, intersubject and intrasubject variability, microphones, computer hardware, analysis software, and type of noise. Level of noise was precisely controlled. The specific focus of interest was to determine the critical levels of noise that can invalidate voice quality measurements and to generate practical recommendations. Results suggest that the recommended, acceptable, and unacceptable levels of noise in the acoustic environment are above 42 dB, above 30 dB, and below 30 dB signal-to-noise ratio, respectively.
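The thresholds reported in this abstract can be turned into a small recording check. The sketch below assumes the signal-to-noise ratio is computed from the RMS amplitude of a voiced segment and of a noise-only segment; the abstract does not say how the study measured it. The function names are hypothetical, and treating exactly 30 dB as acceptable is an assumption, since the abstract only says "above" and "below".

```python
import math


def rms(samples):
    """Root-mean-square amplitude of a sequence of samples."""
    return math.sqrt(sum(x * x for x in samples) / len(samples))


def snr_db(signal, noise):
    """Signal-to-noise ratio in dB from a voiced segment and a noise-only segment."""
    return 20.0 * math.log10(rms(signal) / rms(noise))


def classify_snr(snr):
    """Map an SNR in dB onto the three bands named in the abstract."""
    if snr > 42.0:
        return "recommended"
    if snr >= 30.0:  # assumption: the 30 dB boundary itself counts as acceptable
        return "acceptable"
    return "unacceptable"
```

For example, a signal whose RMS amplitude is 100 times that of the background noise has an SNR of 40 dB, which falls in the acceptable band but not the recommended one.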

14.
The categorization of voice into quality type (ie, normal, breathy, hoarse, rough) is often a traditional part of the voice diagnostic. The goal of this study was to assess the contributions of various time and spectral-based acoustic measures to the categorization of voice type for a diverse sample of voices collected from both functionally dysphonic (breathy, hoarse, and rough) (n=83) and normal women (n=51). Before acoustic analyses, 12 judges rated all voice samples for voice quality type. Discriminant analysis, using the modal rating of voice type as the dependent variable, produced a 5-variable model (comprising time and spectral-based measures) that correctly classified voice type with 79.9% accuracy (74.6% classification accuracy on cross-validation). Voice type classification was achieved based on two significant discriminant functions, interpreted as reflecting measures related to "Phonatory Instability" and "F(0) Characteristics." A cepstrum-based measure (CPP/EXP ratio) consistently emerged as a significant factor in predicting voice type; however, variables such as shimmer (RMS dB) and a measure of low- vs. high-frequency spectral energy (the Discrete Fourier Transformation ratio) also added substantially to the accurate profiling and prediction of voice type. The results are interpreted and discussed with respect to the key acoustic characteristics that contributed to the identification of specific voice types, and the value of identifying a subset of time and spectral-based acoustic measures that appear sensitive to a perceptually diverse set of dysphonic voices.

15.
The aim of the study was to outline the multidimensional perceptual, subjective, and instrumental acoustic voice changes in a group of reflux laryngitis (RL) patients. Data from the multidimensional voice assessment of 108 RL patients and 90 healthy persons in the control group were subjected to comparative analysis. Slight hoarseness according to the GRB (G: grade, R: roughness, B: breathiness) scale prevailed in the RL patient group. A statistically significant difference (P < 0.001) between the RL patient group and the control group was found for all voice parameters measured, with the patients having worse results: increased mean jitter, shimmer, normalized noise energy, and voice handicap index (VHI), and decreased phonetogram parameters. The results of the study demonstrated that multidimensional voice assessment documented deteriorated voice quality and restricted phonation capabilities in the tested group of RL patients.

16.
Journal of Voice, 2020, 34(4): 649.e15-649.e20
Objective

To demonstrate the surgical efficacy of septoplasty using acoustic rhinometry (AR) and anterior rhinomanometry (ARM) and to evaluate the effect of septoplasty on voice performance through subjective voice analysis methods.

Materials and Methods

This prospective study enrolled a total of 62 patients who underwent septoplasty with the diagnosis of deviated nasal septum. Thirteen patients with no postoperative improvement versus preoperative period as shown by AR and/or ARM tests and three patients with postoperative complications and four patients who were lost to follow-up were excluded. As a result, a total of 42 patients were included in the study. Objective tests including AR, ARM, acoustic voice analysis and spectrographic analysis were performed before the surgery and at 1 month and 3 months after the surgery. Subjective measures included the Nasal Obstruction Symptom Evaluation questionnaire to evaluate surgical success and Voice Handicap Index-30 tool for assessment of voice performance postoperatively, both completed by all study patients.

Results

Among acoustic voice analysis parameters, F0, jitter, Harmonics-to-Noise Ratio values as well as formant frequency (F1-F2-F3-F4) values did not show significant differences postoperatively in comparison to the preoperative period (P > 0.05). Only the shimmer value was statistically significantly reduced at 1 month (P < 0.05) and 3 months postoperatively (P < 0.05) versus baseline. Statistically significant reductions in Voice Handicap Index-30 scores were observed at postoperative 1 month (P < 0.001) and 3 months (P < 0.001) compared to the preoperative period and between postoperative 1 month and 3 months (P < 0.05).

Conclusion

In this study, first operative success of septoplasty was demonstrated through objective tests and then objective voice analyses were performed to better evaluate the overall effect of septoplasty on voice performance. Shimmer value was found to be improved in the early and late postoperative periods.

17.
This study was designed to investigate objective voice quality measurements in unilateral vocal fold paralysis (UVFP) by eliminating intersubject variability. To our knowledge this is the first report objectively analyzing paralytic dysphonia as compared to the same voice before onset of UVFP. The voices of two male subjects were prospectively recorded before and after the onset of iatrogenic UVFP (thoracic surgery). The following acoustic measurements of the vowel /a/ were performed using the CSL and MDVP (Kay Elemetrics): jitter, shimmer, harmonics-to-noise ratio, cepstral peak prominence, the relative energy levels of the first harmonic, the first formant and the third formant, the spectral slope in the low-frequency zone (0-1 kHz and 0-2 kHz), and the relative level of energy above 6 kHz. Distribution of spectral energy was analyzed from a long-term average spectrum of 40 seconds of text. Laryngeal aerodynamic measurements were obtained for one patient before and after onset of paralysis using the Aerophone II (Kay Elemetrics). Pitch and amplitude perturbation increased secondary to UVFP, while the harmonics-to-noise ratio and the cepstral peak prominence decreased. A relative increase in the mid-frequency and high-frequency ranges and a decrease in the low-frequency spectral slope were observed. Mean airflow rate and intraoral pressure increased, and glottal resistance and vocal efficiency decreased secondary to UVFP. The findings of this self-paired study confirm some but not all the results of previous studies. Measures involving the fundamental and the formants did not corroborate previous findings. Further investigation with vocal tract modeling is warranted.

18.
Xu Songwei (许松伟), Hu Xiaoji (胡晓吉). Applied Acoustics (《应用声学》), 2015, 23(10): 64-64
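To meet a control system's practical need to record speech in real time and play back recordings, a Compact PCI (CPCI) board was designed and implemented on the basis of speech compression coding, with a field-programmable gate array (FPGA) as the control core. The board acquires speech signals on 16 channels, encodes them with ADPCM, stores the speech files in WAVE format, and combines monitoring with playback of a specified speech channel and of recordings from any time period. Its innovation over conventional voice cards is the use of speech superposition to provide a mixing function, so that speech from different channels can be mixed, recorded, and stored; the compression ratio is also freely selectable. The board's structure, operating principle, hardware design, and software design are described in detail. In final experimental tests, speech storage, speech playback, and the other functions all worked correctly with good sound quality, confirming the feasibility and practicality of the design.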
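The abstract does not say how the board superimposes channels, so the following is only a software sketch of one common approach: summing sample-aligned 16-bit PCM channels and clipping the result to the 16-bit range. The function name `mix_channels` and the choice of saturation over scaling are assumptions, and the ADPCM encoding step is not shown.

```python
def mix_channels(channels, lo=-32768, hi=32767):
    """Sum sample-aligned 16-bit PCM channels and clip each sum to [lo, hi]."""
    if not channels:
        return []
    # Mix only over the span that every channel covers.
    length = min(len(ch) for ch in channels)
    mixed = []
    for i in range(length):
        total = sum(ch[i] for ch in channels)
        mixed.append(max(lo, min(hi, total)))  # saturate instead of wrapping around
    return mixed
```

Saturating avoids the wrap-around distortion that a plain 16-bit addition would produce when several loud channels coincide; dividing the sum by the number of channels is an alternative that trades loudness for headroom.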

19.
To quantify several acoustic features of the voice in patients with essential tremor (ET), 28 patients and 28 age- and sex-matched controls were studied. ET severity was assessed with the rating scale for tremor of Fahn, Tolosa, and Marín. The Computerized Speech Lab 4300 program (Kay Elemetrics) was used. Two-second samples of a sustained /a/ and a sentence were captured with a microphone and laryngograph equipment. Measures included fundamental frequency (F0), frequency perturbation (jitter, Koike algorithm), intensity perturbation (shimmer, Horii algorithm), and harmonic-to-noise ratio (H/N, Yumoto algorithm) of the vowel /a/, and the frequency and intensity variability of the sentence, phonational range, dynamic range at the natural frequency, maximum phonational time, and s/z ratio. All subjects underwent indirect laryngoscopy and/or laryngeal fibroscopy. When compared with controls, ET patients showed higher jitter and lower H/N ratio (the latter only with the laryngographic signal) of the vowel /a/, lower frequency variability in the microphone signal, lower intensity variability in the laryngographic signal of the sentence, and significantly lower dynamic range at the natural frequency of phonation. ET patients more often reported high voice intensity, tremor, and struggle. Several acoustic parameters were influenced by the severity of the disease, including shimmer, jitter, H/N ratio, frequency variability of the sentence, and s/z ratio, although none of the acoustic analysis values or phonetometric measurements was affected by the presence of voice tremor or by a successful pharmacological treatment of ET.

20.
Unification of perturbation measures in speech signals   Cited by: 2 (self-citations: 0; citations by others: 2)
Voice perturbation measures, formerly defined in a somewhat ad hoc fashion, are discussed within the framework of signal theory. An attempt is made to unify a variety of existing jitter, shimmer, and noise measures on the basis of common underlying perturbation functions and their derivatives. Some simple modulations (sinusoid, Gaussian noise, linear trend) are imposed on cyclic parameters in phonation (e.g., amplitude and fundamental frequency) to test the ability of perturbation measures to detect or reject these types of modulations. It is expected that this systematic approach to perturbation analysis will be helpful in identifying the sources of irregularity in the voice and, thereby, in the detection of laryngeal disorders.
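The kind of test described in this abstract can be sketched with one widely used first-order measure: the mean absolute difference between consecutive cycles, divided by the mean cycle value. Applied to periods it gives a local jitter, and applied to peak amplitudes a local shimmer. This is not necessarily one of the measures unified in the paper, and the test-signal generator `modulated_cycles` with its parameters is a hypothetical helper.

```python
import math


def local_perturbation(values):
    """Mean absolute cycle-to-cycle difference, relative to the mean cycle value."""
    if len(values) < 2:
        raise ValueError("need at least two cycles")
    diffs = [abs(b - a) for a, b in zip(values, values[1:])]
    return (sum(diffs) / len(diffs)) / (sum(values) / len(values))


def modulated_cycles(n, base=0.005, depth=0.0, cycles_per_mod=10.0, trend=0.0):
    """Hypothetical test sequence: a base value with sinusoidal modulation and a linear trend."""
    return [
        base * (1 + depth * math.sin(2 * math.pi * i / cycles_per_mod) + trend * i)
        for i in range(n)
    ]


steady = modulated_cycles(100)                # perfectly regular periods (5 ms)
wobble = modulated_cycles(100, depth=0.02)    # 2% sinusoidal modulation
drift = modulated_cycles(100, trend=0.0005)   # slow linear lengthening of the period
```

With these inputs the measure is zero for the steady sequence, clearly non-zero for the sinusoidal modulation, and much smaller for the slow drift, which shows how a first-difference measure responds to fast modulation and largely rejects a linear trend.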
