Similar articles
1.
Journal of Voice, 2020, 34(3): 485.e33-485.e43
Purpose: The present study measured the smoothed and non-smoothed cepstral peak prominence (CPPS and CPP) in teachers who considered themselves to have normal voices, although some of them had laryngeal pathology. Changes in CPP, CPPS, sound pressure level (SPL), and perceptual ratings across different voice tasks were investigated, and the influence of vocal pathology on these measures was studied. Method: Eighty-four Finnish female primary school teachers volunteered as participants. Laryngoscopically, 52.4% of these had laryngeal changes (39.3% mild, 13.1% disordered). Sound recordings were made of a comfortable sustained vowel, comfortable speech, and speech produced at an increased loudness level as used during teaching. CPP, CPPS, and SPL values were extracted using Praat software for all three voice samples. Sound samples were also perceptually evaluated by five voice experts for overall voice quality (10-point scale from poor to excellent) and vocal firmness (10-point scale from breathy to pressed, with normal in the middle). Results: The CPP, CPPS, and SPL values were significantly higher for vowels than for comfortable speech and for loud speech compared to comfortable speech (P < 0.001). Significant correlations were found between SPL and cepstral measures. The loud speech was perceived to be firmer and to have a better voice quality than comfortable speech. No significant relationships of the laryngeal pathology status with cepstral values, perceptual ratings, or voice SPLs were found (P > 0.05). Conclusion: Neither the acoustic measures (CPP, CPPS, and SPL) nor the perceptual evaluations could clearly distinguish teachers with laryngeal changes from laryngeally healthy teachers. Given that the subjects reported no vocal complaints, the data could be considered representative of teachers with functionally healthy voices.

2.
Fundamental frequency (F0) perturbation has been found to be useful as an acoustic correlate of the perception of dysphonia in adult voices. In a previous investigation, we showed that hoarseness in children's voices is a stable concept composed mainly of three predictors: hyperfunction, breathiness, and roughness. In the present investigation, the relation between F0 perturbation and hoarseness as well as its predictors was analyzed in running speech of six children representing different degrees of hoarseness. Two perturbation measures were used: the standard deviation of the distribution of perturbation data and the mean of the absolute value of perturbation. The results revealed no clear relation.
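
As a rough illustration of the two perturbation measures named in the abstract above, the following sketch computes them from a short F0 track. It assumes that "perturbation" means the difference between consecutive F0 values, which the abstract does not specify. The function name and the sample values are hypothetical and are not taken from the study.

```python
from statistics import mean, pstdev


def perturbation_measures(f0_values):
    """Return (SD of perturbation, mean absolute perturbation) in Hz.

    Assumption: perturbation is the difference between consecutive F0 values.
    """
    if len(f0_values) < 2:
        raise ValueError("need at least two F0 values")
    # Cycle-to-cycle differences of the F0 track.
    diffs = [b - a for a, b in zip(f0_values, f0_values[1:])]
    return pstdev(diffs), mean(abs(d) for d in diffs)


# Hypothetical F0 track in Hz (illustrative only).
sd, mean_abs = perturbation_measures([220.0, 222.0, 219.0, 221.0, 220.0])
```

The population standard deviation (`pstdev`) is used here for simplicity. A sample standard deviation would be an equally plausible reading of the abstract.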

3.
Spectral amplitude measures are sensitive to varying degrees of vocal fold adduction in normal speakers. This study examined the applicability of harmonic amplitude differences to adductor spasmodic dysphonia (ADSD) in comparison with normal controls. Amplitudes of the first and second harmonics (H1, H2) and of harmonics affiliated with the first, second, and third formants (A1, A2, A3) were obtained from spectra of vowels and /i/ excerpted from connected speech. Results indicated that these measures could be made reliably in ADSD. With the exception of H1*-H2*, harmonic amplitude differences (H1*-A1, H1*-A2, and H1*-A3*) exhibited significant negative linear relationships (P < 0.05) with clinical judgments of overall severity. The four harmonic amplitude differences significantly differentiated between productions before and after botulinum toxin treatment (pre-BT and post-BT) (P < 0.05). After treatment, measurements from detected significant differences between ADSD and normal controls (P < 0.05), but measurements from /i/ did not. LTAS analysis of ADSD patients' speech samples proved a good fit with harmonic amplitude difference measures. Harmonic amplitude differences also significantly correlated with perceptual judgments of breathiness and roughness (P < 0.05). These findings demonstrate high clinical applicability for harmonic amplitude differences for characterizing phonation in the speech of persons with ADSD, as well as normal speakers, and they suggest promise for future application to other voice pathologies.

4.
OBJECTIVES/HYPOTHESIS: The purpose of this study was (1) to determine whether changes in intra- and interrater reliability occur for inexperienced listeners' judgments of overall severity, roughness, and breathiness in dysphonic and normal speakers after 2 hours of listener training; and (2) to determine the acoustic bases of inexperienced listeners' judgments before and after training. STUDY DESIGN: Prospective, single group, pre- and postdesign. METHODS: Thirty adult dysphonic and six normal speaker samples were selected from a database. Samples included 21 test stimuli and 15 training stimuli of both sustained vowels and connected speech. Sixteen inexperienced listeners judged all samples for overall severity, roughness, and breathiness using visual analog scales. Each listener provided pretraining ratings at baseline. Listeners were then trained using 15 anchor voice samples and 15 training stimuli. During training, listeners were provided with definitions of rating dimensions, accuracy feedback, and anchor samples. Listeners then judged test stimuli in a posttraining session. Speaker samples also were analyzed acoustically. RESULTS: Intrarater reliability was least variable for judgments of overall severity, but improved further with training. Listener judgments of roughness and breathiness in vowels were least reliable at baseline, but they significantly improved between listeners after training. Finally, measures of cepstral peak prominence significantly predicted all voice quality judgments except roughness in vowels, which was predicted by shimmer. The acoustic bases of group perceptual judgments did not seem to change with training. CONCLUSIONS: These findings have implications for developing training programs in perceptual evaluation and mapping relationships between acoustic and perceptual characteristics of voice disorders.

5.
The purpose of this study was to investigate univariate relationships between perceived dysphonia and variation in pitch perturbation, amplitude perturbation, and additive noise. A time-domain, pitch-synchronous synthesis technique was used to generate sustained vowels varying in each of the three acoustic dimensions. A panel of trained listeners provided direct magnitude estimates of roughness in the case of the stimuli varying in pitch and amplitude perturbation, and breathiness in the case of the stimuli varying in additive noise. Very strong relationships were found between perceived roughness and either pitch or amplitude perturbation. However, unlike results reported previously for nonspeech stimuli, the subjective quality associated with pitch perturbation was quite different from that associated with amplitude perturbation. Results also showed that perceived roughness was affected not only by the amount of perturbation, but also by the degree of correlation between adjacent pitch or amplitude values. A strong relationship was found between perceived breathiness and signal-to-noise ratio. Contrary to previous findings, there was no interaction between signal-to-noise ratio and the amount of high-frequency energy in the periodic component of the stimulus: Stimuli with similar signal-to-noise ratios received similar ratings, regardless of differences in the spectral slope of the periodic component.

6.
Journal of Voice, 2020, 34(3): 486.e13-486.e22
Objectives: The study aimed to investigate the short-term and long-term effects of voice rehabilitation in patients treated with radiotherapy for laryngeal cancer as measured by both the acoustic measure smoothed cepstral peak prominence (CPPS) and perceptual measures. A secondary aim was to investigate the relationship between acoustic and perceptual measures. Methods: In total, 37 patients received voice rehabilitation post-radiotherapy and 37 patients constituted the irradiated control group. Outcome measures were mean CPPS for connected speech and ratings with the auditory-perceptual Grade, Roughness, Breathiness, Asthenia and Strain (GRBAS) scale. Outcome measures were analyzed 1 (baseline), 6, 12, and 24 months post-radiotherapy, where voice rehabilitation was conducted between the first two time-points. Additional recordings were acquired from vocally healthy participants for comparison. Results: CPPS values of the voice rehabilitation group and vocally healthy group were not significantly different at 24 months post-radiotherapy. Ten out of 19 patients who received voice rehabilitation yielded a CPPS value above the threshold for normal voice 24 months post-radiotherapy, compared to 11 out of 26 in the irradiated control group. No statistically significant correlations were found between CPPS and perceptual parameters of GRBAS. Conclusion: Voice rehabilitation for irradiated laryngeal cancer patients may have positive effects on voice quality up to 24 months post-radiotherapy. The relationship between CPPS and GRBAS as well as the applicability of CPPS for evaluation over several points of measurement needs to be studied further.

7.
This study was designed to investigate how variations in patterns of injection could improve the efficacy of botulinum toxin injections in relieving the symptoms of adductor spasmodic dysphonia. A total of 64 adductor spasmodic dysphonia patients who were injected using indirect laryngoscopic localization (for a total of 426 injections) were analyzed retrospectively using their own subjective data on duration of voice improvement, optimal voice improvement, breathiness side effects, and intervals between treatments. Injection to both the thyroarytenoid (TA) and the lateral cricoarytenoid (LCA) simultaneously gave the best voice results; the overall improvement from baseline was the longest lasting, and the period during which the voice was the best was the longest lasting. TA + LCA also gave the shortest duration of undesirable breathiness side effect. On the basis of these data, it seems reasonable to recommend that initial botulinum toxin therapy for adductor spasmodic dysphonia patients should be a single unilateral injection placed strategically at the posterior portion of the TA and directed toward the LCA so that both muscle groups are affected.

8.
The purpose of this study was (1) to determine the relationship between acoustic measures and auditory-perceptual dimensions of overall voice severity and pleasantness and (2) to evaluate the ability of acoustic and auditory-perceptual measures to discriminate normal from dysphonic voices. Thirty adult dysphonic speakers and six age-matched normal control speakers were asked to provide oral reading samples of the Rainbow Passage. Acoustic analysis of the speech samples was used to identify abnormal phonatory events associated with dysphonia. The acoustic program calculated long-term average spectral measures, glottal noise measures, and those measures based on linear prediction (LP) modeling. Twelve adult listeners judged overall voice severity and pleasantness from the connected speech samples using direct magnitude estimation (DME) procedures. The acoustic measures accounted for 48% of overall voice severity and 40% of voice pleasantness for dysphonic speakers. The classification performance of the acoustic measures and auditory-perceptual measures was quantified using logistic regression analysis. When acoustic measures or auditory-perceptual measures were considered in isolation, classification was generally accurate and similar across measures. Classification accuracy improved to 100% when acoustic and auditory-perceptual measures were combined. These data provide further support for use of both auditory-perceptual evaluation and acoustic analyses for classifying and evaluating dysphonia.

9.
Injection of botulinum toxin (Botox) into the laryngeal muscles has become the treatment of choice for controlling the symptoms of spasmodic dysphonia (SD). Currently, no specific battery of objective tests to assess the outcome is universally accepted. The purpose of this study was to investigate the association of demographic, clinical, and treatment factors with voice outcome following Botox injection. Sixty-eight patients with adductor SD who underwent at least one Botox injection during a 5-year period were studied. Voice outcome measures were made from patient self-reporting scales and included overall vocal quality, length of response, and duration of breathiness. Vocal quality was significantly correlated with the underlying severity of vocal symptoms prior to treatment, incidence of breathiness, and unilateral versus bilateral injection. The length of response was greater in males and following bilateral injections. An increased period of breathiness significantly correlated with bilateral injections.

10.
Noise-to-Harmonics Ratio as an Acoustic Measure of Voice Disorders in Boys
This prospective study assessed the efficacy of computerized noise-to-harmonics ratio (NHR) to quantify perceptual and endoscopic findings of dysphonia and/or structural lesion of the vocal fold. Fifty Brazilian boys without vocal complaints underwent computerized, perceptual, and endoscopic examination. Thirty boys were dysphonic: 3 were classified into the grade category, 5 into breathiness, 9 into roughness, and 15 into grade/breathiness. Vocal fold lesions were observed in 25 boys (17 nodules and 8 cysts). The Mann-Whitney U test revealed that NHR was significantly higher in boys with a structural lesion (p = 0.007) and in boys with dysphonia (p < 0.0001). However, according to a logistic regression model, only the occurrence of dysphonia was explained by NHR; the odds of having dysphonia approximately doubled (odds ratio = 1.92, 95% confidence interval = 1.3-2.9) with each increase of 0.01 in NHR. Our results suggest that noise is a useful quantitative index to confirm a perceptual diagnosis of dysphonia and to evaluate quantitative changes in a dysphonic voice over time. However, we believe that computerized analysis should be used as a complement to, rather than a substitute for, perceptual evaluation. Further studies with a larger sample are required to investigate the relationship between noise and lesions of the vocal folds.
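
To make the reported odds ratio concrete, the sketch below shows how an odds ratio quoted per 0.01 increase in NHR compounds over a larger increase. It assumes the usual logistic-regression reading, in which the log-odds are linear in NHR. The function name and the 0.03 increment are illustrative and are not taken from the study.

```python
def odds_multiplier(odds_ratio_per_step, step, delta):
    """Factor by which the odds change for an increase of `delta`,
    assuming a constant odds ratio per `step` of the predictor."""
    return odds_ratio_per_step ** (delta / step)


# Reported value: odds ratio = 1.92 per 0.01 increase in NHR.
# Hypothetical question: what does a 0.03 increase imply under this model?
factor = odds_multiplier(1.92, 0.01, 0.03)
print(round(factor, 2))
```

Under this assumption a 0.03 increase in NHR multiplies the odds of dysphonia by 1.92 cubed, or roughly 7. This factor applies to the odds, not to the probability.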

11.
Journal of Voice, 2020, 34(5): 806.e7-806.e18
There is a high prevalence of dysphonia among professional voice users and the impact of the disordered voice on the speaker is well documented. However, there is minimal research on the impact of the disordered voice on the listener. Considering that professional voice users include teachers and air-traffic controllers, among others, it is imperative to determine the impact of a disordered voice on the listener. To address this, the objectives of the current study were to: (1) determine whether there are differences in speech intelligibility between individuals with healthy voices and those with dysphonia; (2) understand whether cognitive-perceptual strategies increase speech intelligibility for dysphonic speakers; and (3) determine the relationship between subjective voice quality ratings and speech intelligibility. Sentence stimuli were recorded from 12 speakers with dysphonia and four age- and gender-matched typical, healthy speakers and presented to 129 healthy listeners divided into one of three strategy groups (i.e., control, acknowledgement, and listener strategies). Four expert raters also completed a perceptual voice assessment using the Consensus Auditory-Perceptual Evaluation of Voice for each speaker. Results indicated that dysphonic voices were significantly less intelligible than healthy voices (P < 0.001) and the use of cognitive-perceptual strategies provided to the listener did not significantly improve speech intelligibility scores (P = 0.602). Using the subjective voice quality ratings, regression analysis found that breathiness was able to predict 41% of the variance associated with number of errors (P = 0.008). Overall results of the study suggest that speakers with dysphonia demonstrate reduced speech intelligibility and that providing the listener with specific strategies may not result in improved intelligibility.

12.
Journal of Voice, 2019, 33(6): 838-845
Background: A limited number of experiments have investigated the perception of strain compared to the voice qualities of breathiness and roughness despite its widespread occurrence in patients who have hyperfunctional voice disorders, adductor spasmodic dysphonia, and vocal fold paralysis among others. Objective: The purpose of this study is to determine the perceptual basis of strain through identification and exploration of acoustic and psychoacoustic measures. Methods: Twelve listeners evaluated the degree of strain for 28 dysphonic phonation samples on a five-point rating scale task. Computational estimates based on cepstrum, sharpness, and spectral moments (linear and transformed with auditory processing front-end) were correlated to the perceptual ratings. Results: Perceived strain was strongly correlated with cepstral peak prominence, sharpness, and a subset of the spectral metrics. Spectral energy distribution measures from the output of an auditory processing front-end (i.e., excitation pattern and specific loudness pattern) accounted for 77–79% of the model variance for strained voices in combination with the cepstral measure. Conclusions: Modeling the perception of strain using an auditory front-end prior to acoustic analysis provides better characterization of the perceptual ratings of strain, similar to our prior work on breathiness and roughness. Results also provide evidence that the sharpness model of Fastl and Zwicker (2007) is one of the strong predictors of strain perception.

13.
We analyzed frequency and duration parameters of voice and speech in two men with adductor spasmodic dysphonia (SD). One was treated with botulinum toxin injection; the other received acupuncture therapy. Improvement after acupuncture therapy in terms of standard deviation of fundamental frequency, acoustic perturbation measurements, durational measurements of voice and speech, and spectrographic analysis was comparable to the results achieved with botulinum toxin injection. Voice and speech parameters were stable 1 year after acupuncture therapy.

14.
The acoustic characteristics of sustained vowels have been widely investigated across various languages and ethnic groups. These acoustic measures, including fundamental frequency (F0), jitter (Jitt), relative average perturbation (RAP), five-point period perturbation quotient (PPQ5), shimmer (Shim), and 11-point amplitude perturbation quotient (APQ11), are not well established for Malaysian Malay young adults. This article studies the acoustic measures of Malaysian Malay adults using acoustical analysis. The study analyzed six sustained Malay vowels of 60 normal native Malaysian Malay adults with a mean age of 21.19 years. The F0 values of Malaysian Malay males and females were reported as 134.85 ± 18.54 and 238.27 ± 24.06 Hz, respectively. Malaysian Malay females had significantly higher F0 than that of males for all the vowels. However, no significant differences were observed between the genders for the perturbation measures in all the vowels, except RAP in /e/. No significant F0 differences between the vowels were observed. Significant differences between the vowels were reported for all perturbation measures in Malaysian Malay males. As for Malaysian Malay females, significant differences between the vowels were reported for Shim and APQ11. Multiethnic comparisons indicate that F0 varies between Malaysian Malay and other ethnic groups. However, the perturbation measures cannot be directly compared, because the measures vary significantly across different speech analysis software.

15.
The aim of this study was to investigate the acoustic and electroglottographic characteristics of patients with mutational dysphonia before and after voice therapy. The clinical records of 15 patients with mutational dysphonia were reviewed, and their voice recordings were analyzed with the help of the Lx Speech Studio program (Laryngograph Ltd, London, UK). After voice therapy combined with the manual compression method, the subjects' voices lowered in pitch and improved in quality. In addition, we classified the mutational dysphonia into four categories according to the presence of diplophonia and closed quotients. The most common type among the categories was characterized by a bimodal distribution of fundamental frequency (diplophonia), accompanied by a low closed quotient (falsetto voice) at high frequencies. However, the results also showed that mutational dysphonia cannot be generalized as always having a falsetto voice, as shown in other types. The effect of therapy was different for each type, and those cases with both diplophonia and a non-trained falsetto voice could be treated more readily. Consequently, the diplophonia and closed quotient, which were easily analyzed using the Lx Speech Studio program, are important factors in the classification of mutational dysphonia. Identification of these characteristics may affect treatment choices, facilitate monitoring of the efficacy of therapy, and aid in estimating prognosis.

16.
The perception of breathiness in vowels is cued by multiple acoustic cues, including changes in aspiration noise (AH) and the open quotient (OQ) [Klatt and Klatt, J. Acoust. Soc. Am. 87(2), 820-857 (1990)]. A loudness model can be used to determine the extent to which AH masks the harmonic components in voice. The resulting "partial loudness" (PL) and loudness of AH ["noise loudness" (NL)] have been shown to be good predictors of perceived breathiness [Shrivastav and Sapienza, J. Acoust. Soc. Am. 114(1), 2217-2224 (2003)]. The levels of AH and OQ were systematically manipulated for ten synthetic vowels. Perceptual judgments of breathiness were obtained and regression functions to predict breathiness from the ratio of NL to PL (η) were derived. Results show that breathiness can be modeled as a power function of η. The power parameter of this function appears to be affected by the fundamental frequency of the vowel. A second experiment was conducted to determine if the resulting power function could estimate breathiness in a different set of voices. The breathiness of these stimuli, both natural and synthetic, was determined in a listening test. The model estimates of breathiness were highly correlated with perceptual data but the absolute predicted values showed some discrepancies.
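
The power-function relationship described in the abstract above can be sketched as breathiness = a · η^b. The abstract does not say how the regression functions were fitted, so the example below uses ordinary least squares on log-log axes as one common approach. The function name and the synthetic data are hypothetical.

```python
import math


def fit_power_function(eta, rating):
    """Fit rating = a * eta**b by least squares on log-log axes.

    Assumes all eta and rating values are strictly positive.
    Returns the tuple (a, b).
    """
    xs = [math.log(e) for e in eta]
    ys = [math.log(r) for r in rating]
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    sxx = sum((x - mx) ** 2 for x in xs)
    sxy = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    b = sxy / sxx              # slope on log-log axes = power parameter
    a = math.exp(my - b * mx)  # intercept on log-log axes = log(a)
    return a, b


# Synthetic data generated from rating = 2 * eta**0.5 (illustrative only).
eta = [0.25, 1.0, 4.0, 16.0]
rating = [2 * e ** 0.5 for e in eta]
a, b = fit_power_function(eta, rating)
```

Fitting a separate power parameter `b` for each fundamental-frequency group would be one way to explore the F0 dependence the abstract mentions.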

17.

Aim

To describe the laryngeal configuration and the voice of male patients diagnosed with unilateral vocal fold paralysis (UVFP) before and after medialization.

Methods

This retrospective study collected data from the medical records of 142 patients diagnosed with UVFP from January 2003 to April 2009 who underwent auditory-perceptual assessment of voice and visual-perceptual assessment of laryngeal images before and after medialization.

Results

The study included data from 24 male patients, with an average age of 60.7 years, who underwent three surgical medialization techniques (injection of hyaluronic acid, type I thyroplasty, and injection of Teflon). Before treatment, the position of the paralyzed vocal fold was seen to have a significant influence on the passing of the healthy vocal fold beyond the midline and on the overall degree of dysphonia. After treatment, the following changes were found to be significant: complete glottic closure; a linear free margin of the vocal fold; the paralyzed vocal fold in the median position; reduction of hoarseness, roughness, and breathiness (more frequently mild) and of asthenia (more frequently normal and mild); tension and instability (more frequently normal); and a decrease in the overall degree of dysphonia.

Conclusion

The position of the paralyzed vocal fold influences the position of the healthy vocal fold in relation to the midline and the overall degree of dysphonia. All three treatments improved the glottic configuration and the voice of patients with UVFP.

18.
Within-subject variation of three vocal frequency perturbation indices was compared across multiple sessions. The magnitude of jitter factor (JF), pitch perturbation quotient (PPQ), and directional perturbation factor (DPF) was measured every other day for 33 consecutive days for ten female and five male normal young adult speakers. Perturbation measures were calculated using a zero-crossing analysis of taped [i] and [u] productions. Pearson product-moment correlations among the three perturbation indices were calculated to examine their relation over time. Coefficients of variation for JF, PPQ, and DPF were considered indicative of the temporal stability of the three measures. JF and PPQ provided redundant information about laryngeal behaviors in steady-state productions. DPF, however, appeared to measure different laryngeal behaviors. Also, JF and PPQ varied considerably within individuals across sessions while DPF was the more temporally stable measure. Multiple sampling sessions and measurement of both the magnitude and direction of period differences are advised for future investigations of vocal frequency perturbation.
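
The two statistics used in the abstract above, the coefficient of variation and the Pearson product-moment correlation, can be sketched as follows. The per-session values are hypothetical and chosen only so that the two series are perfectly proportional. They are not data from the study.

```python
from statistics import mean, stdev


def coefficient_of_variation(values):
    """Sample standard deviation divided by the mean (unitless)."""
    return stdev(values) / mean(values)


def pearson_r(x, y):
    """Pearson product-moment correlation between two equal-length series."""
    mx, my = mean(x), mean(y)
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / (sxx * syy) ** 0.5


# Hypothetical per-session values for one speaker (illustrative only).
jf = [0.5, 0.6, 0.4, 0.5]
ppq = [1.0, 1.2, 0.8, 1.0]  # exactly 2 * jf, so the correlation is 1
cv_jf = coefficient_of_variation(jf)
r = pearson_r(jf, ppq)
```

A correlation near 1 between two indices corresponds to the "redundant information" reported for JF and PPQ. A smaller coefficient of variation across sessions corresponds to greater temporal stability.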

19.
There is only very limited information on the prevalence of voice disorders, particularly for the pediatric population. This study examined the prevalence of dysphonia in a large cohort of children (n = 7389) at 8 years of age. Data were collected within a large prospective epidemiological study and included a formal assessment by one of five research speech and language therapists as well as a parental report of their child's voice. Common risk factors that were also analyzed included sex, sibling numbers, asthma, regular conductive hearing loss, and frequent upper respiratory infection. The research clinicians identified a dysphonia prevalence of 6% compared with a parental report of 11%. Both measures suggested a significant risk of dysphonia for children with older siblings. Other measures were not in agreement between clinician and parental reports. The clinician judgments also suggested significant risk factors for sex (male) but not for any common respiratory or otolaryngological conditions that were analyzed. Parental report suggested significant risk factors with respect to asthma and tonsillectomy. These results are discussed in detail.

20.
The purpose was to determine the clinical value of a multiparametric objective voice evaluation protocol including acoustic and aerodynamic parameters measured mainly on a sustained /a/. This was done by comparison with perceptual analysis of continuous speech by a jury composed of 6 experienced listeners. Voice samples (continuous speech) from 63 male patients with dysphonia and 21 control subjects with normal voices were recorded and assessed by a jury of listeners. The jury was instructed to classify voice samples according to the G (overall dysphonia) component of the GRBAS score on a 4-point scale ranging from 0 for normal to 3 for severe dysphonia. Objective parameters were recorded on an EVA® workstation. As usual with this type of system, parameters were measured mainly on a sustained /a/. Measured parameters included fundamental frequency (F0), intensity, jitter, shimmer, signal-to-noise ratio, Lyapunov coefficient (LC), oral airflow (OAF), maximum phonatory time (MPT), and vocal range (range). Estimated subglottic pressure (ESGP) was determined on a series of /pa/. Discriminant analysis was performed to detect correlation between jury classification and combinations of parameters. Results showed that a nonlinear combination of only six parameters (range, LC, ESGP, MPT, signal-to-noise ratio, and F0) allowed 86% concordance with jury classification. Discussion deals with the relative importance of the different objective parameters for discriminant analysis. Special emphasis is placed on two measurements rarely made in routine clinical workup, i.e., estimated subglottic pressure and Lyapunov coefficient.
