
1.

Reliability of calculating the cepstral peak without linear regression analysis

Yolanda D. Heman-Ackah 《Journal of voice》2004,18(2):203-208

Measures of cepstral peak prominence, using the smoothing algorithm and linear regression analysis software developed by Hillenbrand, have been shown to be reliable predictors of dysphonia in voice samples.(1-4) Recently, the Computerized Speech Laboratory [(CSL) Kay Elemetrics, Pinebrook, New Jersey] has introduced cepstral analysis as a component of that software package. The cepstral peak, in this instance, is calculated by the voice clinician analyzing the phonatory sample by subtracting the value of the peak from the apparent baseline signal. This study compares the ability of cepstral peak values calculated from the CSL software to predict dysphonia reliably with that of the values produced by the smoothing algorithm and linear regression analysis of Hillenbrand. The results of this study show that linear regression analysis is an important step in calculating the cepstral peak prominence, thus limiting the usefulness of software programs that do not employ this step.
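As a rough illustration of the distinction this abstract draws, the sketch below computes a cepstral peak for one frame in two ways: relative to a regression line fitted through the cepstrum, and relative to a flat baseline. It is not Hillenbrand's software or the CSL procedure. The function names, the 60-300 Hz search range, the 1 ms regression start, the use of the mean cepstral level as the "apparent baseline", and the synthetic test signal are all assumptions made for the example, and Hillenbrand's smoothing step is omitted.

```python
import numpy as np

def synthetic_vowel(f0=100.0, fs=16000, n=2048, seed=0):
    """Hypothetical test signal: harmonics of f0 with 1/k amplitudes plus weak noise."""
    t = np.arange(n) / fs
    rng = np.random.default_rng(seed)
    harmonics = range(1, int(0.45 * fs / f0))
    tone = sum(np.sin(2 * np.pi * k * f0 * t) / k for k in harmonics)
    return tone + 0.001 * rng.standard_normal(n)

def cepstrum_db(frame, fs):
    """Return (quefrency in seconds, cepstral magnitude in dB) for one frame."""
    windowed = frame * np.hanning(len(frame))
    log_spectrum = 20 * np.log10(np.abs(np.fft.fft(windowed)) + 1e-12)
    cep = np.abs(np.fft.ifft(log_spectrum))
    half = len(frame) // 2
    quefrency = np.arange(half) / fs
    return quefrency, 20 * np.log10(cep[:half] + 1e-12)

def cepstral_peak_measures(frame, fs, f0_min=60.0, f0_max=300.0):
    """Compare a regression-based peak prominence with a flat-baseline peak height."""
    q, c = cepstrum_db(frame, fs)
    # Look for the peak only at quefrencies that correspond to plausible pitch periods.
    search = (q >= 1.0 / f0_max) & (q <= 1.0 / f0_min)
    peak_idx = np.flatnonzero(search)[np.argmax(c[search])]
    # Assumed regression range: every quefrency from 1 ms upward.
    fit = q >= 0.001
    slope, intercept = np.polyfit(q[fit], c[fit], 1)
    regression_baseline = slope * q[peak_idx] + intercept
    return {
        "f0_estimate": 1.0 / q[peak_idx],
        "cpp_regression": c[peak_idx] - regression_baseline,
        # Assumed stand-in for a baseline read off by eye: the mean cepstral level.
        "peak_minus_mean": c[peak_idx] - np.mean(c[fit]),
    }
```

Calling `cepstral_peak_measures(synthetic_vowel(), 16000)` returns both values for the same frame. Because the regression line can slope, the two baselines generally differ at the peak's quefrency, which is one way the two calculations can diverge.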

2.

Vocal Improvement After Voice Therapy in Unilateral Vocal Fold Paralysis

Antonio Schindler Alessandro Bottero Pasquale Capaccio Daniela Ginocchio Fulvio Adorni Francesco Ottaviani 《Journal of voice》2008,22(1):113-118

Unilateral vocal fold paralysis (UVFP) is associated with changes in acoustic and aerodynamic voice measurements and can have a significant impact on a patient's quality of life. Few objective data regarding the efficacy of voice therapy for UVFP exist. The aim of this study was to retrospectively analyze voice modifications in a group of patients with UVFP before and after voice therapy. Forty patients with UVFP of different etiology were included in the study. Each subject had voice therapy with an experienced speech/language pathologist twice a week; the mean number of sessions was 12.6. A multidimensional assessment protocol was used; it included videoendoscopy, the maximum phonation time (MPT), the GIRBAS scale, spectrograms and a perturbation analysis, and the Voice Handicap Index (VHI). Pre- and posttreatment data were compared by means of the Wilcoxon and Student's t tests. A complete glottal closure was seen in 8 patients before voice therapy and in 14 afterward. Mean MPT increased significantly. In the perceptual assessment, the difference was significant for five out of six parameters. A significant improvement was found on spectrographic analysis; as for perturbation analysis, the differences in jitter, shimmer, and noise-to-harmonic ratio values were significant. VHI values showed a clear and significant improvement. A significant improvement of voice quality and quality of life after voice therapy is an often reached and reasonable goal in patients with UVFP.

3.

Level and Center Frequency of the Singer's Formant (total citations: 2; self-citations: 0; citations by others: 2)

Johan Sundberg 《Journal of voice》2001,15(2):176-186

The "singer's formant" is a prominent spectrum envelope peak near 3 kHz, typically found in voiced sounds produced by classical operatic singers. According to previous research, it is mainly a resonatory phenomenon produced by a clustering of formants 3, 4, and 5. Its level relative to the first formant peak varies depending on vowel, vocal loudness, and other factors. Its dependence on vowel formant frequencies is examined. Applying the acoustic theory of voice production, the level difference between the first and third formant is calculated for some standard vowels. The difference between observed and calculated levels is determined for various voices. It is found to vary considerably more between vowels sung by professional singers than by untrained voices. The center frequency of the singer's formant as determined from long-term spectrum analysis of commercial recordings is found to increase slightly with the pitch range of the voice classification.

4.

A Comparative Study of Acoustic Voice Measurements by Means of Dr. Speech and Computerized Speech Lab

Ilse Smits Piet Ceuppens Marc S. De Bodt 《Journal of voice》2005,19(2):38-196

In this study, the calculations and results of acoustic voice analysis as calculated by two different analysis systems (Doctor Speech (DRS), Tiger Electronics, Neu-Anspach, Germany, and Computerized Speech Lab (CSL), Kay Elemetrics Corporation, Lincoln Park, NJ) are compared. A group of 120 normal voices was selected for analysis of the objective parameters: fundamental frequency (F0), variation of F0 (F0SD), jitter, shimmer, and harmonics-to-noise ratio (HNR). The subject group was a random selection of normal voices of adults. The aim of this comparison was to identify differences and similarities in data measurements between the two systems to make data transfer possible. A significant correlation was found for F0, HNR, and relative shimmer. The correlation for jitter (relative and absolute) and F0SD was weak. DRS and CSL are not comparable in absolute figures, but their judgment against normative data is identical. Further research is necessary to explore the effect on pathological voices or child voices.

5.

Noise estimation in voice signals using short-term cepstral analysis

Murphy PJ Akande OO 《The Journal of the Acoustical Society of America》2007,121(3):1679-1690

6.

Objective Indices of Perceived Vocal Strain

《Journal of voice》2019,33(6):838-845

Background

A limited number of experiments have investigated the perception of strain compared to the voice qualities of breathiness and roughness despite its widespread occurrence in patients who have hyperfunctional voice disorders, adductor spasmodic dysphonia, and vocal fold paralysis among others.

Objective

The purpose of this study is to determine the perceptual basis of strain through identification and exploration of acoustic and psychoacoustic measures.

Methods

Twelve listeners evaluated the degree of strain for 28 dysphonic phonation samples on a five-point rating scale task. Computational estimates based on cepstrum, sharpness, and spectral moments (linear and transformed with auditory processing front-end) were correlated to the perceptual ratings.

Results

Perceived strain was strongly correlated with cepstral peak prominence, sharpness, and a subset of the spectral metrics. Spectral energy distribution measures from the output of an auditory processing front-end (i.e., excitation pattern and specific loudness pattern) accounted for 77–79% of the model variance for strained voices in combination with the cepstral measure.

Conclusions

Modeling the perception of strain using an auditory front-end prior to acoustic analysis provides better characterization of the perceptual ratings of strain, similar to our prior work on breathiness and roughness. Results also provide evidence that the sharpness model of Fastl and Zwicker (2007) is one of the strong predictors of strain perception.

7.

Voice and Laryngeal Configuration of Men With Unilateral Vocal Fold Paralysis Before and After Medialization

Karine Schwarz Carla Aparecida Cielo Nédio Steffen Jéfferson Becker Geraldo Pereira Jotz 《Journal of voice》2011,25(5):611-618

Aim

To describe the laryngeal configuration and the voice of male patients diagnosed with unilateral vocal fold paralysis (UVFP) before and after medialization.

Methods

A retrospective study involving the collection of data from medical records of 142 patients diagnosed with UVFP from January 2003 to April 2009, submitted to auditory-perceptual assessment of voices and visual perception of laryngeal images before and after medialization.

Results

The study included data from 24 male patients, with an average age of 60.7 years, who underwent three surgical medialization techniques (injection of hyaluronic acid, type I thyroplasty, and injection of Teflon). Before treatment, the position of the paralyzed vocal fold was seen to have a significant influence on the passing of the healthy vocal fold beyond the midline and on the overall degree of dysphonia. After treatment, the following were found to be significant: complete glottic closure; a linear free margin of the vocal fold; the paralyzed vocal fold in the median position; reduction of hoarseness, roughness, and breathiness (more frequently mild) and of asthenia (more frequently normal and mild); tension and instability (more frequently normal); and a decrease in the overall degree of dysphonia.

Conclusion

The position of the paralyzed vocal fold influences the position of the healthy vocal fold in relation to the midline and the overall degree of dysphonia. All three treatments improved the glottic configuration and the voice of patients with UVFP.

8.

Laryngeal function and vocal fatigue after prolonged reading in individuals with unilateral vocal fold paralysis

Lisa N. Kelchner Linda Lee Joseph C. Stemple 《Journal of voice》2003,17(4):513-528

The purpose of the present study was to examine the effect of prolonged loud reading, intended to induce fatigue, on vocal function in adults with unilateral vocal fold paralysis (UVFP). Subjects were 20 adults, 37–60 years old, with UVFP secondary to recurrent laryngeal nerve paralysis. Subjective ratings and instrumental measures of vocal function were obtained before and after reading. Statistical analysis revealed subjects rated their vocal quality and physical effort for voicing more severely following prolonged loud reading, whereas expert raters did not detect a significant perceptual difference in vocal quality. Reading fundamental frequency (Fo) was significantly increased following prolonged loud reading, as were mean airflow rates at all pitch conditions. Maximum phonation times for comfort and low pitches significantly decreased during posttests. Multiple regression analyses revealed significant associations between ratings of posttest physical effort and select posttest measures. Interpretation of results indicates the prolonged loud reading task was successful in vocally fatiguing most of the UVFP subjects. Key physiologic correlates of vocal fatigue, in individuals with UVFP, include further reduction of glottic efficiency, resulting in decreased regulation of glottic airflow and a temporary destabilization of speaking fundamental frequency.

9.

Loud speech over noise: some spectral attributes, with gender differences

Ternström S Bohman M Södersten M 《The Journal of the Acoustical Society of America》2006,119(3):1648-1665

10.

Nasalance Changes After Functional Endoscopic Sinus Surgery (total citations: 3; self-citations: 0; citations by others: 3)

Renata Soneghet Rodrigo Paula Santos Mara Behlau Walter Habermann Gerhard Friedrich Heinz Stammberger 《Journal of voice》2002,16(3):392-397

Forty adult patients diagnosed with chronic rhinosinusitis who underwent functional endoscopic sinus surgery (FESS) were analyzed with respect to postoperative resonatory voice changes. For evaluation, the patients were asked about their subjective impression of voice changes using a questionnaire. An objective assessment was performed by determining the so-called nasalance using the Nasometer (Kay Elemetrics) preoperatively, at the immediate postoperative follow-up (2 days after surgery), and approximately 1 month after surgery. The mean nasalance values increased significantly 1 month after FESS, whereas the immediate postoperative control (2 days after surgery) showed a decrease of nasalance. Although FESS is a minimally invasive procedure, it can change the acoustic characteristics of the vocal tract in the long term and produce a significant increase in nasality. The authors strongly recommend that clinicians inform all patients, in particular voice professionals, about the possible effects of endonasal sinus surgery on voice quality.

11.

The sound level of the singer's formant in professional singing (total citations: 2; self-citations: 0; citations by others: 2)

G Bloothooft R Plomp 《The Journal of the Acoustical Society of America》1986,79(6):2028-2033

The relative sound level of the "singer's formant," measured in a 1/3-oct band with a center frequency of 2.5 kHz for males and of 3.16 kHz for females, has been investigated for 14 professional singers, nine different modes of singing, nine different vowels, variations in overall sound-pressure level, and fundamental frequencies ranging from 98 up to 880 Hz. Variation in the sound level of the singer's formant due to differences among male singers was small (4 dB), the factors vowels (16 dB) and fundamental frequency (9-14 dB) had an intermediate effect, while the largest variation was found for differences among female singers (24 dB), between modes of singing (vocal effort) (23 dB), and in overall sound-pressure level (more than 30 dB). In spite of this great potential variability, for each mode of singing the sound level of the singer's formant was remarkably constant up to F0 = 392 Hz, due to adaptation of vocal effort. This may be explained as the result of the perceptual demand of a constant voice quality. The definition of the singer's formant is discussed.
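The band measurement described in this abstract can be sketched as follows. This is only an illustration and not the authors' analysis chain: it assumes that "relative" means relative to the overall level of the signal, uses the standard base-2 third-octave edges (center frequency times 2^(±1/6)), and the function names are hypothetical.

```python
import numpy as np

def third_octave_edges(fc):
    """Lower and upper edge of a 1/3-octave band centered on fc (base-2 definition)."""
    factor = 2.0 ** (1.0 / 6.0)
    return fc / factor, fc * factor

def relative_band_level_db(signal, fs, fc=2500.0):
    """Level of the 1/3-octave band at fc relative to the overall level, in dB.

    Assumption: the overall level is the total power of the same windowed spectrum.
    """
    spectrum = np.abs(np.fft.rfft(signal * np.hanning(len(signal)))) ** 2
    freqs = np.fft.rfftfreq(len(signal), d=1.0 / fs)
    lo, hi = third_octave_edges(fc)
    band = (freqs >= lo) & (freqs < hi)
    # Small constants guard against log(0) for silent input.
    return 10.0 * np.log10((spectrum[band].sum() + 1e-20) / (spectrum.sum() + 1e-20))
```

For the 2.5 kHz center frequency used for male singers, the band runs from roughly 2.23 kHz to 2.81 kHz. Passing `fc=3160.0` gives the corresponding band for the female singers.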

12.

Vocal violence in actors: An investigation into its acoustic consequences and the effects of hygienic laryngeal release training

Nelson Roy Karen S. Ryker Diane M. Bless 《Journal of voice》2000,14(2):215-230

Acoustic analysis techniques were used to investigate the short-term consequences of vocally violent behavior, and to compare voice production before and after training in hygienic laryngeal release (HLR) techniques. Twenty-seven actors ranging in age from 17 to 48 years were audiorecorded before and after multiple productions of 4 vocally violent behaviors: grunting, groaning, sobbing, and shouting. After training in HLR techniques, the experimental protocol was repeated. Audiorecordings of vowels (produced at 3 pitch levels: modal F0, minimum F0, maximum F0) before and after vocal violence, and before and after HLR training, were analyzed using the Multidimensional Voice Program (4305, Kay Elemetrics Corp, Lincoln Park, NJ). After vocal violence, no consistent acoustic changes were detected for voice generated at modal and minimum F0; however, significant increases in both fundamental frequency range and maximum F0 were observed. After training in HLR techniques, acoustic measures sensitive to pitch and amplitude perturbation, and non-harmonic noise, improved across pitch levels. The results also indicated that vocal training does defend the laryngeal system from undesirable changes related to vocally violent maneuvers that might surface at the extremes of an actor's pitch range. Because the HLR technique used in this investigation was multimodal, interesting questions are raised regarding which aspect of training is primarily responsible for the observed effects. Further study is required to identify such factors.

13.

Personality and voice disorders: A multitrait-multidisorder analysis

Nelson Roy Diane M. Bless Dennis Heisey 《Journal of voice》2000,14(4):521-548

14.

Acoustic and Perceptual Analyses of Brazilian Male Actors' and Nonactors' Voices: Long-term Average Spectrum and the “Actor's Formant”

Suely Master Noemi De Biase Brasília Maria Chiari Anne-Maria Laukkanen 《Journal of voice》2008,22(2):146-154

SUMMARY: This study investigates the possible differences between actors' and nonactors' vocal projection strategies using acoustic and perceptual analyses. A total of 11 male actors and 10 male nonactors volunteered as subjects, reading an extended text sample in habitual, moderate, and loud levels. The samples were analyzed for sound pressure level (SPL), alpha ratio (difference between the average SPL of the 1-5 kHz region and the average SPL of the 50 Hz-1 kHz region), fundamental frequency (F0), and long-term average spectrum (LTAS). Through LTAS, the mean frequency of the first formant (F1) range, the mean frequency of the "actor's formant," the level differences between the F1 frequency region and the F0 region (L1-L0), and the level differences between the strongest peak at 0-1 kHz and that at 3-4 kHz were measured. Eight voice specialists evaluated perceptually the degree of projection, loudness, and tension in the samples. The actors had a greater alpha ratio, stronger level of the "actor's formant" range, and a higher degree of perceived projection and loudness in all loudness levels. SPL, however, did not differ significantly between the actors and nonactors, and no differences were found in the mean formant frequencies ranges. The alpha ratio and the relative level of the "actor's formant" range seemed to be related to the degree of perceived loudness. From the physiological point of view, a more favorable glottal setting, providing a higher glottal closing speed, may be characteristic of these actors' projected voices. So, the projected voices, in this group of actors, were more related to the glottic source than to the resonance of the vocal tract.
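The alpha ratio defined in this abstract (average level in the 1-5 kHz region minus average level in the 50 Hz-1 kHz region) can be sketched from a simple long-term average spectrum. The code below is a minimal illustration rather than the authors' procedure: the frame length, hop size, Hann window, and the choice to take each band's level as 10·log10 of its mean power are assumptions, and the function names are hypothetical. The levels are uncalibrated, so only their difference is meaningful.

```python
import numpy as np

def ltas_power(signal, fs, frame_len=1024, hop=512):
    """Long-term average power spectrum: mean of |FFT|^2 over overlapping Hann frames."""
    window = np.hanning(frame_len)
    n_frames = 1 + (len(signal) - frame_len) // hop
    frames = np.stack(
        [signal[i * hop : i * hop + frame_len] * window for i in range(n_frames)]
    )
    power = np.mean(np.abs(np.fft.rfft(frames, axis=1)) ** 2, axis=0)
    freqs = np.fft.rfftfreq(frame_len, d=1.0 / fs)
    return freqs, power

def band_level_db(freqs, power, lo, hi):
    """Average level of the band [lo, hi) in dB (uncalibrated)."""
    band = (freqs >= lo) & (freqs < hi)
    return 10.0 * np.log10(np.mean(power[band]) + 1e-20)

def alpha_ratio_db(signal, fs):
    """Level of 1-5 kHz minus level of 50 Hz-1 kHz; requires fs above 10 kHz."""
    freqs, power = ltas_power(signal, fs)
    return band_level_db(freqs, power, 1000.0, 5000.0) - band_level_db(
        freqs, power, 50.0, 1000.0
    )
```

Under this definition a signal with relatively more energy above 1 kHz yields a larger (less negative) alpha ratio, which is the direction of the difference the abstract reports for the actors.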

15.

Spastic/spasmodic vs. tremulous vocal quality: motor speech profile analysis

Donna S. Lundy Soham Roy Jun W. Xue Roy R. Casiano Daniel Jassir 《Journal of voice》2004,18(1):146-152

Strained, strangled, and tremulous vocal qualities that are typically seen in adductor spasmodic dysphonia (ADSD), voice tremor (Tremor), and the spastic dysarthria of amyotrophic lateral sclerosis (ALS) may sound similar and be difficult to differentiate. The purpose of this study was to determine if these vocal qualities of neurologic origin could be differentiated on the basis of acoustic and motor speech parameters. Three groups of subjects (ADSD, ALS, and Tremor) were analyzed by the Motor Speech Profile System (Kay Elemetrics, Lincoln Park, NJ) for fundamental frequency (Fo), standard deviation of Fo, diadochokinetic rate (ddk), standard deviation of ddk, mean intensity and standard deviation of ddk, frequency and amplitude variability in connected speech, and speaking rate in connected speech. Profiles of the three groups are presented with the significant features that differentiated one from the other.

16.

Correlation of VHI-10 to Voice Laboratory Measurements Across Five Common Voice Disorders

《Journal of voice》2014,28(4):440-448

Objective

To correlate change in Voice Handicap Index (VHI)-10 scores with corresponding voice laboratory measures across five voice disorders.

Study Design

Retrospective study.

Methods

One hundred fifty patients aged >18 years with primary diagnosis of vocal fold lesions, primary muscle tension dysphonia-1, atrophy, unilateral vocal fold paralysis (UVFP), and scar. For each group, participants with the largest change in VHI-10 between two periods (T_A and T_B) were selected. The dates of the VHI-10 values were linked to corresponding acoustic/aerodynamic and audio-perceptual measures. Changes in voice laboratory values were analyzed for correlation with each other and with VHI-10.

Results

VHI-10 scores were greater for patients with UVFP than other disorders. The only disorder-specific correlation between voice laboratory measure and VHI-10 was average phonatory airflow in speech for patients with UVFP. Average airflow in repeated phonemes was strongly correlated with average airflow in speech (r = 0.75). Acoustic measures did not significantly change between time points.

Conclusions

The lack of correlations between the VHI-10 change scores and voice laboratory measures may be due to differing constructs of each measure; namely, handicap versus physiological function. Presuming corroboration between these measures may be faulty. Average airflow in speech may be the most ecologically valid measure for patients with UVFP. Although aerodynamic measures changed between the time points, acoustic measures did not. Correlations to VHI-10 and change between time points may be found with other acoustic measures.

17.

The Speaker's Formant

Irene Velsvik Bele 《Journal of voice》2006,20(4):555-578

The current study concerns speaking voice quality in two groups of professional voice users, teachers (n = 35) and actors (n = 36), representing trained and untrained voices. The voice quality of text reading at two intensity levels was acoustically analyzed. The central concept was the speaker's formant (SPF), related to the perceptual characteristics "better normal voice quality" (BNQ) and "worse normal voice quality" (WNQ). The purpose of the current study was to get closer to the origin of the phenomenon of the SPF, and to discover the differences in spectral and formant characteristics between the two professional groups and the two voice quality groups. The acoustic analyses were long-term average spectrum (LTAS) and spectrographical measurements of formant frequencies. At very high intensities, the spectral slope was rather quadrangular without a clear SPF peak. The trained voices had a higher energy level in the SPF region compared with the untrained, significantly so in loud phonation. The SPF seemed to be related to both sufficiently strong overtones and a glottal setting, allowing for a lowering of F4 and a closeness of F3 and F4. However, the existence of SPF also in LTAS of the WNQ voices implies that more research is warranted concerning the formation of SPF, and concerning the acoustic correlates of the BNQ voices.

18.

Resonant Voice: Spectral and Nasendoscopic Analysis

Cara G. Smith Eileen M. Finnegan Michael P. Karnell 《Journal of voice》2005,19(4):607-622

Although resonant voice therapy is a widely used therapeutic approach, little is known about what characterizes resonant voice and how it is physiologically produced. The purpose of this study was to test the hypothesis that resonant voice is produced by narrowing the laryngeal vestibule and is characterized by first formant tuning and more ample harmonics. Videonasendoscopic recordings of the laryngeal vestibule were made during nonresonant and resonant productions of /i/ in six subjects. Spectrums of the two voice types were also obtained. Spectral analysis showed that first formant tuning was exhibited during resonant voice productions and that the degree of harmonic enhancement in the range of 2.0 to 3.5 kHz was related to voice quality: nonresonant voice had the least amount of energy in this range, whereas a resonant-relaxed voice had more energy, and a resonant-bright voice had the greatest amount of energy. Visual-perceptual judgments of the videoendoscopic data indicated that laryngeal vestibule constriction was not consistently associated with resonant voice production.

19.

Effects of Tonsillectomy on Speech Spectrum

Hakki Gökhan Ilk Osman Eroğul Bülent Satar Yalçin Özkaptan 《Journal of voice》2002,16(4):580-586

Changes in the speech spectrum of vowels and consonants before and after tonsillectomy were investigated to find out the impact of the operation on speech quality. Speech recordings obtained from patients were analyzed using the Kay Elemetrics, Multi-Dimensional Voice Processing (MDVP Advanced) software. Examination of the time-course changes after the operation revealed that certain speech parameters changed. These changes were mainly F3 (formant center frequency) and B3 (formant bandwidth) for the vowel /o/ and a slight decrease in B1 and B2 for the vowel /a/. The noise-to-harmonic ratio (NHR) also decreased slightly, suggesting less nasalized vowels. It was also observed that the glottal fricative consonant /h/ was affected. The larger the tonsil had been, the more changes were seen in the speech spectrum. The changes in the speech characteristics (except F3 and B3 for the vowel /o/) tended to recover, suggesting an involvement of auditory feedback and/or replacement of a new soft tissue with the tonsils. Although the changes were minimal and, therefore, have little effect on the extracted acoustic parameters, they cannot be disregarded for those relying on their voice for professional reasons, that is, singers, professional speakers, and so forth.

20.

Deviant Vocal Fold Vibration as Observed During Videokymography: The Effect on Voice Quality

Irma M. Verdonck-de Leeuw Joost M. Festen Hans F. Mahieu 《Journal of voice》2001,15(3):313-322

Videokymographic images of deviant or irregular vocal fold vibration, including diplophonia, the transition from falsetto to modal voice, irregular vibration onset and offset, and phonation following partial laryngectomy were compared with the synchronously recorded acoustic speech signals. A clear relation was shown between videokymographic image sequences and acoustic speech signals, and the effect of irregular or incomplete vocal fold vibration patterns was recognized in the amount of perceived breathiness and roughness and by the harmonics-to-noise ratio in the speech signal. Mechanisms causing roughness are the presence of mucus, phase differences between the left and right vocal fold, and short-term frequency and amplitude modulation. It can be concluded that the use of simultaneously recorded videokymographic image sequences and speech signals contributes to the understanding of the effect of irregular vocal fold vibration on voice quality.