
1.

Perceptual evaluation of voice quality and its correlation with acoustic measurements (total citations: 2; self-citations: 0; citations by others: 2)

Tarika Bhuta Linda Patrick James D. Garnett 《Journal of voice》2004,18(3):299-304

The aim was to determine whether a correlation exists between the Grade, Roughness, Breathiness, Aesthenia, Strain (GRBAS) scale (a subjective measure of voice) and the Multi-Dimensional Voice Program (MDVP) scale (an objective measure of voice). A retrospective review of 37 voice patients (12 male/25 female) was conducted. Each voice was perceptually evaluated using the GRBAS scale by an experienced speech pathologist and acoustically analyzed using the MDVP scale. Statistical analysis using a multivariate regression model identified a significant correlation between the noise-related parameters of MDVP and the components of the GRBAS scale. Grade correlated with voice turbulence index (VTI), noise harmonic ratio (NHR), and soft phonation index (SPI). Roughness correlated with NHR only. Breathiness correlated with SPI only. Aesthenia also correlated with SPI only. Of the 19 acoustic variables measured by the MDVP system, only three noise parameters significantly correlated with the GRBAS perceptual voice analysis. Perhaps "noise" is the perceived acoustical quality of the dysphonic voice. A voice quantifying measure such as a "voice index score" could be proposed using the GRBAS scoring and the three clinically relevant MDVP values following further studies.

2.

Acoustical analysis and perceptual evaluation of tracheoesophageal prosthetic voice

Corina J. van As Frans J.M. Hilgers Irma M. Verdonck-de Leeuw Florien J. Koopmans-van Beinum 《Journal of voice》1998,12(2):239-248

Voice analysis was performed on 21 “standard” laryngectomized, male patients with a Provox® voice prosthesis, along with an age- and sex-matched control group of 20 normal speakers, using acoustical analyses (MDVP and CSL, Kay Elemetrics Corp.), maximum phonation time measurements, and perceptual evaluations. Comparison between MDVP and CSL revealed that the latter was not useful for the analysis of laryngectomized prosthetic voices. In contrast, MDVP seems suitable for this purpose, and contains a large number of parameters that significantly differentiate between patient and control speakers, as did the perceptual ratings and the maximum phonation time. Fundamental frequency appeared to be comparable for patients and control speakers. A significant influence of stoma occlusion and age was found for some voice parameters. Factor analyses showed correlations between the different MDVP parameters and correlations between the MDVP parameters and the perceptual ratings.

3.

Speech transmission performance and the effect of acoustical remedies in a dome

Satoshi Inoue Kiyoshi Sugino Hiroyuki Imaizumi 《Applied Acoustics》2009,70(1):221-230

For the purpose of improving speech transmission performance in a dome space, the acoustical properties in a dome having a diameter of 20 m were examined. The acoustical properties measured evenly on the floor of the dome were evaluated both objectively and subjectively, and the interrelationship of the objective and subjective measures was also examined. Then, on the basis of the results of the study, simplified acoustical remedies were applied to the dome to improve speech intelligibility, and the effect of the remedies was also examined. The following findings were obtained from this investigation: (1) The speech transmission performance in the dome space without treatment by absorptive materials varies greatly with the locations of sound sources and observation points: a range of 0.17-0.59 for RASTI value and a range of 30-97% for speech intelligibility test results. (2) There are peculiar observation points at which speech transmission quality is very high due to a considerable sum of the energy arriving in the first 0.06 s after the arrival of the direct sound. (3) Of all the measured acoustical parameters, RASTI, EDT in the 1 kHz band, early-to-late arriving sound energy ratio, and Ts corresponded well to the speech intelligibility test scores. (4) Rubber tiles, cotton canvas 12 m in length, and glass wool board are remarkably effective in improving speech intelligibility due to increased sound absorption and the diffusion effect.
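As an illustration of the early-to-late arriving sound energy ratio named in finding (3), the following is a minimal sketch rather than the authors' measurement procedure. It assumes the impulse response starts at the arrival of the direct sound, and it borrows the 0.06 s window from finding (2) as the early/late boundary; the abstract does not state which boundary the study used. The function name and the toy impulse response are hypothetical.

```python
import math


def early_to_late_ratio_db(impulse_response, sample_rate, split_s=0.06):
    """Early-to-late energy ratio in dB.

    Assumes sample 0 is the arrival of the direct sound. The 0.06 s
    default split is an assumption, not a value confirmed by the source.
    """
    split = round(split_s * sample_rate)
    early = sum(h * h for h in impulse_response[:split])
    late = sum(h * h for h in impulse_response[split:])
    return 10 * math.log10(early / late)


# Toy impulse response (hypothetical): 60 ms of strong early energy,
# followed by 100 ms of a much weaker tail, sampled at 1 kHz.
fs = 1000
ir = [1.0] * 60 + [0.1] * 100
ratio_db = early_to_late_ratio_db(ir, fs)
print(f"early-to-late ratio: {ratio_db:.2f} dB")
```

A larger value means more of the energy arrives early, which is the condition the abstract associates with high speech transmission quality.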

4.

S.K. Tang 《Applied Acoustics》2008,69(12):1318-1331

A survey of the speech-related acoustical parameters in Hong Kong classrooms having standardized architectural layouts is carried out in the present study. Results suggest that these acoustical parameters are highly correlated with each other even across different octave bands. It is also found that the relationships between parameters of different kinds do not depend on the frequency bands. Besides, the present results indicate that the sound pulse decay inside a not very reverberant classroom consists of an initial fast decay, leading to deviations of the field survey results from those predicted by the exponential decay under the uniform sound energy decay assumption. It is believed that the strong correlations between the various speech-related acoustical parameters and the regression information obtained in the present study can help the estimation of the speech quality of the classrooms in the design stage.

5.

Perceptual evaluations of voice scatter in unison choir sounds

Sten Ternström 《Journal of voice》1993,7(2)

The preferences of experienced listeners for pitch and formant frequency dispersion in unison choir sounds were explored using synthesized stimuli. Two types of dispersion were investigated: (a) pitch scatter, which arises when voices in an ensemble exhibit small differences in mean fundamental frequency, and (b) spectral smear, defined as such dispersion of formants 3 to 5 as arises from differences in vocal tract length. Each stimulus represented a choir section of five bass, tenor, alto, or soprano voices, producing the vowel [u], [a], or [w]. Subjects chose one dispersion level out of six available, selecting the “maximum tolerable” in a first run and the “preferred” in a second run. The listeners were very different in their tolerance for dispersion. Typical scatter choices were 14 cent standard deviation for “tolerable” and 0 or 5 cent for “preferred.” The smear choices were less consistent; the standard deviations were 12 and 7%, respectively. In all modes of assessment, the largest dispersion was chosen for the vowel [u] on a bass tone. There was a vowel effect on the smear choices. The effects of voice category were not significant.
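To make the "cent standard deviation" figures concrete, here is a minimal sketch of how pitch scatter could be computed from the mean fundamental frequencies of the voices in a section. It relies on the standard definition of the cent (1200 cents per octave); the function name and the five example voices are hypothetical, and using the population standard deviation is an assumption, since the abstract does not say which estimator was used.

```python
import math
import statistics


def pitch_scatter_cents(mean_f0_hz):
    """Standard deviation, in cents, of the voices' mean F0 values.

    Each frequency is mapped to a cent scale (1200 * log2 f), so the
    result does not depend on the absolute pitch of the section.
    Population standard deviation is an assumption of this sketch.
    """
    cents = [1200 * math.log2(f) for f in mean_f0_hz]
    return statistics.pstdev(cents)


# Five hypothetical bass voices around 110 Hz, offset by
# -10, -5, 0, +5 and +10 cents from the centre pitch.
voices = [110.0 * 2 ** (c / 1200) for c in (-10, -5, 0, 5, 10)]
scatter = pitch_scatter_cents(voices)
print(f"pitch scatter: {scatter:.2f} cents")
```

For these offsets the scatter comes out at about 7 cents, which sits between the "preferred" and "tolerable" choices reported in the abstract.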

6.

Acoustic characteristics of rough voice: Subharmonics

Koichi Omori Hisayoshi Kojima Rajesh Kakani David H. Slavit Stanley M. Blaugrund 《Journal of voice》1997,11(1):40-47

This study investigates the relationship between rough voice and the presence of subharmonics, which correspond to smaller yet distinct peaks located between two consecutive harmonic peaks in the power spectrum. Spectrum analysis was undertaken in 389 pathologic voices, of which 20 had subharmonics. Although all 20 voices had roughness perceptually, 8 had normal jitter and/or shimmer. The degree of roughness had a significant inverse relationship with the frequency of subharmonics. By digital signal processing, sound samples with various types of subharmonics were synthesized and perceptually analyzed. Power and frequency of subharmonics in the synthesized sound also had significant relationships with the degree of roughness. Rough voice is acoustically characterized not only by jitter and shimmer but also by the presence of subharmonics in the power spectrum. Subharmonics are important acoustic properties for objective evaluation of rough voices.

7.

Singing power ratio: Quantitative evaluation of singing voice quality

Koichi Omori Ashutosh Kacker Linda M. Carroll William D. Riley Stanley M. Blaugrund 《Journal of voice》1996,10(3):228-235

This paper presents a parameter for objectively evaluating singing voice quality. The power spectrum of the vowel sound /a/ was analyzed by Fast Fourier Transform. The greatest harmonic peak between 2 and 4 kHz and the greatest harmonic peak between 0 and 2 kHz were identified. The power ratio of these peaks, termed singing power ratio (SPR), was calculated in 37 singers and 20 nonsingers. The SPR of sung /a/ in singers was significantly greater than in nonsingers. In singers, the SPR of sung /a/ was significantly greater than that of spoken /a/. By digital signal processing, the power spectrum of sung /a/ was varied, and the processed sounds were perceptually analyzed. SPR had a significant relationship with perceptual scores of “ringing” quality. SPR provides an important quantitative measurement for evaluating singing voice quality for all voice types, including soprano.
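The SPR definition above can be sketched in a few lines. This is an illustrative sketch under stated assumptions, not the authors' implementation: it uses a plain discrete Fourier transform from the standard library instead of an FFT, takes the strongest spectral bin in each band rather than performing explicit harmonic peak picking, expresses the ratio in dB, and assigns the 2 kHz bin to the upper band. The function names and the synthetic two-component signal are hypothetical.

```python
import cmath
import math


def power_spectrum(samples, sample_rate):
    """Return (frequency_hz, power) pairs for the non-negative DFT bins.

    A direct O(N^2) DFT keeps the sketch dependency-free; the paper
    itself reports using a Fast Fourier Transform.
    """
    n = len(samples)
    spectrum = []
    for k in range(n // 2 + 1):
        acc = sum(x * cmath.exp(-2j * math.pi * k * i / n)
                  for i, x in enumerate(samples))
        spectrum.append((k * sample_rate / n, abs(acc) ** 2))
    return spectrum


def singing_power_ratio_db(samples, sample_rate):
    """Power of the strongest 2-4 kHz bin relative to the strongest
    0-2 kHz bin, in dB (band edges and dB scaling are assumptions)."""
    spec = power_spectrum(samples, sample_rate)
    low_peak = max(p for f, p in spec if 0 < f < 2000)
    high_peak = max(p for f, p in spec if 2000 <= f <= 4000)
    return 10 * math.log10(high_peak / low_peak)


# Synthetic test tone (hypothetical): a 200 Hz component plus a 3 kHz
# component at one tenth of its amplitude, i.e. 20 dB weaker.
fs = 10000
n = 500
signal = [math.sin(2 * math.pi * 200 * i / fs)
          + 0.1 * math.sin(2 * math.pi * 3000 * i / fs)
          for i in range(n)]
spr = singing_power_ratio_db(signal, fs)
print(f"SPR: {spr:.1f} dB")
```

Both test frequencies fall exactly on DFT bins here, so no window function is needed; recordings of real voices would normally call for windowing and a proper peak search.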

8.

The acoustical characterization of orchestra platforms and uncertainty estimation of the results

Maria Giovannini Arianna Astolfi 《Applied Acoustics》2010,71(10):889-901

Measurements have been carried out on furnished orchestra platforms in four concert halls in Italy in order to describe the sound field perceived by musicians. The heterogeneous nature of the orchestra suggested a procedure able to take into account the mutual hearing between instrumental sections. The measured parameters were the early, late and total support, the reverberation time, the early decay time and the clarity index. A part of the study has been devoted to the measurement uncertainty estimation. The source directivity and the small displacements of the microphone influence the early decay time to a great extent, while the on-platform spatial variability affects both the early decay time and the clarity index. Per-section early support shows differences that render the overall spatial mean inappropriate to describe the stage as a whole. For the other parameters an overall mean platform value can instead be suitable, even though, in the case of clarity, a more evident group variability is observed. The values of late support, reverberation time, early decay time and clarity index, proposed in the literature as suitable measures of reverberance for musicians, are not all intercorrelated, indicating that not all these parameters can be associated with the same subjective impression.

9.

Voice-over: Perceptual and Acoustic Analysis of Vocal Features

Reny Medrado Leslie Piccolotto Ferreira Mara Behlau 《Journal of voice》2005,19(3):340-349

Voice-overs are professional voice users who use their voices to market products in the electronic media. The purposes of this study were to (1) analyze voice-overed and non-overed productions of an advertising text in two groups consisting of 10 male professional voice-overs and 10 male non-voice-overs; and (2) determine specific acoustic features of voice-over productions in both groups. A naïve group of listeners was engaged for the perceptual analysis of the recorded advertising text. The voice-overed production samples from both groups were submitted for analysis of acoustic and temporal features. The following parameters were analyzed: (1) the total text length, (2) the length of the three emphatic pauses, (3) values of the mean, (4) minimum, (5) maximum fundamental frequency, and (6) the semitone range. The majority of voice-overs and non-voice-overs were correctly identified by the listeners in both productions. However, voice-overs were more consistently correctly identified than non-voice-overs. The total text length was greater for voice-overs. The pause time distribution was statistically more homogeneous for the voice-overs. The acoustic analysis indicated that the voice-overs had lower values of mean, minimum, and maximum fundamental frequency and a greater range of semitones. The voice-overs carry the voice-overed production features to their non-voice-overed production.

10.

Acoustic analysis of voice in individuals with amyotrophic lateral sclerosis and perceptually normal vocal quality

Alice K. Silbergleit Alex F. Johnson Barbara H. Jacobson 《Journal of voice》1997,11(2):222-231

Currently, early phonatory changes in amyotrophic lateral sclerosis (ALS) are not well understood. The aim of this study was to compare acoustic parameters of voice in ALS subjects who demonstrated perceptually normal vocal quality on sustained phonation with a control group. We hypothesized that objective analysis of voice would reveal significant differences on specific acoustic parameters of voice compared to the control group. Results revealed statistically significant differences between the two groups on measures related to frequency range and phonatory stability. The findings suggest that early bulbar signs affecting the laryngeal system may be present in patients with ALS before the occurrence of perceptually aberrant vocal characteristics.

11.

Deviant Vocal Fold Vibration as Observed During Videokymography: The Effect on Voice Quality

Irma M. Verdonck-de Leeuw Joost M. Festen Hans F. Mahieu 《Journal of voice》2001,15(3):313-322

Videokymographic images of deviant or irregular vocal fold vibration, including diplophonia, the transition from falsetto to modal voice, irregular vibration onset and offset, and phonation following partial laryngectomy were compared with the synchronously recorded acoustic speech signals. A clear relation was shown between videokymographic image sequences and acoustic speech signals, and the effect of irregular or incomplete vocal fold vibration patterns was recognized in the amount of perceived breathiness and roughness and by the harmonics-to-noise ratio in the speech signal. Mechanisms causing roughness are the presence of mucus, phase differences between the left and right vocal fold, and short-term frequency and amplitude modulation. It can be concluded that the use of simultaneously recorded videokymographic image sequences and speech signals contributes to the understanding of the effect of irregular vocal fold vibration on voice quality.

12.

Pitch Characteristics of Homosexual Males

Heidi Baeck Paul Corthals John Van Borsel 《Journal of voice》2011,25(5):e211

Objective

To investigate the common stereotype that homosexual males show pitch patterns that mirror those of heterosexual females.

Study Design

Static group comparison.

Method

Comparison of speaking fundamental frequency and pitch variation of 30 homosexual males, 56 heterosexual age-matched males, and 54 age-matched heterosexual females as demonstrated in a sample of read speech.

Results

In the homosexual males, average fundamental frequency and pitch variation were significantly higher than in the heterosexual males but also significantly lower than in the heterosexual females.

Conclusions

Results do not confirm the stereotype that gay male speech mirrors the patterns of women’s speech with respect to pitch characteristics. It would seem that the pitch patterns of gay male speakers constitute an example of sociophonetic variation.

13.

Influence of absorption properties of materials on the accuracy of simulated acoustical measures in 1:10 scale model test

Jin Yong Jeon Jong Kwan Ryu Yong Hee Kim Shin-ichi Sato 《Applied Acoustics》2009,70(4):615-625

This study investigated the absorption characteristics of materials in a multi-purpose hall using computer models, a 1:10 scale model, and actual hall measurements of Gimhae Arts Hall (GAH), in order to predict and evaluate the acoustical characteristics. The elements of this scale model, such as reflecting walls, seats, audience, and absorption banners, were made with materials selected according to their absorption coefficients, measured in a 1:10 scale model reverberation chamber. After the real hall was completed, in situ acoustical measurements were conducted in the GAH and compared with those of the scale model hall. Comparison of these measurements showed that the delay time of the major reflections in the scale model hall was similar to that of the real hall. However, the reverberation time, especially at low frequencies, showed a difference between the scale model hall and the real hall measurements. The results of computer simulations for both scale model and actual hall showed that the absorption of seats and audience, the structural detail of the reflecting walls with different thickness and air spaces, and the duct facilities in the open-type ceiling are the major differences. It was confirmed that there are more complicated absorption characteristics in the scale model design of a multi-purpose hall than a concert hall.

14.

Perceptual attributes of voice: Development and use of rating scales

Marylou Pausewang Gelfer 《Journal of voice》1988,2(4):320-326


15.

Voice characteristics, effects of voice therapy, and long-term follow-up of contact granuloma patients (total citations: 2; self-citations: 0; citations by others: 2)

Riitta Ylitalo Britta Hammarberg 《Journal of voice》2000,14(4):557-566

This study evaluates the laryngoscopic findings and voice characteristics of male contact granuloma patients before and after voice therapy and at a follow-up about 9 years later. Pre- and posttherapy recordings as well as follow-up recordings were made for 19 granuloma patients. Pretherapy, the most salient perceptual voice characteristics were low pitch, monotony, and a high degree of vocal fry and hyperfunction. Interjudge reliability for these traits was high. Immediately following therapy, the healed patients (n = 10) had a decrease in hyperfunction, vocal fry, and monotony, while in the unhealed patients (n = 9) hyperfunction increased and vocal fry decreased only marginally. Monotony decreased significantly in this group. As regards the acoustic analyses, no significant differences were found in mean fundamental frequency (F₀) or perturbation. At the follow-up assessment, 4 patients had granuloma while 15 had normal laryngeal status. Perceptually, their voice characteristics resembled those pretherapy, independently of the laryngeal findings. The results suggest that reduced hyperfunction and decreased vocal fry may create better circumstances for the healing process at the posterior glottis.

16.

Acoustic and Perceptual Appraisal of Vocal Gestures in the Female Classical Voice

Dianna T. Kenny Helen F. Mitchell 《Journal of voice》2006,20(1):55-70

Long-term average spectra (LTAS) have identified features in the sounds of singers and have compared different vocal qualities based on energy changes that occur during different vocal tasks. In this study, we compared the perceptual ratings of vocal quality of expert pedagogues with acoustic measures performed on LTAS. Fifteen expert judges rated 24 samples with six repeats of six advanced singing students under two conditions: "optimal" (O), which represented the application of the maximal open throat technique; and "suboptimal" (SO), which represented the application of the reduced open throat technique. LTAS were performed on each singing sample, and two conventional assessments of peak energy height [singing power ratio (SPR)] and peak area [energy ratio (ER)] were calculated on each LTAS. Perceptual scores, SPR, and ER were rank ordered. We then compared perceptual rankings with rankings of acoustic measures (SPR and ER) to assess whether these acoustic measurements matched the perceptual judgments of vocal quality. Although we found the expected significant relationship between SPR and ER, there was no relationship between perceptual ratings of vocal samples or singers based on SPR or ER. These findings suggest that because LTAS measures are not consistent with perceptual ratings of vocal quality, such measurements cannot define a voice of quality. Future research with LTAS to assess vocal quality should consider alternative measures that are more sensitive to subtle differences in vocal parameters.
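One common way to compare two rank orderings of this kind is Spearman's rank correlation. The abstract does not name the statistic the authors used, so the sketch below is an assumption offered for illustration only. The function names are hypothetical, and the scores are made-up numbers, not data from the study.

```python
import math


def average_ranks(values):
    """Rank values from 1 (smallest) upward; ties share their mean rank."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    start = 0
    while start < len(order):
        end = start
        # Extend the run while the next value ties with the current one.
        while (end + 1 < len(order)
               and values[order[end + 1]] == values[order[start]]):
            end += 1
        mean_rank = (start + end) / 2 + 1
        for pos in range(start, end + 1):
            ranks[order[pos]] = mean_rank
        start = end + 1
    return ranks


def spearman_rho(x, y):
    """Spearman's rho: the Pearson correlation of the two rank vectors.

    Undefined (division by zero) if either input is constant.
    """
    rx, ry = average_ranks(x), average_ranks(y)
    n = len(rx)
    mx, my = sum(rx) / n, sum(ry) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(rx, ry))
    var_x = sum((a - mx) ** 2 for a in rx)
    var_y = sum((b - my) ** 2 for b in ry)
    return cov / math.sqrt(var_x * var_y)


# Made-up perceptual scores and SPR values for six samples.
perceptual = [7.5, 6.0, 8.2, 5.1, 6.9, 4.3]
spr_db = [-21.0, -18.5, -24.0, -19.0, -26.5, -22.0]
print(f"rho = {spearman_rho(perceptual, spr_db):.2f}")
```

A rho near zero would correspond to the kind of mismatch between perceptual and acoustic rankings that the abstract reports, while values near +1 or -1 would indicate agreement or reversal.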

17.

Comparison of acoustic and perceptual measures of voice in male-to-female transsexuals perceived as female versus those perceived as male

Marylou Pausewang Gelfer Kevin J. Schofield 《Journal of voice》2000,14(1):22-33

The present study explored significant differences between male-to-female transgendered speakers perceived as male and those perceived as female in terms of speaking fundamental frequency (SFF) and its variability, vowel formants for /a/ and /i/, and intonation measures. Fifteen individuals who identified themselves as male-to-female transsexuals served as speaker subjects, in addition to 6 biological female control subjects and 3 biological male control subjects. Each subject was recorded reading the Rainbow Passage and producing the isolated vowels /a/ and /i/. Twenty undergraduate psychology students served as listeners. Results indicated that subjects perceived as female had a higher mean SFF and higher upper limit of SFF than subjects perceived as male. A significant correlation between upper limit of SFF and ratings of femininity was achieved.

18.

The Effect of Experience on Perceptual Spaces When Judging Synthesized Voice Quality: A Multidimensional Scaling Study

《Journal of voice》2014,28(5):548-553


19.

A perceptual sharpness evaluation model for visible and infrared color fusion images

Gao Shaoshu Jin Weiqi Wang Xia Wang Lingxue Luo Yuan 《Spectroscopy and Spectral Analysis》2012,32(12):3197-3202

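A perceptual sharpness evaluation model is proposed to assess how well the human eye can discern details and edges in infrared and visible color fusion images. First, a model of the human contrast sensitivity function is used to suppress the frequency components of the image to which the eye is insensitive under the given viewing conditions. Next, a perceptual contrast model is constructed on the basis of the local band-limited contrast model, combined with the luminance masking characteristics of the human eye. Finally, the perceptual contrast of the regions of interest to the human eye (detail and edge regions) in the fused image is computed and used to evaluate the perceptual sharpness of the fused image. Experimental results show that, compared with five existing objective evaluation models of color image sharpness (blur), the results of the perceptual sharpness model, which takes the characteristics of human vision into account, agree well with subjective human perception, and that the model can effectively and objectively evaluate the sharpness of color fusion images.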

20.

Documentation of progress in voice therapy: perceptual, acoustic, and laryngostroboscopic findings pretherapy and posttherapy

R. Speyer G. H. Wieneke P. H. Dejonckere 《Journal of voice》2004,18(3):325-340

The effect of voice therapy in a group of chronically dysphonic patients with diverse diagnoses was studied according to the normal clinical procedure. The results were evaluated by perceptual rating, acoustic analysis, and the assessment of laryngostroboscopic recordings. Although the group effects for the differences between posttherapy and pretherapy data were clearly significant, the effects of voice therapy for the individual patients were divergent. For each of the three evaluation methods, a significant improvement was found for about 40% to 50% of the patients. The diversity of the therapy outcome among the patients could not be explained by the pretherapy status nor by age, gender, or diagnosis groups. In general, the perceptual ratings and the acoustic parameters from the baseline data were clearly correlated. However, these characterizations of the voice were only moderately correlated with the visual evaluation of the vocal fold vibrations. Relations among the three evaluation tools for the changes caused by voice therapy were very weak. The low correlation among the three methods suggests that a multidimensional evaluation of the voice is necessary to give a complete picture of the therapy outcome.