Similar literature
 Found 20 similar documents; search took 15 ms
1.
This study aimed to determine whether a correlation exists between the Grade, Roughness, Breathiness, Asthenia, Strain (GRBAS) scale (a subjective measure of voice) and the Multi-Dimensional Voice Program (MDVP) scale (an objective measure of voice). A retrospective review of 37 voice patients (12 male/25 female) was conducted. Each voice was perceptually evaluated using the GRBAS scale by an experienced speech pathologist and acoustically analyzed using the MDVP scale. Statistical analysis using a multivariate regression model identified a significant correlation between the noise-related parameters of MDVP and the components of the GRBAS scale. Grade correlated with voice turbulence index (VTI), noise-to-harmonic ratio (NHR), and soft phonation index (SPI). Roughness correlated with NHR only. Breathiness correlated with SPI only. Asthenia also correlated with SPI only. Of the 19 acoustic variables measured by the MDVP system, only three noise parameters significantly correlated with the GRBAS perceptual voice analysis. Perhaps "noise" is the perceived acoustical quality of the dysphonic voice. A voice quantifying measure such as a "voice index score" could be proposed using the GRBAS scoring and the three clinically relevant MDVP values following further studies.

2.
Voice analysis was performed on 21 “standard” laryngectomized, male patients with a Provox® voice prosthesis, along with an age- and sex-matched control group of 20 normal speakers, using acoustical analyses (MDVP and CSL, Kay Elemetrics Corp.), maximum phonation time measurements, and perceptual evaluations. Comparison between MDVP and CSL revealed that the latter was not useful for the analysis of laryngectomized prosthetic voices. In contrast, MDVP seems suitable for this purpose, and contains a large number of parameters that significantly differentiate between patient and control speakers, as did the perceptual ratings and the maximum phonation time. Fundamental frequency appeared to be comparable for patients and control speakers. A significant influence of stoma occlusion and age was found for some voice parameters. Factor analyses showed correlations between the different MDVP parameters and correlations between the MDVP parameters and the perceptual ratings.

3.
To improve speech transmission performance in a dome space, the acoustical properties of a dome with a diameter of 20 m were examined. The acoustical properties, measured at evenly distributed points on the floor of the dome, were evaluated both objectively and subjectively, and the interrelationship of the objective and subjective measures was also examined. Then, on the basis of the results of the study, simplified acoustical remedies were applied to the dome to improve speech intelligibility, and the effect of the remedies was also examined. The following findings were obtained from this investigation. (1) The speech transmission performance in the dome space without treatment by absorptive materials varies greatly with the locations of sound sources and observation points: a range of 0.17-0.59 for RASTI value and a range of 30-97% for speech intelligibility test results. (2) There are peculiar observation points at which speech transmission quality is very high due to a considerable sum of the energy arriving in the first 0.06 s after the arrival of the direct sound. (3) Of all the measured acoustical parameters, RASTI, EDT in the 1 kHz band, early-to-late arriving sound energy ratio, and Ts corresponded well to the speech intelligibility test scores. (4) Rubber tiles, cotton canvas 12 m in length, and glass wool board are remarkably effective in improving speech intelligibility due to increased sound absorption and the diffusion effect.

4.
S.K. Tang 《Applied Acoustics》2008,69(12):1318-1331
The present study surveys the speech-related acoustical parameters in Hong Kong classrooms having standardized architectural layouts. Results suggest that these acoustical parameters are highly correlated with each other even across different octave bands. It is also found that the relationships between parameters of different kinds do not depend on the frequency bands. In addition, the present results indicate that the sound pulse decay inside a not very reverberant classroom consists of an initial fast decay, leading to deviations of the field survey results from those predicted by the exponential decay under the uniform sound energy decay assumption. It is believed that the strong correlations between the various speech-related acoustical parameters and the regression information obtained in the present study can help the estimation of the speech quality of the classrooms in the design stage.

5.
The preferences of experienced listeners for pitch and formant frequency dispersion in unison choir sounds were explored using synthesized stimuli. Two types of dispersion were investigated: (a) pitch scatter, which arises when voices in an ensemble exhibit small differences in mean fundamental frequency, and (b) spectral smear, defined as such dispersion of formants 3 to 5 as arises from differences in vocal tract length. Each stimulus represented a choir section of five bass, tenor, alto, or soprano voices, producing the vowel [u], [a], or [w]. Subjects chose one dispersion level out of six available, selecting the “maximum tolerable” in a first run and the “preferred” in a second run. The listeners were very different in their tolerance for dispersion. Typical scatter choices were 14 cent standard deviation for “tolerable” and 0 or 5 cent for “preferred.” The smear choices were less consistent; the standard deviations were 12 and 7%, respectively. In all modes of assessment, the largest dispersion was chosen for the vowel [u] on a bass tone. There was a vowel effect on the smear choices. The effects of voice category were not significant.
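
The pitch scatter described in this abstract is expressed as a standard deviation in cents. As a rough illustration only, the sketch below converts the mean fundamental frequencies of a five-voice section to cents and takes their spread. The function names, the geometric-mean reference, the use of the population standard deviation, and the example frequencies are all assumptions made for this sketch; the abstract does not say how the study computed its values.

```python
import math
import statistics


def cents(f_hz, ref_hz):
    """Interval from ref_hz to f_hz in cents (1200 cents per octave)."""
    return 1200.0 * math.log2(f_hz / ref_hz)


def pitch_scatter_cents(mean_f0s_hz):
    """Standard deviation, in cents, of the voices' mean F0 values.

    Assumption: the reference is the geometric mean of the voices, so that
    deviations are symmetric on a logarithmic (musical) scale, and the
    population standard deviation is used.
    """
    logs = [math.log(f) for f in mean_f0s_hz]
    ref_hz = math.exp(statistics.fmean(logs))
    deviations = [cents(f, ref_hz) for f in mean_f0s_hz]
    return statistics.pstdev(deviations)


if __name__ == "__main__":
    # Hypothetical mean F0 values (Hz) for five bass voices singing in unison.
    section = [110.0, 110.5, 109.4, 111.0, 109.8]
    print(f"pitch scatter: {pitch_scatter_cents(section):.1f} cents")
```

Working in cents rather than hertz keeps the measure comparable across voice categories, since a fixed difference in hertz is a much larger musical interval for a bass than for a soprano.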

6.
This study investigates the relationship between rough voice and the presence of subharmonics, which correspond to smaller yet distinct peaks located between two consecutive harmonic peaks in the power spectrum. Spectrum analysis was undertaken in 389 pathologic voices, of which 20 had subharmonics. Although all 20 voices had roughness perceptually, 8 had normal jitter and/or shimmer. The degree of roughness had a significant inverse relationship with the frequency of subharmonics. By digital signal processing, sound samples with various types of subharmonics were synthesized and perceptually analyzed. Power and frequency of subharmonics in the synthesized sound also had significant relationships with the degree of roughness. Rough voice is acoustically characterized not only by jitter and shimmer but also by the presence of subharmonics in the power spectrum. Subharmonics are important acoustic properties for objective evaluation of rough voices.

7.
This paper presents a parameter for objectively evaluating singing voice quality. The power spectrum of the vowel sound /a/ was analyzed by Fast Fourier Transform. The greatest harmonic peak between 2 and 4 kHz and the greatest harmonic peak between 0 and 2 kHz were identified. The power ratio of these peaks, termed singing power ratio (SPR), was calculated in 37 singers and 20 nonsingers. SPR of sung /a/ in singers was significantly greater than in nonsingers. In singers, SPR of sung /a/ was significantly greater than that of spoken /a/. By digital signal processing, the power spectrum of sung /a/ was varied, and the processed sounds were perceptually analyzed. SPR had a significant relationship with perceptual scores of “ringing” quality. SPR provides an important quantitative measurement for evaluating singing voice quality for all voice types, including soprano.
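
The SPR definition in this abstract (greatest peak between 2 and 4 kHz relative to the greatest peak between 0 and 2 kHz) can be sketched as follows. This is a minimal illustration, not the authors' implementation: it assumes the power spectrum has already been computed and is passed in as linear power values, it takes the largest bin in each band as a stand-in for the greatest harmonic peak, and it reports the ratio in decibels. The function names and the example spectrum are hypothetical.

```python
import math


def peak_power(freqs_hz, powers, lo_hz, hi_hz):
    """Largest power among spectral bins with lo_hz < f <= hi_hz."""
    band = [p for f, p in zip(freqs_hz, powers) if lo_hz < f <= hi_hz]
    if not band:
        raise ValueError("no spectral bins in the requested band")
    return max(band)


def singing_power_ratio_db(freqs_hz, powers):
    """Ratio of the 2-4 kHz peak to the 0-2 kHz peak, in dB.

    Assumptions: `powers` holds linear power values (not dB), and the
    largest bin in each band approximates the greatest harmonic peak.
    The DC bin (f == 0) is excluded from the low band.
    """
    low_peak = peak_power(freqs_hz, powers, 0.0, 2000.0)
    high_peak = peak_power(freqs_hz, powers, 2000.0, 4000.0)
    return 10.0 * math.log10(high_peak / low_peak)


if __name__ == "__main__":
    # Hypothetical harmonic frequencies (Hz) and linear powers.
    freqs = [250.0, 500.0, 1000.0, 2500.0, 3000.0]
    powers = [1.0, 4.0, 2.0, 0.4, 0.1]
    print(f"SPR: {singing_power_ratio_db(freqs, powers):.1f} dB")
```

With this sign convention, a stronger peak in the 2 to 4 kHz band gives a larger (less negative) value, which matches the abstract's statement that SPR was greater in singers.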

8.
Measurements have been carried out on furnished orchestra platforms in four concert halls in Italy in order to describe the sound field perceived by musicians. The heterogeneous nature of the orchestra suggested a procedure able to take into account the mutual hearing between instrumental sections. The measured parameters were the early, late and total support, the reverberation time, the early decay time and the clarity index. A part of the study has been devoted to the measurement uncertainty estimation. The source directivity and the small displacements of the microphone influence the early decay time to a great extent, while the on-platform spatial variability affects both the early decay time and the clarity index. Per-section early support shows differences that render the overall spatial mean inappropriate to describe the stage as a whole. For the other parameters an overall mean platform value can instead be suitable, even though a more evident group variability is observed for clarity. The values of late support, reverberation time, early decay time and clarity index, proposed in the literature as suitable measures of reverberance for musicians, are not all intercorrelated, indicating that not all these parameters can be associated with the same subjective impression.

9.
Voice-overs are professional voice users who use their voices to market products in the electronic media. The purposes of this study were to (1) analyze voice-overed and non-overed productions of an advertising text in two groups consisting of 10 male professional voice-overs and 10 male non-voice-overs; and (2) determine specific acoustic features of voice-over productions in both groups. A group of naïve listeners was engaged for the perceptual analysis of the recorded advertising text. The voice-overed production samples from both groups were submitted for analysis of acoustic and temporal features. The following parameters were analyzed: (1) the total text length, (2) the length of the three emphatic pauses, (3) values of the mean, (4) minimum, (5) maximum fundamental frequency, and (6) the semitone range. The majority of voice-overs and non-voice-overs were correctly identified by the listeners in both productions. However, voice-overs were more consistently correctly identified than non-voice-overs. The total text length was greater for voice-overs. The pause time distribution was statistically more homogeneous for the voice-overs. The acoustic analysis indicated that the voice-overs had lower values of mean, minimum, and maximum fundamental frequency and a greater range of semitones. The voice-overs carry the voice-overed production features to their non-voice-overed production.
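
The semitone range listed among the parameters above is conventionally derived from the minimum and maximum fundamental frequency, with 12 semitones per doubling of frequency. The sketch below shows that standard conversion; the function name and the example values are hypothetical, and the abstract does not state how the study itself computed the range.

```python
import math


def semitone_range(f0_min_hz, f0_max_hz):
    """Span between minimum and maximum F0 in semitones (12 per octave)."""
    if f0_min_hz <= 0 or f0_max_hz < f0_min_hz:
        raise ValueError("expected 0 < f0_min_hz <= f0_max_hz")
    return 12.0 * math.log2(f0_max_hz / f0_min_hz)


if __name__ == "__main__":
    # Hypothetical speaker whose F0 spans 85 Hz to 190 Hz in the read text.
    print(f"range: {semitone_range(85.0, 190.0):.1f} semitones")
```

Because the measure depends only on the ratio of the two frequencies, a speaker with lower minimum and maximum F0 can still show a greater semitone range, as the abstract reports for the voice-overs.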

10.
Currently, early phonatory changes in amyotrophic lateral sclerosis (ALS) are not well understood. The aim of this study was to compare acoustic parameters of voice in ALS subjects who demonstrated perceptually normal vocal quality on sustained phonation with those of a control group. We hypothesized that objective analysis of voice would reveal significant differences on specific acoustic parameters of voice compared to the control group. Results revealed statistically significant differences between the two groups on measures related to frequency range and phonatory stability. The findings suggest that early bulbar signs affecting the laryngeal system may be present in patients with ALS before the occurrence of perceptually aberrant vocal characteristics.

11.
Videokymographic images of deviant or irregular vocal fold vibration, including diplophonia, the transition from falsetto to modal voice, irregular vibration onset and offset, and phonation following partial laryngectomy were compared with the synchronously recorded acoustic speech signals. A clear relation was shown between videokymographic image sequences and acoustic speech signals, and the effect of irregular or incomplete vocal fold vibration patterns was recognized in the amount of perceived breathiness and roughness and by the harmonics-to-noise ratio in the speech signal. Mechanisms causing roughness are the presence of mucus, phase differences between the left and right vocal fold, and short-term frequency and amplitude modulation. It can be concluded that the use of simultaneously recorded videokymographic image sequences and speech signals contributes to the understanding of the effect of irregular vocal fold vibration on voice quality.

12.

Objective

To investigate the common stereotype that homosexual males show pitch patterns that mirror those of heterosexual females.

Study Design

Static group comparison.

Method

Comparison of speaking fundamental frequency and pitch variation of 30 homosexual males, 56 heterosexual age-matched males, and 54 age-matched heterosexual females as demonstrated in a sample of read speech.

Results

In the homosexual males, average fundamental frequency and pitch variation were significantly higher than in the heterosexual males but also significantly lower than in the heterosexual females.

Conclusions

Results do not confirm the stereotype that gay male speech mirrors the patterns of women’s speech with respect to pitch characteristics. It would seem that the pitch patterns of gay male speakers constitute an example of sociophonetic variation.

13.
This study investigated the absorption characteristics of materials in a multi-purpose hall using computer models, 1:10 scale model and actual hall measurements of Gimhae Arts Hall (GAH), in order to predict and evaluate the acoustical characteristics. The elements of this scale model, such as reflecting walls, seats, audience, and absorption banners, were made with materials selected according to their absorption coefficients, measured in a 1:10 scale model reverberation chamber. After the real hall was completed, in situ acoustical measurements were conducted in the GAH and compared with those of the scale model hall. Comparison of these measurements showed that the delay time of the major reflections in the scale model hall was similar to that of the real hall. However, the reverberation time especially at low frequencies showed a difference between the scale model hall and the real hall measurements. The results of computer simulations for both scale model and actual hall showed that the absorption of seats and audience, the structural detail of the reflecting walls with different thickness and air spaces, and the duct facilities in the open-type ceiling are the major differences. It was confirmed that there are more complicated absorption characteristics in the scale model design of a multi-purpose hall than a concert hall.

14.
15.
This study evaluates the laryngoscopic findings and voice characteristics of male contact granuloma patients before and after voice therapy and at a follow-up about 9 years later. Pre- and posttherapy recordings as well as follow-up recordings were made for 19 granuloma patients. Pretherapy revealed the most salient perceptual voice characteristics were low pitch, monotony, and a high degree of vocal fry and hyperfunction. Interjudge reliability for these traits was high. Immediately following therapy the healed patients (n = 10) had a decrease in hyperfunction, vocal fry, and monotony, while the unhealed patients (n = 9) had an increase in hyperfunction and vocal fry decreased only marginally. Monotony decreased significantly in this group. As regards the acoustic analyses, no significant differences were found in mean fundamental frequency (F0) or perturbation. At the follow-up assessment 4 patients had granuloma while 15 had normal laryngeal status. Perceptually their voice characteristics resembled those pretherapy independently of the laryngeal findings. The results suggest that reduced hyperfunction and decreased vocal fry may create better circumstances for the healing process at the posterior glottis.

16.
Long-term average spectra (LTAS) have identified features in the sounds of singers and have compared different vocal qualities based on energy changes that occur during different vocal tasks. In this study, we compared the perceptual ratings of vocal quality of expert pedagogues with acoustic measures performed on LTAS. Fifteen expert judges rated 24 samples with six repeats of six advanced singing students under two conditions: "optimal" (O), which represented the application of the maximal open throat technique; and "suboptimal" (SO), which represented the application of the reduced open throat technique. LTAS were performed on each singing sample, and two conventional assessments of peak energy height [singing power ratio (SPR)] and peak area [energy ratio (ER)] were calculated on each LTAS. Perceptual scores, SPR, and ER were rank ordered. We then compared perceptual rankings with rankings of acoustic measures (SPR and ER) to assess whether these acoustic measurements matched the perceptual judgments of vocal quality. Although we found the expected significant relationship between SPR and ER, there was no relationship between perceptual ratings of vocal samples or singers based on SPR or ER. These findings suggest that because LTAS measures are not consistent with perceptual ratings of vocal quality, such measurements cannot define a voice of quality. Future research with LTAS to assess vocal quality should consider alternative measures that are more sensitive to subtle differences in vocal parameters.

17.
The present study explored significant differences between male-to-female transgendered speakers perceived as male and those perceived as female in terms of speaking fundamental frequency (SFF) and its variability, vowel formants for /a/ and /i/, and intonation measures. Fifteen individuals who identified themselves as male-to-female transsexuals served as speaker subjects, in addition to 6 biological female control subjects and 3 biological male control subjects. Each subject was recorded reading the Rainbow Passage and producing the isolated vowels /a/ and /i/. Twenty undergraduate psychology students served as listeners. Results indicated that subjects perceived as female had a higher mean SFF and higher upper limit of SFF than subjects perceived as male. A significant correlation between upper limit of SFF and ratings of femininity was achieved.

18.
19.
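A perceived-sharpness evaluation model is proposed to assess how well the human eye can discern details and edges in color fusion images of infrared and visible light. First, a human contrast sensitivity function model is used to suppress the frequency components of the image to which the eye is insensitive under the given viewing conditions. Next, building on the local band-limited contrast model, a perceptual contrast model is constructed by incorporating the luminance masking characteristics of the human eye. Finally, the perceptual contrast of the regions of interest to the human eye (detail and edge regions) in the fused image is computed, and from it the perceived sharpness of the fused image is evaluated. Experimental results show that, compared with five existing objective evaluation models of color image sharpness (or blur), the results of the perceived-sharpness model, which takes the characteristics of human vision into account, agree well with subjective human perception, and that the model can effectively provide an objective evaluation of the sharpness of color fusion images.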

20.
The effect of voice therapy in a group of chronically dysphonic patients with diverse diagnoses was studied according to the normal clinical procedure. The results were evaluated by perceptual rating, acoustic analysis, and the assessment of laryngostroboscopic recordings. Although the group effects for the differences between posttherapy and pretherapy data were clearly significant, the effects of voice therapy for the individual patients were divergent. For each of the three evaluation methods, a significant improvement was found for about 40% to 50% of the patients. The diversity of the therapy outcome among the patients could not be explained by the pretherapy status nor by age, gender, or diagnosis groups. In general, the perceptual ratings and the acoustic parameters from the baseline data were clearly correlated. However, these characterizations of the voice were only moderately correlated with the visual evaluation of the vocal fold vibrations. Relations among the three evaluation tools for the changes caused by voice therapy were very weak. The low correlation among the three methods suggests that a multidimensional evaluation of the voice is necessary to give a complete picture of the therapy outcome.
