Similar literature
 20 similar articles found
1.
The purpose of this study was to determine the accuracy with which listeners could identify the gender of a speaker from a synthesized isolated vowel based on the natural production of that speaker when (1) the fundamental frequency was consistent with the speaker's gender, (2) the fundamental frequency was inconsistent with the speaker's gender, and (3) the speaker was transgendered. Ten male-to-female transgendered persons, 10 men, and 10 women served as subjects. Each speaker produced the vowels /i/, /u/, and //. These vowels were analyzed for fundamental frequency and the first three formant frequencies and bandwidths. Formant frequency and bandwidth information was used to synthesize two vowel tokens for each speaker, one at a fundamental frequency of 120 Hz and one at 240 Hz. Listeners were asked to listen to these tokens and determine whether the original speaker was male or female. Listeners were not aware of the use of transgendered speakers. Results showed that, in all cases, gender identifications were based on fundamental frequency, even when fundamental frequency and formant frequency information was contradictory.

2.
The "hot potato voice" is widely recognized as a symptom of peritonsillar cellulitis or abscess, yet there have been no studies assessing the resonance characteristics of the vocal tract in peritonsillitis. Formant frequencies in the articulation of the vowels /i:/, /a:/, and /u:/ were analyzed in six subjects with peritonsillitis and compared with articulation once the peritonsillitis had settled. Significant variation was found in F1 when articulating /i:/ and in F2 when articulating /a:/, which can be explained by dyskinesis of the peritonsillar musculature. These findings were compared with six subjects articulating the same vowels with and without a hot potato in their mouths. Variation was found in both F1 and F2 when articulating /i:/, which can be related to interference of the potato with movement of the anterior tongue. The changes in the vocal tract differ in these two cases, and the term "hot potato voice" in peritonsillitis is a misnomer.

3.
The purpose of this investigation was to gather information on the extent to which intraspeaker variability on measures of jitter (%) and fundamental frequency standard deviation (F0 s.d.) is age related in women. Fifteen repeat productions of the vowels /i/, /a/, and /u/ from 22 young women (18-22 years) were analyzed for F0 s.d. and jitter. Findings for these young speakers were compared with those for elderly speakers tested previously (Linville and Korabic, 1987). Results indicate that the aging process brings about increases in the variability individual women demonstrate on measures of F0 stability when producing sustained vowels as steadily as possible. Further, young speakers differed markedly from elderly speakers in the pattern of frequency instability variations observed across the three vowels tested.

4.
This study investigated changes in maximum phonation time and acoustic and perceptual measures of voice following topical anesthesia and laryngeal endoscopy with the flexible endoscope. Forty-four females, aged 18–33 years and with normal voices, performed four vocal tasks: (a) 3-second /i/ prolongation, (b) maximum phonation time on /i/, (c) stepwise scale-singing, and (d) reading a standard passage. Subjects performed these tasks prior to anesthesia, after anesthesia, and again during laryngeal endoscopy. Voice samples were analyzed for jitter, shimmer, harmonic-to-noise ratio, speaking fundamental frequency, maximum phonational frequency range, maximum phonation time, harshness, and breathiness. Results demonstrated significant reductions in maximum phonational frequency range following anesthesia and, during laryngeal endoscopy, reductions in maximum phonation time and increases in speaking fundamental frequency, minimum fundamental frequency on scale-singing, and breathiness. Clinicians using laryngeal endoscopy for evaluation and management of vocal dysfunction should, therefore, consider the possible effects of these procedures on vocal functioning.

5.
Twenty-four normal adult women read part of the Rainbow Passage and sustained vowels, three trials each. Utterances were assessed for selected parameters measured by Visi-Pitch (average and SD of fundamental frequency (F0), average and SD of dBA, perturbation, and percent voiced/unvoiced/pause). Assessment of each parameter included measures of central tendency, dispersion, and distribution characteristics (skewness and kurtosis) of the data and of the ranges of values that would include 95% of the scores (95% fiduciary limits). Generally, differences for the group between the three trials were not significant. Intersubject variability for only a few parameters was less than 20% of the parameter's mean. For vowels, variability of jitter was 30–48% of the mean. Eight subjects provided performances 2 months later to obtain an estimate of intrasubject variability over time. There were desirable intrasubject correlations between performances for mean F0, jitter in reading and on vowels /i/ and /a/, and percent of voicing. Inter- and intrasubject variability seems restricted and the data appear to resemble a normally distributed function for mean F0 on reading, jitter on /i/, and percent of voicing. Thus, these parameters may have statistical merit for use in vocal testing.

6.
The purpose of this investigation was to gather information on how much variability on measures of jitter and fundamental frequency standard deviation (F0 s.d.) can be expected within individual elderly women when phonating sustained vowels "as steadily as possible." Fifteen repeat productions of the vowels /i/, /a/, and /u/ from 18 elderly women (69-90 years) were analyzed for F0 s.d. and jitter. Results indicate that intraspeaker variability on jitter and F0 s.d. measures in elderly women's sustained vowel productions can be quite considerable in some cases. This is a factor which needs to be considered in establishing normative data on elderly speakers' vocal capabilities.

7.
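With the eight fundamental-frequency beams of Shenguang-II (神光Ⅱ), currently the largest laser driver in China, already operating in power balance, the output third-harmonic waveforms were compared beam to beam by varying the detuning of each tuning parameter in the frequency-tripling systems of several beams. For a type II/type II polarization-mismatch tripling system, of the three tuning parameters that affect conversion efficiency, the mismatch of the polarization distribution angle, Δθp, has the largest effect on the third-harmonic waveform. At an incident fundamental power density of about 1.0 GW/cm², the full width at half maximum τ of the third-harmonic waveform is smallest when all three tuning parameters are optimally matched. For the eventual power-balanced third-harmonic output of the eight Shenguang-II beams, this work provides crystal tuning … Keywords: third-harmonic generation; temporal waveform; power balance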

8.

Objectives

The present study examined which factors among self-rated scales, perceptual evaluations, and acoustic parameters calculated from sustained vowels are reliable indicators of physical and mental fatigue.

Methods

A total of 73 volunteers (male:female, 52:21), aged 19–24 years, were enrolled in this study. We defined the high- and low-fatigue groups using the Chalder Fatigue Scale score. For assessment of self-rated symptoms, each subject was asked to complete Voice Handicap Index (VHI) and Voice Rating Scale (VRS). For perceptual evaluations, three clinicians assessed each subject’s vocal quality on the Grade, Roughness, Breathiness, Asthenia, Strain Scale. For acoustic analysis, each subject was asked to produce sustained vowels /a/, /e/, /i/, /o/, and /u/ for 3 seconds. Then, the habitual fundamental frequency (F0), jitter, shimmer, F0 tremor, mean F0, standard deviation of F0, maximum F0, minimum F0, normalized noise energy, harmonic-to-noise ratio (HNR), signal-to-noise ratio (SNR), amplitude tremor, and ratio within 2–4 kHz were calculated using Dr. Speech software.

Results

In men, VHI, VRS, F0 tremor, shimmer, HNR, SNR, and amplitude tremor were related to mental fatigue. In women, only VHI was related to physical fatigue, and none of the acoustic parameters was related to the fatigue score. Perceptual evaluations were not related to fatigue in men or women.

Conclusions

These findings suggest that self-rated symptoms and acoustic parameters related to voice quality are indicative of mental fatigue, and these features are prominent in men.

9.
Formant frequencies in an old Estonian folk song performed by two female voices were estimated for two back vowels /a/ and /u/, and for two front vowels /e/ and /i/. Comparison of these estimates with formant frequencies in spoken Estonian vowels indicates a trend of the vowels to be clustered into two sets of front and back ones in the F1/F2 plane. Similar clustering has previously been shown to occur in opera and choir singing, especially with increasing fundamental frequency. The clustering in the present song, however, may also be due to a tendency for a mid vowel to be realized as a higher-beginning diphthong, which is characteristic of the North-Estonian coastal dialect area where the singers come from. No evidence of a "singer's formant" was found.

10.
This study investigated the relationship among the magnitude of jaw opening, intrinsic fundamental frequency (F0), and glottal parameters in natural speech. Acoustic, jaw opening, and electroglottographic (EGG) signals were simultaneously recorded. The subjects were 10 healthy men with New Zealand English as their native language. Subjects were asked to repeat a standard nonemphasized sentence in which one of the target vowels (/a/, /e/, /i/, /o/, and /u/) was embedded in various contexts. The glottal parameters F0, open quotient (OQ), and speed quotient (SQ) were measured from the EGG signal. Results of a series of one-way repeated-measures analyses of variance (ANOVA) showed a significant vowel effect on the magnitude of jaw opening [F(4, 24) = 25.512, P < .001], F0 [F(4, 28) = 45.415, P < .001], and speed quotient [F(4, 28) = 5.233, P = .003], but not on the open quotient [F(4, 28) = 0.501, P = .735]. The magnitude of jaw opening was found to be inversely related to F0 (r = -0.624, n = 25, P = .0009). These findings showed that the magnitude of jaw opening was related to F0 and that jaw opening might be a control signal for simulation of long-term F0 variation to achieve a higher degree of naturalness in artificial voice.

11.
To quantify several acoustic features of the voice in patients with essential tremor (ET), 28 patients and 28 age- and sex-matched controls were studied. ET severity was assessed with the rating scale for tremor of Fahn, Tolosa, and Marín. The Computerized Speech Lab 4300 program (Kay Elemetrics) was used. Two-second samples of a sustained /a/ and a sentence were captured with a microphone and laryngograph equipment. Measures included fundamental frequency (F0), frequency perturbation (jitter, Koike algorithm), intensity perturbation (shimmer, Horii algorithm), and harmonic-to-noise ratio (H/N, Yumoto algorithm) of the vowel /a/, and the frequency and intensity variability of the sentence, phonational range, and dynamic range at the natural frequency, maximum phonational time, and s/z ratio. All subjects underwent indirect laryngoscopy and/or laryngeal fibroscopy. When compared with controls, ET patients showed higher jitter and lower H/N ratio (the latter only with the laryngographic signal) for the vowel /a/, lower frequency variability in the microphone signal and lower intensity variability in the laryngographic signal of the sentence, and significantly lower dynamic range at the natural frequency of phonation. ET patients more often reported high voice intensity, tremor, and struggle. Several acoustic parameters were influenced by the severity of the disease, including shimmer, jitter, H/N ratio, frequency variability of the sentence, and s/z ratio, although none of the acoustic analysis values or phonetometric measurements were affected by the presence of voice tremor or by successful pharmacological treatment of ET.

12.
Ten male-to-female transsexuals participated in five sessions of oral resonance voice therapy targeting lip spreading and forward tongue carriage. Acoustic analysis of recordings made pre- and posttherapy found that participant formant frequency values (F1, F2, and F3, from the vowels /a/, /i/, and /mho/), as well as fundamental frequency (F0), underwent a general increase posttherapy. F3 values, in particular, increased significantly posttreatment. Trends in listener ratings of these recordings showed that the majority of participants were perceived to sound more feminine following treatment. Participants' self-ratings of their voices pre- and posttreatment also indicated that participants perceived their voices as sounding more feminine and that they were more satisfied with their voices following treatment. The present study supports the findings of previous studies that have demonstrated that resonance characteristics in male-to-female transsexuals can be changed to more closely approximate those of females through oral resonance therapy. This intervention study also demonstrates that a spontaneous increase in F0 is achieved during the course of therapy. Further, this study provides preliminary evidence to suggest that oral resonance therapy may be effective in increasing femininity of voice in male-to-female transsexual clients.

13.
Frequency modulation coherence was investigated as a possible cue for the perceptual segregation of concurrent sound sources. Synthesized chords of 2-s duration and comprising six permutations of three sung vowels (/a/, /i/, /o/) at three fundamental frequencies (130.8, 174.6, and 233.1 Hz) were constructed. In one condition, no vowels were modulated, and, in a second, all three were modulated coherently such that the ratio relations among all frequency components were maintained. In a third group of conditions, one vowel was modulated, while the other two remained steady. In a fourth group, one vowel was modulated independently of the two other vowels, which were modulated coherently with one another. Subjects were asked to judge the perceived prominence of each of the three vowels in each chord. Judged prominence increased significantly when the target vowel was modulated compared to when it was not, with the greatest increase being found for higher fundamental frequencies. The increase in prominence with modulation was unaffected by whether the target was modulated coherently or not with nontarget vowels. The modulation and pitch position of nontarget vowels had no effect on target vowel prominence. These results are discussed in terms of possible concurrent auditory grouping principles.
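
The idea of coherent modulation in the abstract above, where the ratio relations among all frequency components are maintained, can be sketched in a few lines of Python. This is only an illustration under assumptions: the sinusoidal modulator, the 5 Hz rate, the 2% depth, and every function name are hypothetical and do not come from the study; only the three fundamental frequencies are taken from the abstract.

```python
import math

# Fundamental frequencies (Hz) quoted in the abstract.
F0S_HZ = (130.8, 174.6, 233.1)


def modulation_factor(t, rate_hz=5.0, depth=0.02):
    """Multiplicative frequency factor at time t (seconds).

    Assumption: a sinusoidal modulator; rate and depth are placeholder
    values, not parameters reported by the study.
    """
    return 1.0 + depth * math.sin(2.0 * math.pi * rate_hz * t)


def harmonic_frequencies(f0, n_harmonics, factor=1.0):
    """Frequencies of harmonics 1..n of f0, all scaled by the same factor."""
    return [k * f0 * factor for k in range(1, n_harmonics + 1)]


def component_ratios(freqs_a, freqs_b):
    """Pairwise frequency ratios between two equally long component lists."""
    return [a / b for a, b in zip(freqs_a, freqs_b)]


# Coherent case: both vowels share one factor, so ratios are unchanged.
shared = modulation_factor(0.05)
steady = component_ratios(harmonic_frequencies(F0S_HZ[0], 3),
                          harmonic_frequencies(F0S_HZ[1], 3))
coherent = component_ratios(harmonic_frequencies(F0S_HZ[0], 3, shared),
                            harmonic_frequencies(F0S_HZ[1], 3, shared))

# Independent case: only one vowel is scaled, so the ratios drift.
independent = component_ratios(harmonic_frequencies(F0S_HZ[0], 3, shared),
                               harmonic_frequencies(F0S_HZ[1], 3))
```

Because every component is multiplied by the same factor in the coherent case, the factor cancels in each ratio; when only one vowel is scaled, the ratios between that vowel's components and the others change over time.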

14.
The noise temperature of an SIS quantum mixer has been calculated as a function of local oscillator voltage and signal source conductance on the basis of a measured I–V characteristic. Applying Tucker's quantum theory of mixing /1/, it is shown that the SIS mixer is quantum noise limited. Using a cryogenic intermediate-frequency amplifier, a receiver noise temperature of 20 K seems to be possible at millimeter wavelengths.

15.
This paper presents the fabrication and characterization of four-barrier planar heterostructure-barrier-varactors (HBVs) to be used in frequency triplers. The fabrication process and the DC and RF testing results are discussed. The measured results are evaluated by a newly developed combined genetic algorithm/harmonic balance simulator to calculate the optimum impedance and output power at 255 GHz. Different HBV structures were fabricated, and a comparison of their conversion efficiency is presented.

16.
The cyclic irradiation sidebands appearing in homonuclear adiabatic decoupling are calculated in detail, which reveals the origin of the antisymmetric sidebands. The sidebands can be inverted by inserting an initial decoupling with a different period, but the same f1rms as the main decoupling that is required for Bloch–Siegert shift compensation. The sidebands can be eliminated in a broad decoupling range by adding spectra of opposite sidebands. Based on this scheme, an offset-independent double-adiabatic decoupling, named Bloch–Siegert Shift Eliminated and Cyclic Sideband Trimmed Double-Adiabatic Decoupling, or “BEST” decoupling for short, is constructed, which not only compensates the Bloch–Siegert shift as shown earlier by Zhang and Gorenstein (1998) but also eliminates residual sidebands effectively.

17.
This study examined the effect of noise on the identification of four synthetic speech continua (/ra/-/la/, /wa/-/ja/, /i/-/u/, and say-stay) by adults with cochlear implants (CIs) and adults with normal-hearing (NH) sensitivity in quiet and noise. Significant group-by-SNR interactions were found for endpoint identification accuracy for all continua except /i/-/u/. The CI listeners showed the least NH-like identification functions for the /ra/-/la/ and /wa/-/ja/ continua. In a second experiment, NH adults identified four- and eight-band cochlear implant simulations of the four continua, to examine whether group differences in frequency selectivity could account for the group differences in the first experiment. Number of bands and SNR interacted significantly for /ra/-/la/, /wa/-/ja/, and say-stay endpoint identification; the strongest effects were found for the /ra/-/la/ and say-stay continua. Results suggest that the speech features that are most vulnerable to misperception in noise by listeners with CIs are those whose acoustic cues are rapidly changing spectral patterns, like the formant transitions in the /wa/-/ja/ and /ra/-/la/ continua. However, the group differences in the first experiment cannot be wholly attributed to frequency selectivity differences, as the number of bands in the second experiment affected performance differently than suggested by group differences in the first experiment.

18.
The present study explored significant differences between male-to-female transgendered speakers perceived as male and those perceived as female in terms of speaking fundamental frequency (SFF) and its variability, vowel formants for /a/ and /i/, and intonation measures. Fifteen individuals who identified themselves as male-to-female transsexuals served as speaker subjects, in addition to 6 biological female control subjects and 3 biological male control subjects. Each subject was recorded reading the Rainbow Passage and producing the isolated vowels /a/ and /i/. Twenty undergraduate psychology students served as listeners. Results indicated that subjects perceived as female had a higher mean SFF and higher upper limit of SFF than subjects perceived as male. A significant correlation between upper limit of SFF and ratings of femininity was achieved.

19.
Harmonic-intensity analysis of normal and hoarse voices
Objective evaluation of normal and hoarse voices is performed on the basis that hoarse voices show a prominent fundamental-frequency intensity compared with the harmonics in the voice spectrum. The relative harmonic intensity Hr, obtained from a stable portion of the sustained vowel /a/, is defined as the intensity of the second and higher harmonics expressed as a percentage of the total voice intensity. Ninety-five percent of the normal voices examined have Hr larger than the critical value of 67.2%, whereas 90% of the hoarse voices have Hr smaller than the critical value. The harmonic-intensity analysis thus provides good discrimination between normal and hoarse voices.
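
The definition of Hr in the abstract above can be written out as a short Python sketch. It rests on assumptions that the abstract does not state: intensities are taken as linear values (not dB), and the total voice intensity is approximated by the sum of the listed harmonic components, so any noise between harmonics is ignored. The function and constant names are hypothetical; only the 67.2% critical value comes from the abstract.

```python
HR_CRITICAL_PERCENT = 67.2  # critical value quoted in the abstract


def relative_harmonic_intensity(harmonic_intensities):
    """Return Hr in percent.

    harmonic_intensities[0] is the fundamental; the remaining entries are
    the second and higher harmonics, in linear intensity units (not dB).
    Assumption: total voice intensity is approximated by the sum of the
    listed components, so noise between harmonics is ignored.
    """
    if len(harmonic_intensities) < 2:
        raise ValueError("need the fundamental and at least one harmonic")
    total = sum(harmonic_intensities)
    if total <= 0:
        raise ValueError("total intensity must be positive")
    return 100.0 * sum(harmonic_intensities[1:]) / total


def above_critical(hr_percent, critical=HR_CRITICAL_PERCENT):
    """True when Hr exceeds the critical value."""
    return hr_percent > critical


# Illustrative values only: a weak fundamental versus a dominant one.
hr_strong_harmonics = relative_harmonic_intensity([1.0, 2.0, 1.0])
hr_dominant_fundamental = relative_harmonic_intensity([3.0, 1.0])
```

A spectrum dominated by its fundamental yields a low Hr, which is the pattern the abstract associates with hoarse voices. Comparing a single Hr against the threshold is a rough screen at best, since the abstract reports overlap between the two groups.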

20.
The term “compensatory falsetto”, for the purpose of this investigation, refers to the development of an abnormally high-pitched voice in the presence of laryngeal pathology where more socially acceptable lower pitched voice production is possible. The purpose of this investigation was to compare laryngeal compensations and their effects on objective measures of vocal function during production of compensatory falsetto voice. Eighteen patients with abnormally high-pitched voice in the presence of underlying laryngeal pathology were evaluated in the Department of Otolaryngology at the University of Miami School of Medicine from January 1988 through December 1992 and were diagnosed with “compensatory falsetto”. Vocal fold paralysis (n = 11) was the most common laryngeal pathology. Vibratory characteristics were evaluated through videostrobolaryngoscopic examination. Acoustic and aerodynamic parameters assessed included fundamental frequency, jitter rate, harmonic-to-noise ratio, glottal air flow, and maximum phonation time. Production of a higher-pitched voice appeared to improve glottic closure and decrease the amount of air loss during phonation. A corresponding increase in maximum phonation time and improvement in acoustic characteristics of jitter and harmonic-to-noise ratio was also observed.
