Similar articles
 20 similar articles found
1.
This study concerns the premier singing voice and its relationship to physiological aptitude. Research literature is reviewed that indicates that during singing the trained singer uses different physiological strategies in comparison with the untrained singer, and that the noted physiological differences (respiratory, laryngeal, articulatory) occur during singing only and not during speech. Further, a study was conducted that compared the ability of trained singers versus untrained individuals to (a) discriminate differences in self-generated air pressures and (b) produce and maintain a constant level of air pressure. No significant differences were found between the trained and untrained groups in their ability to discriminate and/or control breath pressure. Combined results of previous studies and present findings lead to the tentative conclusion that the accomplished singer is not physiologically endowed and/or “gifted,” but rather has benefited from technical voice training.

2.
OBJECTIVES/HYPOTHESIS: The purpose of this study was to examine the temporal-acoustic differences between trained singers and nonsingers during speech and singing tasks. METHODS: Thirty male participants were separated into two groups of 15 according to level of vocal training (i.e., trained or untrained). The participants spoke and sang carrier phrases containing English voiced and voiceless bilabial stops, and voice onset time (VOT) was measured for the stop consonant productions. RESULTS: Mixed analyses of variance revealed a significant main effect between speech and singing for /p/ and /b/, with VOT durations longer during speech than singing for /p/, and the opposite true for /b/. Furthermore, a significant phonatory task by vocal training interaction was observed for /p/ productions. CONCLUSIONS: The results indicated that the type of phonatory task influences VOT and that these influences are most obvious in trained singers secondary to the articulatory and phonatory adjustments learned during vocal training.

3.
《Journal of voice》2019,33(6):945.e19-945.e25
Three electroglottographic parameters, fundamental frequency, contact quotient, and speed quotient were analyzed for two singers of Young girl role in Kunqu Opera. Each singer performed three conditions, singing, stage speech, and reading lyrics. The phonation types adopted in different conditions were explored based on electroglottographic parameters. Fundamental frequency, contact quotient, and speed quotient showed different distributions among conditions. Five phonation types were used in singing and stage speech, which include (1) breathy voice, (2) modal voice with low degree of posterior glottal adduction, (3) modal voice, (4) falsetto, and (5) falsetto with high degree of posterior glottal adduction. The phonation strategies partly showed differences between singers. Different phonation type collocations were employed in singing and stage speech. The relationship between phonation types and pitch was complex. The phonation types actually used were different from and more complex than those in traditional Kunqu Opera singing theory.

4.
Voice source characteristics as derived from inverse filtering were analyzed in 6 country singers' speech and singing. Results showed that the closed quotient varied systematically with vocal loudness, and that glottal compliance (the ratio between transglottal AC volume displacement and subglottal pressure) decreased with increases in fundamental frequency but remained unaffected by vocal loudness. No striking differences were found in source characteristics between speech and singing within subjects. The degree of phonatory press, as judged by a panel of 19 expert listeners, appeared related to the range in which the singer was singing and to the sound pressure level gain from a doubling of subglottal pressure.

5.
Many studies have described and analyzed the singer's formant. A similar phenomenon produced by trained speakers led some authors to examine the speaker's ring. If we consider these phenomena as resonance effects associated with vocal tract adjustments and training, can we hypothesize that trained singers can carry over their singing formant ability into speech, also obtaining a speaker's ring? Can we find similar differences for energy distribution in continuous speech? Forty classically trained singers and forty untrained normal speakers performed an all-voiced reading task and produced a sample of a sustained spoken vowel /a/. The singers were also requested to perform a sustained sung vowel /a/ at a comfortable pitch. The reading was analyzed by the long-term average spectrum (LTAS) method. The sustained vowels were analyzed through power spectrum analysis. The data suggest that singers show more energy concentration in the singer's formant/speaker's ring region in both sung and spoken vowels. The singers' spoken vowel energy in the speaker's ring area was found to be significantly larger than that of the untrained speakers. The LTAS showed similar findings suggesting that those differences also occur in continuous speech. This finding supports the value of further research on the effect of singing training on the resonance of the speaking voice.

6.
Five premier male country singers involved in our previous studies spoke and sang the words of both the national anthem and a country song of their choice. Long-term-average spectra were made of the spoken and sung material of each singer. The spectral characteristics of country singers' speech and singing were similar. A prominent peak in the upper part of the spectrum, previously described as the "speaker's formant," was found in the country singers' speech and singing. The singer's formant, a strong spectral peak near 2.8 kHz, an important part of the spectrum of classically trained singers, was not found in the spectra of the country singers. The results support the conclusion that the resonance characteristics in speech and singing are similar in country singing and that country singing is not characterized by a singer's formant.

7.
HearFones (HF) have been designed to enhance auditory feedback during phonation. This study investigated the effects of HF (1) on sound perceivable by the subject, (2) on voice quality in reading and singing, and (3) on voice production in speech and singing at the same pitch and sound level.

Test 1: Text reading was recorded with two identical microphones in the ears of a subject. One ear was covered with HF, and the other was free. Four subjects attended this test. Tests 2 and 3: A reading sample was recorded from 13 subjects and a song from 12 subjects without and with HF on. Test 4: Six females repeated [pa:p:a] in speaking and singing modes without and with HF on at the same pitch and sound level.

Long-term average spectra were made (Tests 1–3), and formant frequencies, fundamental frequency, and sound level were measured (Tests 2 and 3). Subglottic pressure was estimated from oral pressure in [p], and simultaneously electroglottography (EGG) was registered during voicing on [a:] (Test 4). Voice quality in speech and singing was evaluated by three professional voice trainers (Tests 2–4).

HF seemed to enhance sound perceivable across the whole range studied (0–8 kHz), with the greatest enhancement (up to ca. 25 dB) being at 1–3 kHz and at 4–7 kHz. The subjects tended to decrease loudness with HF (when sound level was not being monitored). In more than half of the cases, voice quality was evaluated “less strained” and “better controlled” with HF. When pitch and loudness were constant, no clear differences were heard but closed quotient of the EGG signal was higher and the signal more skewed, suggesting a better glottal closure and/or diminished activity of the thyroarytenoid muscle.


8.
Vocal directivity refers to how directional the sound is that comes from a singer's mouth, that is, whether the sound is focused into a narrow stream of sound projecting in front of the singer or whether it is spread out all around the singer. This study investigates the long-term vocal directivity and acoustic power of professional opera singers and how these vary among subjects, among singing projections, and among vastly different acoustic environments. The vocal sound of eight professional opera singers (six females and two males) was measured in anechoic and reverberant rooms and in a recital hall. Subjects sang in four different ways: (1) paying great attention to intonation; (2) singing as in performance, with all the emotional connection intended by the composer; (3) imagining a large auditorium; and (4) imagining a small theatre. The same song was sung by all singers in all conditions. A head and torso simulator (HATS), radiating sound from its mouth, was used for comparison in all situations. Results show that individual singers have quite consistent long-term average directivity, even across conditions. Directivity varies substantially among singers. Singers are more directional than the standard HATS (which is a physical model of a talking person). The singer's formant region of the spectrum exhibits greater directivity than the lower-frequency range, and results indicate that singers control directivity (at least, incidentally) for different singing conditions as they adjust the spectral emphasis of their voices through their formants.

9.
Scientists have made great strides toward understanding the mechanisms of speech production and perception. However, the complex relationships between the acoustic structures of speech and the resulting psychological percepts have yet to be fully and adequately explained, especially in speech produced by younger children. Thus, this study examined the acoustic structure of voiceless fricatives (/f, θ, s, ʃ/) produced by adults and typically developing children from 3 to 6 years of age in terms of multiple acoustic parameters (durations, normalized amplitude, spectral slope, and spectral moments). It was found that the acoustic parameters of spectral slope and variance (commonly excluded from previous studies of child speech) were important acoustic parameters in the differentiation and classification of the voiceless fricatives, with spectral variance being the only measure to separate all four places of articulation. It was further shown that the sibilant contrast between /s/ and /ʃ/ was less distinguished in children than adults, characterized by a dramatic change in several spectral parameters at approximately five years of age. Discriminant analysis revealed evidence that classification models based on adult data were sensitive to these spectral differences in the five-year-old age group.

10.
This letter describes a data acquisition setup for recording, and processing, running speech from a person in a magnetic resonance imaging (MRI) scanner. The main focus is on ensuring synchronicity between image and audio acquisition, and in obtaining good signal to noise ratio to facilitate further speech analysis and modeling. A field-programmable gate array based hardware design for synchronizing the scanner image acquisition to other external data such as audio is described. The audio setup itself features two fiber optical microphones and a noise-canceling filter. Two noise cancellation methods are described including a novel approach using a pulse sequence specific model of the gradient noise of the MRI scanner. The setup is useful for scientific speech production studies. Sample results of speech and singing data acquired and processed using the proposed method are given.

11.
The present study addresses two questions: (a) Is the action and/or posture of the velopharyngeal valve conducive to allow significant resonance during Western tradition classical singing? (b) How do the actions of the velopharyngeal valve observed in this style of singing compare with normal speech? A photodetector system was used to observe the area function of the velopharyngeal port during speech and classical style singing. Identical speech samples were produced by each subject in a normal speaking voice and then in the low, medium, and high singing ranges. Results indicate that in these four singers the velopharyngeal port was closed significantly longer in singing than in speaking samples. The amount of time the velopharyngeal port was opened was greatest in speech and diminished as the singer ascended in pitch. In the high voice condition, little or no opening of the velopharyngeal port was measured.

12.
Vowel prolongation is often used to evaluate disordered voice production. In light of previous findings showing that co-articulation has significant influence on laryngeal function measures, the practice of using prolonged vowels to represent a speech sample is questioned. To test whether disordered and normal voice during vowel production is generalizable to connected speech, three speaking tasks were investigated: sustained vowel prolongation, syllable repetition and reading. Statistical differences were found between these tasks for certain amplitude and time based laryngeal function measures for adult women with disordered and normal voice. However, for the specific measures which were statistically different, the actual numerical and perceptual differences may be quite small. From a clinical assessment standpoint, the choice of the speech task may not make an apparent difference in the objective evaluation of disordered voice.

13.
This paper examines whether correlations between speech perception and speech production exist, and, if so, whether they might provide a way of evaluating different acoustic metrics. The cues listeners use for many phonemic distinctions are not known, often because many different acoustic cues are highly correlated with one another, making it difficult to distinguish among them. Perception-production correlations may provide a new means of doing so. In the present paper, correlations were examined between acoustic measures taken on listeners' perceptual prototypes for a given speech category and on their average production of members of that category. Significant correlations were found for VOT among stop consonants, and for spectral peaks (but not centroids or skewness) for voiceless fricatives. These results suggest that correlations between speech perception and production may provide a methodology for evaluating different proposed acoustic metrics.

14.
A growing body of contemporary research has investigated differences between trained and untrained singing voices. However, few studies have separated untrained singers into those who do and do not express abilities related to singing talent, including accurate pitch control and production of a pleasant timbre (voice quality). This investigation studied measures of the singing power ratio (SPR), which is a quantitative measure of the resonant quality of the singing voice. SPR reflects the amplification or suppression in the vocal tract of the harmonics produced by the sound source. This measure was acquired from the voices of untrained talented and nontalented singers as a means to objectively investigate voice quality differences. Measures of SPR were acquired from vocal samples with fast Fourier transform (FFT) power spectra to analyze the amplitude level of the partials in the acoustic spectrum. Long-term average spectra (LTAS) were also analyzed. Results indicated significant differences in SPR between groups, which suggest that vocal tract resonance, and its effect on perceived vocal timbre or quality, may be an important variable related to the perception of singing talent. LTAS confirmed group differences in the tuning of vocal tract harmonics.

15.
According to classical concepts, the relationship between the first two formants is the feature that determines the identification of long vowels in speech. However, the characteristics of vowels may vary considerably depending on the conditions of their production. Thus, the aforementioned features that are valid for adult speech cannot be extended to speech signals with high fundamental frequencies, such as infant speech or singing. On the basis of the studies of preverbal infant vocalizations, singing, and speech imitation by talking birds, it is shown that the stable features of vowel-like sounds are the positions and amplitude ratios of the most pronounced spectral maxima (including those corresponding to the fundamental frequency). The results of the studies suggest that precisely these features determine the categorical identification of vowels. The role of the relationship between the frequency and amplitude characteristics in the vowel identification irrespective of the way the vowel is produced and the age and state of the speaker, as well as in the case of speech imitation by talking birds, is discussed.

16.
The relationship between auditory perception and vocal production has been typically investigated by evaluating the effect of either altered or degraded auditory feedback on speech production in either normal hearing or hearing-impaired individuals. Our goal in the present study was to examine this relationship in individuals with superior auditory abilities. Thirteen professional musicians and thirteen nonmusicians, with no vocal or singing training, participated in this study. For vocal production accuracy, subjects were presented with three tones. They were asked to reproduce the pitch using the vowel /a/. This procedure was repeated three times. The fundamental frequency of each production was measured using an autocorrelation pitch detection algorithm designed for this study. The musicians' superior auditory abilities (compared to the nonmusicians) were established in a frequency discrimination task reported elsewhere. Results indicate that (a) musicians had better vocal production accuracy than nonmusicians (production errors of half a semitone compared to 1.3 semitones, respectively); (b) frequency discrimination thresholds explain 43% of the variance of the production data; and (c) all subjects with superior frequency discrimination thresholds showed accurate vocal production; the reverse relationship, however, does not hold true. In this study we provide empirical evidence for the importance of auditory feedback on vocal production in listeners with superior auditory skills.

17.
In this paper, the concept of directivity is generalised to acoustic sources radiating transients or signals that evolve with time. (These are the most common cases in power or reinforcement electroacoustic systems.) The generalisation proposed is based upon the anamorphism relating the signal levels emitted into free space in different directions. The relationship between the signals observed in two arbitrary directions is essentially independent of time. Therefore, the anamorphical relationship offers the possibility of obtaining the directivity patterns simply by using as a test signal the signal commonly emitted by the system (i.e. speech in a reinforcement system for a conference room). This principle and method can be applied without major restrictions to any other system, or piece or part of machinery emitting acoustic energy in discontinuous form. Concerning electroacoustic sources, it appears advantageous to replace the usual test signal consisting of pure tones by the signal proper to the system (music, speech, etc.) filtered into the standardised frequency bands. The complete signal (not filtered) can also give significant results. As a simplifying and reasonable compromise regarding the directivity for speech and music, bands of white noise are proposed as test signals.

18.
Peta White   《Journal of voice》1999,13(4):570-582
High-pitched productions present difficulties in formant frequency analysis due to wide harmonic spacing and poorly defined formants. As a consequence, there is little reliable data regarding children's spoken or sung vowel formants. Twenty-nine 11-year-old Swedish children were asked to produce 4 sustained spoken and sung vowels. In order to circumvent the problem of wide harmonic spacing, F1 and F2 measurements were taken from vowels produced with a sweeping F0. Experienced choir singers were selected as subjects in order to minimize the larynx height adjustments associated with pitch variation in less skilled subjects. Results showed significantly higher formant frequencies for speech than for singing. Formants were consistently higher in girls than in boys suggesting longer vocal tracts in these preadolescent boys. Furthermore, formant scaling demonstrated vowel dependent differences between boys and girls suggesting non-uniform differences in male and female vocal tract dimensions. These vowel-dependent sex differences were not consistent with adult data.

19.
《Journal of voice》2020,34(3):487.e11-487.e20
Introduction: Kinesio Taping (KT) application in speech therapy has been studied in a few works about dysphonia, facial nerve palsy, sialorrhea, atypical deglutition, postsurgical recovery after thyroidectomy and laryngectomy. The aim of this study was the evaluation of the possible role of KT in supporting speech therapy in singers complaining of dysphonia using singing voice handicap index (SVHI), fundamental frequency (F0), shimmer, jitter and harmonic to noise ratio (mean H/N). Materials and methods: We enrolled consecutive singers and singing students complaining of dysphonia and voice problems. Control group (DG1) was composed of 15 individuals who underwent traditional speech therapy only, while Case group (DG2), also composed of 15 subjects, underwent traditional speech therapy associated with KT application. A computerized voice analysis was conducted using PRAAT software observing F0, jitter, shimmer and mean H/N before (t1), at mid (t2) and after (t3) the treatment. Moreover, each patient filled in the SVHI before (t1) and after (t3) the complete speech therapy treatment. Results: The mean F0 and H/N measured before, during and after the logopedic treatment, showed a notable increase over time (P value <0.001) both for DG1 and DG2. However, no significant difference was found comparing the two groups. Jitter and Shimmer after treatment were clearly seen to be lower than before in both groups (P value <0.001), and followed a significantly different trend over time (P value <0.001). Moreover, unlike F0 and mean H/N, these parameters underwent a significantly greater decrease in DG2 compared to DG1. Lastly, SVHI improved at t3 and although these reductions were clear in both groups, it was greater in DG2 than in DG1. Discussion and Conclusions: Our findings are encouraging and suggest the possibility of using KT in case of vocal pathologies in singers. It is imperative to underline that the tape does not replace speech therapy, but could possibly enhance the effects of the treatment.

20.
This study examined whether cochlear implant users must perceive differences along phonetic continua in the same way as do normal hearing listeners (i.e., sharp identification functions, poor within-category sensitivity, high between-category sensitivity) in order to recognize speech accurately. Adult postlingually deafened cochlear implant users, who were heterogeneous in terms of their implants and processing strategies, were tested on two phonetic perception tasks using a synthetic /da/-/ta/ continuum (phoneme identification and discrimination) and two speech recognition tasks using natural recordings from ten talkers (open-set word recognition and forced-choice /d/-/t/ recognition). Cochlear implant users tended to have identification boundaries and sensitivity peaks at voice onset times (VOT) that were longer than found for normal-hearing individuals. Sensitivity peak locations were significantly correlated with individual differences in cochlear implant performance; individuals who had a /d/-/t/ sensitivity peak near normal-hearing peak locations were most accurate at recognizing natural recordings of words and syllables. However, speech recognition was not strongly related to identification boundary locations or to overall levels of discrimination performance. The results suggest that perceptual sensitivity affects speech recognition accuracy, but that many cochlear implant users are able to accurately recognize speech without having typical normal-hearing patterns of phonetic perception.
