首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 10 毫秒
1.
This study represents a first step toward understanding the contribution formant frequency makes to the perception of female voice categories. The effects of formant frequency and pitch on the perception of voice category were examined by constructing a perceptual study that used two sets of synthetic stimuli at various pitches throughout the female singing range. The first set was designed to test the effects of systematically varying formants 1 through 4. The second set was designed to test the relative effects of lower frequency formants (F1 and F2) versus higher frequency formants (F3 and F4) through construction of mixed stimuli. Generally, as the frequencies of all four formants decreased, perception of soprano voice category decreased at all but the highest pitch, A5. However, perception of soprano voice category also increased as a function of pitch. Listeners appeared to need agreement between all four formants to perceive voice categories. When upper and lower formants are inconsistent in frequency, listeners were unable to judge voice category, but they could use the inconsistent patterns to form perceptions about degree of jaw opening.  相似文献   

2.
Timbre is typically investigated as a perceptual attribute that differentiates a sound source at one pitch and loudness. Yet the perceptual usefulness of timbre is that it allows the listener to recognize one sound source at different pitches. This paper investigated the ability of listeners to identify which pitch in an ascending or descending sequence of three or six stimuli was sung by a different singer. For three-note sequences, the task was extremely difficult, and with rare exceptions, listeners chose the most dissimilarly pitched stimulus as coming from the oddball singer. For six-note sequences, the detection of the oddball singer was much improved in spite of the added complexity of the task. These results support the idea that timbre should be understood as a transformation that connects the different sounds of one source and that a "rich" set of sounds is necessary to discover the trajectory.  相似文献   

3.
4.
Variation in duration and frequency during three readings of each of eight sentences by 9 normal and 4 voice-disordered subjects are compared. Instructions to the subjects varied with respect to the amount and type of cognitive cueing presented in the trials, and the sentences were read in random order. Variability in fundamental frequency (F0) was greater when pitch variation was specifically cued. Also, the portion of the sentence that was cued had greater variability in F0 than other parts of the sentence. Variation in fundamental frequency was significantly greater in the cued versus uncued sentence trials for the voice-disordered subjects but not for the normal subjects. However, all subjects exhibited significantly greater duration for cued versus uncued readings of the same sentences. Implications for theory and practice are discussed.  相似文献   

5.
Two vocal qualities, ringquality and pressed quality, were analyzed perceptually. Listeners were asked to rate (on a scale from 0 to 10) the “amount of ring” in one listening and the “amount of pressedness” in another listening. The stimulus was the synthesized utterance /ya-ya-ya-ya-ya/. In the continuum representation of ring, the skewing quotient and the cross section of the epilaryngeal tube area were systematically varied, independently and by a covariation rule. In the continuum representation of pressed, the flow amplitude and open quotient were similarly varied. Results indicated that the crossover point between ring and no ring occurred with an epilaryngeal area of around 1.0 cm2, and the crossover point between pressed and not pressed quality occurred at an open quotient of about 0.4. Fundamental frequency also had an effect on the perceptions, with a higher fundamental frequency receiving higher ratings of ring and pressed for otherwise the same parameters. Listeners demonstrated highly variable perceptions in both continua with poor intersubject, intrasubject, and intergroup reliability.  相似文献   

6.
Traditionally, timbre has been defined as that perceptual attribute that differentiates two sounds when pitch and loudness are equal, and thus is a measure of dissimilarity. By such a definition, each voice possesses a set of timbres, and the ability to identify any voice or voice category across different pitch-loudness-vowel combinations must be due to an ability to "link" these timbres by abstracting the "timbre transformation," the manner in which timbre subtly changes across pitch and loudness for a specific voice or voice category. Using stimuli produced across the singing range by singers from different voice categories, this study sought to examine how timbre and pitch interact in the perception of dissimilarity in male singing voices. This study also investigated whether or not listener experience affects the perception of timbre as a function of pitch. The resulting multidimensional scaling (MDS) representations showed that for all stimuli and listeners, dimension 1 correlated with pitch, while dimension 2 correlated with spectral centroid and separated vocal stimuli into the categories baritone and tenor. Dimension 3 appeared highly idiosyncratic depending on the nature of the stimuli and on the experience of the listener. Inexperienced listeners appeared to rely more heavily on pitch in making dissimilarity judgments than did experienced listeners. The resulting MDS representations of dissimilarity across pitch provide a glimpse of the timbre transformation of voice categories across pitch.  相似文献   

7.
8.
Traditionally, timbre has been defined as that perceptual attribute that differentiates two sounds when pitch and loudness are equal and thus is a measure of dissimilarity. By such a definition, each voice possesses a set of timbres, and the identity of any voice or voice category across different pitch-loudness-vowel combinations must be due to an abstraction of the pattern of timbre transformation. Using stimuli produced across the singing range by singers from different voice categories, this study sought to examine how timbre and pitch interact in the perception of dissimilarity. This study also investigated whether listener experience affects the perception of timbre as a function of pitch. The resulting multidimensional scaling (MDS) representations showed that for all stimuli and listeners, dimension 1 correlated with pitch, whereas dimension 2 correlated with spectral centroid and separated vocal stimuli into the categories mezzo-soprano and soprano. Dimension 3 appeared highly idiosyncratic depending on the nature of the stimuli and on the experience of the listener. Inexperienced listeners appeared to rely more heavily on pitch in making dissimilarity judgments than did experienced listeners. The resulting MDS representations of dissimilarity across pitch provide a glimpse of the timbre transformation of voice categories across pitch.  相似文献   

9.
In the past decade the number of voice laboratories has increased dramatically. Their primary mission is to enhance patient care by the application of knowledge gained from basic research. They also are dedicated to further improvement of diagnostic and therapeutic resources. The strength of the voice laboratory lies in collaboration between the clinician and the scientist.  相似文献   

10.
In dynamical motor theory, skill acquisition occurs as a modification of preexisting coordination patterns or attractor states. The purpose of this study was to assess how different levels of voice onset, voice quality, and fundamental frequency (F0) combine to form the attractor states common to voice motor control. Three levels of voice onset (glottal, simultaneous, and breathy), voice quality (modal speech, mixed, and falsetto), and fundamental frequency (low, mid, and high) were manipulated by vocally untrained, female subjects. Percent correct of acquisition trials and self-report of effort were used as measures of stable phonations indicative of an attractor state. Using intensity as a covariate, the results provided support for two of the three predicted triads representing attractor states in female speakers: (1) glottal onset/modal speech quality/low F0; and (2) breathy onset/falsetto quality/high F0. The results of this study suggest that certain parameters of voice motor control, such as onset, quality, and F0, exist as part of a dynamical system that can be identified and manipulated in voice motor acquisition and learning.  相似文献   

11.
Researchers long have searched for invariant acoustic features that can be used to identify singing voice categories or even individual singers. Few researchers have examined how listeners perceive singing voice categories or individual voices. Timbre, the most studied perceptual dimension of the singing voice, is generally believed to vary systematically between singing voice categories but is often assumed to be invariant with an individual singer. To test this assumption, 2 mezzo-sopranos and 2 sopranos were recorded singing the vowel /a/ on the pitches A3, C4, G4, B4, F5, and A5. Trials of three stimuli were constructed. Two of the three stimuli in each trial were produced by the same singer at two different pitches (X1 and X2), while the third stimulus was produced by a different singer (Y). Three X1X2 conditions were created: (1) G4, B4; (2) C4, F5; and (3) A3, A5. For each singer and each condition, Y was varied across the three remaining singers and across all six pitches. Experienced and inexperienced listeners were asked to identify which stimulus was produced by the “odd” person. The ability to correctly choose the odd person varied greatly depending on pitch factors, suggesting that the traditional concept of an invariant timbre associated with a singer is inaccurate and that vocal timbre must be conceptualized in terms of transformations in perceived quality that occur across an individual singer's range and/or registers.  相似文献   

12.
The voice conversion (VC) technique recently has emerged as a new branch of speech synthesis dealing with speaker identity. In this work, a linear prediction (LP) analysis is carried out on speech signals to obtain acoustical parameters related to speaker identity - the speech fundamental frequency, or pitch, voicing decision, signal energy, and vocal tract parameters. Once these parameters are established for two different speakers designated as source and target speakers, statistical mapping functions can then be applied to modify the established parameters. The mapping functions are derived from these parameters in such a way that the source parameters resemble those of the target. Finally, the modified parameters are used to produce the new speech signal. To illustrate the feasibility of the proposed approach, a simple to use voice conversion software has been developed. This VC technique has shown satisfactory results. The synthesized speech signal virtually matching that of the target speaker.  相似文献   

13.
Changes in mean fundamental frequency accompanying changes in loudness of phonation are analyzed in 9 professional singers, 9 nonsingers, and 10 male and 10 female patients suffering from vocal functional dysfunction. The subjects read discursive texts with noise in earphones, and some also at voluntarily varied vocal loudness. The healthy subjects phonated as softly and as loudly as possible at various fundamental frequencies throughout their pitch ranges, and the resulting mean phonetograms are compared. Mean pitch was found to increase by about half-semitones per decibel sound level. Grossly, the subject groups gave similar results, although the singers changed voice pitch more than the nonsingers. The voice pitch changes may be explained as passive results of changes of subglottal pressure required for the sound level variation.  相似文献   

14.
Two groups of 10 speech-language pathology graduate students were each given 7 weeks of singing lessons to determine whether voice lessons could have an effect on their clinical and perceptual skills. Pre-, mid-, and posttests to measure various skills were designed and implemented. With use of paired sample statistical testing, statistically significant results were obtained. In addition, the subjective responses of the students show that the lessons were effective in improving pitch perception, breath control, and legato production or easy onset. This study supports efforts to integrate curricula in vocal performance and speech-language pathology.  相似文献   

15.
16.
The current experiment examined pitch discrimination thresholds in listeners of classical Arab music, and listeners of Western popular music. Classical Arab music is characterized by modes (“Maqamat”, plural of “Maqam” in Arabic language) of which the smallest interval is a quarter tone. In contrast, the smallest interval in Western music is a semitone. We hypothesized that daily exposure to a musical style involving minuscule pitch differences may have a positive effect on pitch discrimination abilities. Results demonstrate superior pitch discrimination abilities in the classical Arab music listeners. These results indicate that musical cultures may differ in their influence on perceptual abilities, depending on their basic acoustic characteristics.  相似文献   

17.
HearFones (HF) have been designed to enhance auditory feedback during phonation. This study investigated the effects of HF (1) on sound perceivable by the subject, (2) on voice quality in reading and singing, and (3) on voice production in speech and singing at the same pitch and sound level.

Test 1: Text reading was recorded with two identical microphones in the ears of a subject. One ear was covered with HF, and the other was free. Four subjects attended this test. Tests 2 and 3: A reading sample was recorded from 13 subjects and a song from 12 subjects without and with HF on. Test 4: Six females repeated [pa:p:a] in speaking and singing modes without and with HF on same pitch and sound level.

Long-term average spectra were made (Tests 1–3), and formant frequencies, fundamental frequency, and sound level were measured (Tests 2 and 3). Subglottic pressure was estimated from oral pressure in [p], and simultaneously electroglottography (EGG) was registered during voicing on [a:] (Test 4). Voice quality in speech and singing was evaluated by three professional voice trainers (Tests 2–4).

HF seemed to enhance sound perceivable at the whole range studied (0–8 kHz), with the greatest enhancement (up to ca 25 dB) being at 1–3 kHz and at 4–7 kHz. The subjects tended to decrease loudness with HF (when sound level was not being monitored). In more than half of the cases, voice quality was evaluated “less strained” and “better controlled” with HF. When pitch and loudness were constant, no clear differences were heard but closed quotient of the EGG signal was higher and the signal more skewed, suggesting a better glottal closure and/or diminished activity of the thyroarytenoid muscle.  相似文献   


18.
Recent evolution in scientific knowledge and technology has led to monumental improvement in the standard of care for patients with voice disorders. New concepts in anatomy, physiology, measurement, and analysis have provided voice care professionals with not merely better understanding, but moreover an extensive vocabulary with which to think about voice function and dysfunction. Previously, we had to depend too much upon anecdote and “the art of medicine.” Thanks to scientific advances, we now have the tools we need for rational thought about the human voice. This is the fundamental change responsible for recent great advances in voice care.  相似文献   

19.
The self-organizing map (a neural network) was applied to the spectral pattern recognition of voice quality in 34 subjects: 15 patients operated on because of insufficient glottal closure and 19 subjects not treated for voice disorders. The voice samples, segments of sustained /a/, were perceptually rated by six experts. A self-organized acoustic feature map was first computed from tokens of /a/ and then used for the analysis of the samples. The locations of the samples on the map were determined and the distances from a normal reference were compared with the perceptual ratings. The map locations corresponded to the degree of audible disorder: the samples judged as normal were overlapping or close to the normal reference, whereas the samples judged as dysphonie were located further away from it. The comparison of pre- and postoperative samples of the patients showed that the perceived improvement of voice quality was also detected by the map.  相似文献   

20.
Accurate control of vocal pitch (fundamental frequency) requires coordination of sensory and motor systems. Previous research has supported the relationship between perceptual accuracy and vocal pitch matching accuracy. The purpose of this study was to investigate the role of memory for pitch in pitch matching and pitch discrimination ability. Three experimental tasks were used. First, a pitch matching task was completed, in which the participants listened to target tones and vocally matched the pitch of the tones. The second task was a pitch discrimination task that required the participants to judge the pitch (same or different) of complex tone pairs. The third task was pitch discrimination with memory interference task that was similar to the pitch discrimination task except interference tones were added. Results of the pitch matching and pitch discrimination tasks yielded a significant correlation between these values. When there was memory interference, pitch discrimination ability was poorer, and there was no significant correlation between pitch discrimination and pitch matching. These results support earlier findings of a relationship between pitch discrimination and pitch matching abilities. The results also suggest a possible role of pitch memory in both tasks. These findings may have implications for abilities related to accurate pitch control.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号