首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 31 毫秒
1.
Speech production parameters of three postlingually deafened adults who use cochlear implants were measured: after 24 h of auditory deprivation (which was achieved by turning the subject's speech processor off); after turning the speech processor back on; and after turning the speech processor off again. The measured parameters included vowel acoustics [F1, F2, F0, sound-pressure level (SPL), duration and H1-H2, the amplitude difference between the first two spectral harmonics, a correlate of breathiness] while reading word lists, and average airflow during the reading of passages. Changes in speech processor state (on-to-off or vice versa) were accompanied by numerous changes in speech production parameters. Many changes were in the direction of normalcy, and most were consistent with long-term speech production changes in the same subjects following activation of the processors of their cochlear implants [Perkell et al., J. Acoust. Soc. Am. 91, 2961-2978 (1992)]. Changes in mean airflow were always accompanied by H1-H2 (breathiness) changes in the same direction, probably due to underlying changes in laryngeal posture. Some parameters (different combinations of SPL, F0, H1-H2 and formants for different subjects) showed very rapid changes when turning the speech processor on or off. Parameter changes were faster and more pronounced, however, when the speech processor was turned on than when it was turned off. The picture that emerges from the present study is consistent with a dual role for auditory feedback in speech production: long-term calibration of articulatory parameters as well as feedback mechanisms with relatively short time constants.  相似文献   

2.
The timing of changes in parameters of speech production was investigated in six cochlear implant users by switching their implant microphones off and on a number of times in a single experimental session. The subjects repeated four short, two-word utterances, /dV1n#SV2d/ (S = /s/ or /S/), in quasi-random order. The changes between hearing and nonhearing states were introduced by a voice-activated switch at V1 onset. "Postural" measures were made of vowel sound pressure level (SPL), duration, F0; contrast measures were made of vowel separation (distance between pair members in the formant plane) and sibilant separation (difference in spectral means). Changes in parameter values were averaged over multiple utterances, lined up with respect to the switch. No matter whether prosthetic hearing was blocked or restored, contrast measures for vowels and sibilants did not change systematically. Some changes in duration, SPL and F0 were observed during the vowel within which hearing state was changed, V1, as well as during V2 and subsequent utterance repetitions. Thus, sound segment contrasts appear to be controlled differently from the postural parameters of speaking rate and average SPL and F0. These findings are interpreted in terms of the function of hypothesized feedback and feedforward mechanisms for speech motor control.  相似文献   

3.
4.
The effect of auditory feedback on speech production was investigated in five postlingually deafened adults implanted with the 22-channel Nucleus device. Changes in speech production were measured before implant and 1, 6, and 24 months postimplant. Acoustic measurements included: F1 and F2 of vowels in word-in-isolation and word-in-sentence context, voice-onset-time (VOT), spectral range of sibilants, fundamental frequency (F0) of word-in-isolation and word-in-sentence context, and word and sentence duration. Perceptual ratings of speech quality were done by ten listeners. The significant changes after cochlear implantation included: a decrease of F0, word and sentence duration, and F1 values, and an increase of voiced plosives' voicing lead (from positive to negative VOT values) and fricatives' spectral range. Significant changes occurred until 2 years postimplant when most measured values fell within Hebrew norms. Listeners were found to be sensitive to the acoustic changes in the speech from preimplant to 1, 6, and 24 months postimplant. Results suggest that when hearing is restored in postlingually deafened adults, calibration of speech is not immediate and occurs over time depending on the age-at-onset of deafness, years of deafness, and perception skills. The results also concur with hypothesis that the observed changes of some speech parameters are an indirect consequence of intentional changes in other articulatory parameters.  相似文献   

5.
Intrinsic fundamental frequency of vowels in sentence context   总被引:1,自引:0,他引:1  
High vowels have a higher intrinsic fundamental frequency (F0) than low vowels. This phenomenon has been verified in several languages. However, most studies of intrinsic F0 of vowels have used words either in isolation or bearing the main phrasal stress in a carrier sentence. As a first step towards an understanding of how the intrinsic F0 of vowels interacts with intonation in running speech, this study examined F0 of the vowels [i,a,u] in four sentence positions. The four speakers used for this study showed a statistically significant main effect of intrinsic F0 (high vowels had higher F0). Three of the four speakers also showed an interaction between intrinsic F0 and sentence position such that no significant F0 difference was observed in the unaccented, sentence-final position. The interaction was shown not to be due to vowel neutralization or correlated with changes in the glottal waveform shape, as evidenced by measures of the first formant frequency and spectral slope. Comparison with studies of tone languages and speech of the deaf suggests that both the lack of accent and the lower F0 caused the reduction in the intrinsic F0 difference.  相似文献   

6.
The aim of the study was to identify the acoustic correlates of female teachers' subjective voice complaints by recording their voices in their working environment. The subjects made recordings during lessons (N = 10) and breaks (N = 11). The subjects were divided into 2 groups: those with few voice complaints (FC group) and those with many voice complaints (MC group). The speech sample made in the breaks was maximally sustained /a/, from which fundamental frequency (F0), jitter, and shimmer were analyzed. The classroom samples were analyzed for F0, sound pressure level (SPL), and F0 time (the active vibration time of the vocal folds). Additionally, an index for assessing voice loading is presented. The results revealed a tendency of the MC group to have higher F0 and lower SPL and perturbation values than the FC group. The index values correlated moderately with the subjective vocal complaints.  相似文献   

7.
《Journal of voice》2020,34(3):485.e33-485.e43
PurposeThe present study aimed at measuring the smoothed and non-smoothed cepstral peak prominence (CPPS and CPP) in teachers who considered themselves to have normal voice but some of them had laryngeal pathology. The changes of CPP, CPPS, sound pressure level (SPL) and perceptual ratings with different voice tasks were investigated and the influence of vocal pathology on these measures was studied.MethodEighty-four Finnish female primary school teachers volunteered as participants. Laryngoscopically, 52.4% of these had laryngeal changes (39.3% mild, 13.1% disordered). Sound recordings were made for phonations of comfortable sustained vowel, comfortable speech, and speech produced at increased loudness level as used during teaching. CPP, CPPS and SPL values were extracted using Praat software for all three voice samples. Sound samples were also perceptually evaluated by five voice experts for overall voice quality (10 point scale from poor to excellent) and vocal firmness (10 point scale from breathy to pressed, with normal in the middle).ResultsThe CPP, CPPS and SPL values were significantly higher for vowels than for comfortable speech and for loud speech compared to comfortable speech (P < 0.001). Significant correlations were found between SPL and cepstral measures. The loud speech was perceived to be firmer and have a better voice quality than comfortable speech. No significant relationships of the laryngeal pathology status with cepstral values, perceptual ratings, or voice SPLs were found (P > 0.05).ConclusionNeither the acoustic measures (CPP, CPPS, and SPL) nor the perceptual evaluations could clearly distinguish teachers with laryngeal changes from laryngeally healthy teachers. Considering no vocal complaints of the subjects, the data could be considered representative of teachers with functionally healthy voice.  相似文献   

8.
The messa di voce, in its pure form a crescendo and decrescendo on one note, has been revered for centuries in classical singing, but the pedagogical assumptions of linearity and symmetry have received little critical assessment, especially across a wide fundamental frequency (F0) range. Five trained female classical singers performed a total of 318 messe di voce across their musical F0 range to identify its acoustical characteristics and the influence of F0. Sound pressure level (SPL) range was generally greater during crescendo at higher F0's and during decrescendo at lower FO's. Change in SPL during the messa di voce was predominantly nonlinear, and the shape of the SPL traces differed greatly between crescendo and decrescendo. Nonlinearity in SPL change was not related to SPL range but did show a F0 influence in decrescendo. Change in spectral balance (0-2 vs. 2-4 kHz) with respect to SPL change showed markedly more symmetry than linearity, so that changes in the mode of phonation during the messa di voce were dependent upon SPL regardless of whether the singer was in crescendo or decrescendo. Perceptual and physiological implications are discussed.  相似文献   

9.
This study investigated which acoustic cues within the speech signal are responsible for bimodal speech perception benefit. Seven cochlear implant (CI) users with usable residual hearing at low frequencies in the non-implanted ear participated. Sentence tests were performed in near-quiet (some noise on the CI side to reduce scores from ceiling) and in a modulated noise background, with the implant alone and with the addition, in the hearing ear, of one of four types of acoustic signals derived from the same sentences: (1) a complex tone modulated by the fundamental frequency (F0) and amplitude envelope contours; (2) a pure tone modulated by the F0 and amplitude contours; (3) a noise-vocoded signal; (4) unprocessed speech. The modulated tones provided F0 information without spectral shape information, whilst the vocoded signal presented spectral shape information without F0 information. For the group as a whole, only the unprocessed speech condition provided significant benefit over implant-alone scores, in both near-quiet and noise. This suggests that, on average, F0 or spectral cues in isolation provided limited benefit for these subjects in the tested listening conditions, and that the significant benefit observed in the full-signal condition was derived from implantees' use of a combination of these cues.  相似文献   

10.
Vocal warm-up was studied in terms of changes in voice parameters during a 45-minute vocal loading session in the morning. The voices of a randomly chosen group of 40 female and 40 male young students were loaded by having them read a novel aloud. The exposure groups (5 females and 5 males per cell) consisted of eight combinations of the following factors: (1) low (25 +/- 5%) or high (65 +/- 5%) relative humidity of ambient air; (2) low [< 65 dB(SPL)] or high [> 65 dB(SPL)] speech output level during vocal loading; (3) sitting or standing posture during vocal loading. Two sets of voice samples were recorded: a resting sample before the loading session and a loading sample after the loading session. The material recorded consisted of /pa:ppa/ words produced normally, as softly and as loudly as possible in this order by all subjects. The long /a/ vowel of the test word was inverse-filtered to obtain the glottal flow waveform. Time domain parameters of the glottal flow [open quotient (OQ), closing quotient (CQ), speed quotient (SQ), fundamental frequency (F0)], amplitude domain parameters of the glottal flow [glottal flow (fAC) and its logarithm, minimum of the first derivative of the glottal flow (dpeak) and its logarithm, amplitude quotient (AQ), and a new parameter, CQAQ], intraoral pressure (p), and sound pressure level (SPL) values of the phonations were analyzed. Voice range profiles (VRP) and the singer's formant (g/G, a/A, cl/c, e1/e, g1/g for females/males) of the loud phonation were also measured. Statistically significant differences between the preloading and postloading samples could be seen in many parameters, but the differences depended on gender and the type of phonation. In females the values of CQ, AQ, and CQAQ decreased and the values of SQ and p increased in normal phonations; the values of fAC, dpeak, and SPL increased in soft phonations; the values of AQ and CQAQ decreased in loud phonations; the harmonic energy in the singer's formant region increased significantly at every pitch. In males the values of OQ and AQ decreased and the values of dpeak, F0, p, and SPL increased in normal phonations; the values of fAC and p increased in soft phonations. The changes could be interpreted as signs of a shift toward hyperfunctional voice production. Low humidity was associated with more hyperfunctional changes than high humidity. High output was associated with more hyperfunctional changes than low output. Sitting position was associated with an increasing trend at both margins of male VRP, whereas the case was the opposite for standing position.  相似文献   

11.
12.
To examine spectral effects on declines in speech recognition in noise at high levels, word recognition for 18 young adults with normal hearing was assessed for low-pass-filtered speech and speech-shaped maskers or high-pass-filtered speech and speech-shaped maskers at three speech levels (70, 77, and 84 dB SPL) for each of three signal-to-noise ratios (+8, +3, and -2 dB). An additional low-level noise produced equivalent masked thresholds for all subjects. Pure-tone thresholds were measured in quiet and in all maskers. If word recognition was determined entirely by signal-to-noise ratio, and was independent of signal levels and the spectral content of speech and maskers, scores should remain constant with increasing level for both low- and high-frequency speech and maskers. Recognition of low-frequency speech in low-frequency maskers and high-frequency speech in high-frequency maskers decreased significantly with increasing speech level when signal-to-noise ratio was held constant. For low-frequency speech and speech-shaped maskers, the decline was attributed to nonlinear growth of masking which reduced the "effective" signal-to-noise ratio at high levels, similar to previous results for broadband speech and speech-shaped maskers. Masking growth and reduced "effective" signal-to-noise ratio accounted for some but not all the decline in recognition of high-frequency speech in high-frequency maskers.  相似文献   

13.
The acoustic effects of the adjustment in vocal effort that is required when the distance between speaker and addressee is varied over a large range (0.3-187.5 m) were investigated in phonated and, at shorter distances, also in whispered speech. Several characteristics were studied in the same sentence produced by men, women, and 7-year-old boys and girls: duration of vowels and consonants, pausing and occurrence of creaky voice, mean and range of F0, certain formant frequencies (F1 in [a] and F3), sound-pressure level (SPL) of voiced segments and [s], and spectral emphasis. In addition to levels and emphasis, vowel duration, F0, and F1 were substantially affected. "Vocal effort" was defined as the communication distance estimated by a group of listeners for each utterance. Most of the observed effects correlated better with this measure than with the actual distance, since some additional factors affected the speakers' choice. Differences between speaker groups emerged in segment durations, pausing behavior, and in the extent to which the SPL of [s] was affected. The whispered versions are compared with the phonated versions produced by the same speakers at the same distance. Several effects of whispering are found to be similar to those of increasing vocal effort.  相似文献   

14.
The effects of intensity on monosyllabic word recognition were studied in adults with normal hearing and mild-to-moderate sensorineural hearing loss. The stimuli were bandlimited NU#6 word lists presented in quiet and talker-spectrum-matched noise. Speech levels ranged from 64 to 99 dB SPL and S/N ratios from 28 to -4 dB. In quiet, the performance of normal-hearing subjects remained essentially constant in noise, at a fixed S/N ratio, it decreased as a linear function of speech level. Hearing-impaired subjects performed like normal-hearing subjects tested in noise when the data were corrected for the effects of audibility loss. From these and other results, it was concluded that: (1) speech intelligibility in noise decreases when speech levels exceed 69 dB SPL and the S/N ratio remains constant; (2) the effects of speech and noise level are synergistic; (3) the deterioration in intelligibility can be modeled as a relative increase in the effective masking level; (4) normal-hearing and hearing-impaired subjects are affected similarly by increased signal level when differences in speech audibility are considered; (5) the negative effects of increasing speech and noise levels on speech recognition are similar for all adult subjects, at least up to 80 years; and (6) the effective dynamic range of speech may be larger than the commonly assumed value of 30 dB.  相似文献   

15.
It was explored how three types of intensive cognitive load typical of military aviation (load on situation awareness, information processing, or decision-making) affect speech. The utterances of 13 male military pilots were recorded during simulated combat flights. Articulation rate was calculated from the speech samples, and the first formant (F1) and second formant (F2) were tracked from first-syllable short vowels in pre-defined phoneme environments. Articulation rate was found to correlate negatively (albeit with low coefficients) with loads on situation awareness and decision-making but not with changes in F1 or F2. Changes were seen in the spectrum of the vowels: mean F1 of front vowels usually increased and their mean F2 decreased as a function of cognitive load, and both F1 and F2 of back vowels increased. The strongest associations were seen between the three types of cognitive load and F1 and F2 changes in back vowels. Because fluent and clear radio speech communication is vital to safety in aviation and temporal and spectral changes may affect speech intelligibility, careful use of standard aviation phraseology and training in the production of clear speech during a high level of cognitive load are important measures that diminish the probability of possible misunderstandings.  相似文献   

16.
This study investigated the relation of symptoms of vocal fatigue to acoustic variables reflecting type of voice production and the effects of vocal loading. Seventy-nine female primary school teachers volunteered as subjects. Before and after a working day, (1) a 1-minute text reading sample was recorded at habitual loudness and loudly (as in large classroom), (2) a prolonged phonation on [a:] was recorded at habitual speaking pitch and loudness, and (3) a questionnaire about voice quality, ease, or difficulty of phonation and tiredness of throat was completed. The samples were analyzed for average fundamental frequency (F0), sound pressure level (SPL), and phonation type reflecting alpha ratio (SPL [1-5 kHz]-SPL [50 Hz-1 kHz]). The vowel samples were additionally analyzed for perturbation (jitter and shimmer). After a working day, F0, SPL, and alpha ratio were higher, jitter and shimmer values were lower, and more tiredness of throat was reported. The average levels of the acoustic parameters did not correlate with the symptoms. Increase in jitter and mean F0 in loud reading correlated with tiredness of throat. The results seem to suggest that, at least among experienced vocal professionals, voice production type had little relevance from the point of view of vocal fatigue reported. Differences in the acoustic parameters after a vocally loading working day mainly seem to reflect increased muscle activity as a consequence of vocal loading.  相似文献   

17.
Auditory feedback during speech production is known to play a role in speech sound acquisition and is also important for the maintenance of accurate articulation. In two studies the first formant (F1) of monosyllabic consonant-vowel-consonant words (CVCs) was shifted electronically and fed back to the participant very quickly so that participants perceived the modified speech as their own productions. When feedback was shifted up (experiment 1 and 2) or down (experiment 1) participants compensated by producing F1 in the opposite frequency direction from baseline. The threshold size of manipulation that initiated a compensation in F1 was usually greater than 60 Hz. When normal feedback was returned, F1 did not return immediately to baseline but showed an exponential deadaptation pattern. Experiment 1 showed that this effect was not influenced by the direction of the F1 shift, with both raising and lowering of F1 exhibiting the same effects. Experiment 2 showed that manipulating the number of trials that F1 was held at the maximum shift in frequency (0, 15, 45 trials) did not influence the recovery from adaptation. There was a correlation between the lag-one autocorrelation of trial-to-trial changes in F1 in the baseline recordings and the magnitude of compensation. Some participants therefore appeared to more actively stabilize their productions from trial-to-trial. The results provide insight into the perceptual control of speech and the representations that govern sensorimotor coordination.  相似文献   

18.
These studies investigated formant frequency discrimination by Japanese macaques (Macaca fuscata) using an AX discrimination procedure and techniques of operant conditioning. Nonhuman subjects were significantly more sensitive to increments in the center frequency of either the first (F1) or second (F2) formant of single-formant complexes than to corresponding pure-tone frequency shifts. Furthermore, difference limens (DLs) for multiformant signals were not significantly different than those for single-formant stimuli. These results suggest that Japanese monkeys process formant and pure-tone frequency increments differentially and that the same mechanisms mediate formant frequency discrimination in single-formant and vowel-like complexes. The importance of two of the cues available to mediate formant frequency discrimination, changes in the phase and the amplitude spectra of the signals, was investigated by independently manipulating these two parameters. Results of the studies indicated that phase cues were not a significant feature of formant frequency discrimination by Japanese macaques. Rather, subjects attended to relative level changes in harmonics within a narrow frequency range near F1 and F2 to detect formant frequency increments. These findings are compared to human formant discrimination data and suggest that both species rely on detecting alterations in spectral shape to discriminate formant frequency shifts. Implications of the results for animal models of speech perception are discussed.  相似文献   

19.
The purpose of this investigation was to study voice changes during a working day. The subjects consisted of 33 female primary and secondary schoolteachers who recorded their first and last lessons during one school day. The subjects were studied both as one group and two subgroups (those with many and those with few voice complaints). Estimates of fundamental frequency (F0), sound pressure level (SPL), the standard deviations of these values (F0 SD; SPL SD) and F0 time (vibration time of vocal folds) were made. The most obvious change due to loading was the rise of F0 that was 9.7 Hz between the first and last lesson (P = 0.00). F0 increased more (12.8. Hz, P = 0.006) in the subgroup with few complaints.  相似文献   

20.
Coarticulation studies in speech of deaf individuals have so far focused on intrasyllabic patterning of various consonant-vowel sequences. In this study, both inter- and intrasyllabic patterning were examined in disyllables /symbol see text #CVC/ and the effects of phonetic context, speaking rate, and segment type were explored. Systematic observation of F2 and durational measurements in disyllables minimally contrasting in vocalic ([i], [u,][a]) and in consonant ([b], [d]) context, respectively, was made at selected locations in the disyllable, in order to relate inferences about articulatory adjustments with their temporal coordinates. Results indicated that intervocalic coarticulation across hearing and deaf speakers varied as a function of the phonetic composition of disyllables (b_b or d_d). The deaf speakers showed reduced intervocalic coarticulation for bilabial but not for alveolar disyllables compared to the hearing speakers. Furthermore, they showed less marked consonant influences on the schwa and stressed vowel of disyllables compared to the hearing controls. Rate effects were minimal and did not alter the coarticulatory patterns observed across hearing status. The above findings modify the conclusions drawn from previous studies and suggest that the speech of deaf and hearing speakers is guided by different gestural organization.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号