期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

1.

Acoustic-phonetic characteristics of speech produced in noise and while wearing an oxygen mask

Z S Bond T J Moore B Gable 《The Journal of the Acoustical Society of America》1989,85(2):907-912

The present study investigated changes in the prosodic and acoustic-phonetic features of isolated words by four male talkers speaking in quite and in pink noise at a level of 95 dB SPL. Speech samples were collected both with and without an oxygen mask. Changes in duration, fundamental frequency, total energy, and formant center frequency were analyzed. In addition to the expected changes of increased pitch and amplitude associated with speaking in noise without an oxygen mask, significant effects were found (particularly in the formant center frequencies) as a result of using the oxygen mask. When the oxygen mask was employed, no further significant changes were caused by adding noise to the speaking situation. 相似文献

2.

Timing interference to speech in altered listening conditions

Howell P Sackin S 《The Journal of the Acoustical Society of America》2002,111(6):2842-2852

A theory is outlined that explains the disruption that occurs when auditory feedback is altered. The key part of the theory is that the number of, and relationship between, inputs to a timekeeper, operative during speech control, affects speech performance. The effects of alteration to auditory feedback depend on the extra input provided to the timekeeper. Different disruption is predicted for auditory feedback that is out of synchrony with other speech activity (e.g., delayed auditory feedback, DAF) compared with synchronous forms of altered feedback (e.g., frequency shifted feedback, FSF). Stimulus manipulations that can be made synchronously with speech are predicted to cause equivalent disruption to the synchronous form of altered feedback. Three experiments are reported. In all of them, subjects repeated a syllable at a fixed rate (Wing and Kristofferson, 1973). Overall timing variance was decomposed into the variance of a timekeeper (Cv) and the variance of a motor process (Mv). Experiment 1 validated Wing and Kristofferson's method for estimating Cv in a speech task by showing that only this variance component increased when subjects repeated syllables at different rates. Experiment 2 showed DAF increased Cv compared with when no altered sound occurred (experiment 1) and compared with FSF. In experiment 3, sections of the subject's output sequence were increased in amplitude. Subjects just heard this sound in one condition and made a duration decision about it in a second condition. When no response was made, results were like those with FSF. When a response was made, Cv increased at longer repetition periods. The findings that the principal effect of DAF, a duration decision and repetition period is on Cv whereas synchronous alterations that do not require a decision (amplitude increased sections where no response was made and FSF) do not affect Cv, support the hypothesis that the timekeeping process is affected by synchronized and asynchronized inputs in different ways. 相似文献

3.

Receiver-operating characteristics determined under several interaural conditions of listening

D S Emmerich 《The Journal of the Acoustical Society of America》1968,43(2):298-307

相似文献

4.

Acoustical and perceptual characteristics of speech produced with an electronic artificial larynx.

M S Weiss G H Yeni-Komshian J M Heinz 《The Journal of the Acoustical Society of America》1979,65(5):1298-1308

相似文献

5.

Optimum speech level to minimize listening difficulty in public spaces

Kobayashi M Morimoto M Sato H Sato H 《The Journal of the Acoustical Society of America》2007,121(1):251-256

For ideal speech communication in public spaces, it is important to determine the optimum speech level for various background noise levels. However, speech intelligibility scores, which is conventionally used as the subjective listening test to measure the quality of speech communication, is near perfect in most everyday situations. For this reason, it is proposed to determine optimum speech levels for speech communication in public spaces by using listening difficulty ratings. Two kinds of listening test were carried out in this work. The results of the tests and our previous work [M. Morimoto, H. Sato, and M. Kobayashi, J. Acoust. Soc. Am. 116, 1607-1613 (2004)] are jointly discussed for suggesting the relation between the optimum speech level and background noise level. The results demonstrate that: (1) optimum speech level is constant when background noise level is lower than 40 dBA, (2) optimum speech level appears to be the level, which maintains around 15 dBA of SN ratio when the background noise level is more than 40 dBA, and (3) listening difficulty increases as speech level increases under the condition where SN ratio is good enough to keep intelligibility near perfect. 相似文献

6.

Auditory evoked potentials in females with high and low acceptance of background noise when listening to speech

Tampas JW Harkrider AW 《The Journal of the Acoustical Society of America》2006,119(3):1548-1561

Acceptable noise level (ANL) is a measure of a listener's acceptance of background noise when listening to speech. A consistent finding in research on ANL is large intersubject variability in the acceptance of background noise. This variability is not related to age, gender, hearing sensitivity, type of background noise, speech perception in noise performance, cochlear responses, or efferent activity of the medial olivocochlear pathway. In the present study, auditory evoked potentials were examined in 21 young females with normal hearing with low and high acceptance of background noise to determine whether differences in judgments of background noise are related to differences measured in aggregate physiological responses from the auditory nervous system. Group differences in the auditory brainstem response, auditory middle latency response, and cortical, auditory late latency response indicate that differences in more central regions of the nervous system account for, at least in part, the variability in listeners' willingness to accept background noise when listening to speech. 相似文献

7.

The spatial unmasking of speech: evidence for better-ear listening

Edmonds BA Culling JF 《The Journal of the Acoustical Society of America》2006,120(3):1539-1545

Speech reception thresholds (SRTs) were measured for target speech presented concurrently with interfering speech (spoken by a different speaker). In experiment 1, the target and interferer were divided spectrally into high- and low-frequency bands and presented over headphones in three conditions: monaural, dichotic (target and interferer to different ears), and swapped (the low-frequency target band and the high-frequency interferer band were presented to one ear, while the high-frequency target band and the low-frequency interferer band were presented to the other ear). SRTs were highest in the monaural condition and lowest in the dichotic condition; SRTs in the swapped condition were intermediate. In experiment 2, two new conditions were devised such that one target band was presented in isolation to one ear while the other band was presented at the other ear with the interferer. The pattern of SRTs observed in experiment 2 suggests that performance in the swapped condition reflects the intelligibility of the target frequency bands at just one ear; the auditory system appears unable to exploit advantageous target-to-interferer ratios at different ears when segregating target speech from a competing speech interferer. 相似文献

8.

Priming and sentence context support listening to noise-vocoded speech by younger and older adults

Sheldon S Pichora-Fuller MK Schneider BA 《The Journal of the Acoustical Society of America》2008,123(1):489-499

Older adults are known to benefit from supportive context in order to compensate for age-related reductions in perceptual and cognitive processing, including when comprehending spoken language in adverse listening conditions. In the present study, we examine how younger and older adults benefit from two types of contextual support, predictability from sentence context and priming, when identifying target words in noise-vocoded sentences. In the first part of the experiment, benefit from context based on primarily semantic knowledge was evaluated by comparing the accuracy of identification of sentence-final target words that were either highly predictable or not predictable from the sentence context. In the second part of the experiment, benefit from priming was evaluated by comparing the accuracy of identification of target words when noise-vocoded sentences were either primed or not by the presentation of the sentence context without noise vocoding and with the target word replaced with white noise. Younger and older adults benefited from each type of supportive context, with the most benefit realized when both types were combined. Supportive context reduced the number of noise-vocoded bands needed for 50% word identification more for older adults than their younger counterparts. 相似文献

9.

Influence of the processing conditions on the characteristics of the clad layers produced with laminar plasma technology

Wei Ma 《Applied Surface Science》2006,252(23):8352-8359

Laminar plasma technology was used to produce ceramic hardened layers of Al₂O₃-40% mass Ni composite powders on stainless steel substrates. In order to investigate the influences of processing conditions on the morphologies of the surface modified layers, two different powder-feeding methods were tested, one with carrier gas called the powder injection method, and the other without carrier gas called powder transfers method. The microscopic investigations demonstrate that the cross-section of the clad layers consists of two distinct microstructural regions, in which the Al₂O₃ phases exhibit different growth mechanisms. When the powder transfers method is adopted, the number density and volume fraction of the Al₂O₃ particles increase considerably and their distributions exhibit zonal periodical characteristics. When the powder-feeding rate increases, the microstructure of the Al₂O₃ phases changes from a small globular to a long needle shape. Finite element simulations show that the transient thermo-physical features of the pool substances, such as solidification rate and cooling rate, influence strongly the mechanisms of the nucleation and the directional growth of the Al₂O₃ phases in the thermal processing. 相似文献

10.

Segmental intelligibility of synthetic speech produced by rule

J S Logan B G Greene D B Pisoni 《The Journal of the Acoustical Society of America》1989,86(2):566-581

This paper reports the results of an investigation that employed the modified rhyme test (MRT) to measure the segmental intelligibility of synthetic speech generated automatically by rule. Synthetic speech produced by ten text-to-speech systems was studied and compared to natural speech. A variation of the standard MRT was also used to study the effects of response set size on perceptual confusions. Results indicated that the segmental intelligibility scores formed a continuum. Several systems displayed very high levels of performance that were close to or equal to scores obtained with natural speech; other systems displayed substantially worse performance compared to natural speech. The overall performance of the best system, DECtalk--Paul, was equivalent to the data obtained with natural speech for consonants in syllable-initial position. The findings from this study are discussed in terms of the use of a set of standardized procedures for measuring intelligibility of synthetic speech under controlled laboratory conditions. Recent work investigating the perception of synthetic speech under more severe conditions in which greater demands are made on the listener's processing resources is also considered. The wide range of intelligibility scores obtained in the present study demonstrates important differences in perception and suggests that not all synthetic speech is perceptually equivalent to the listener. 相似文献

11.

Acoustic characteristics of reticent speech

Deborah M. Rekart Cynthia F. Begnal 《Journal of voice》1989,3(4):324-336

Reticent speakers differ from nonreticent speakers in vocal characteristics, such as fundamental frequency, frequency range, fluency, and intensity, which prompt negative impressions on the part of listeners. Waveform and spectrographic analyses were performed on the vocal cues of 19 reticent and nonreticent subjects (57 speech samples). Statistically significant differences were found in fluency between reticent and nonreticent speech. Reticent male speakers also showed significantly higher F₀, whereas reticent female speakers demonstrated narrower frequency range. Identification and analysis of these characteristics are required for effective remediation. 相似文献

12.

Subjective study of preferred listening conditions in Italian Catholic churches

Francesco Martellotta 《Journal of sound and vibration》2008,317(1-2):378-399

The paper describes the results of research aimed at investigating the preferred subjective listening conditions inside churches. The effect of different musical motifs (spanning Gregorian chants to symphonic music) was investigated and regression analysis was performed in order to point out the relationship between subjective ratings and acoustical parameters. In order to present realistic listening conditions to the subjects a small subset of nine churches was selected among a larger set of acoustic data collected in several Italian churches during a widespread on-site survey. The subset represented different architectural styles and shapes, and was characterized by average listening conditions. For each church a single source–receiver combination with fixed relative positions was chosen. Measured binaural impulse responses were cross-talk cancelled and then convolved with five anechoic motifs. Paired comparisons were finally performed, asking a trained panel of subjects their preference. Factor analysis pointed out a substantially common underlying pattern characterizing subjective responses. The results show that preferred listening conditions vary as a function of the musical motif, depending on early decay time for choral music and on a combination of initial time delay and lateral energy for instrumental music. 相似文献

13.

Acoustic characteristics of Mandarin esophageal speech

Liu H Wan M Wang S Wang X Lu C 《The Journal of the Acoustical Society of America》2005,118(2):1016-1025

The present study attempted to investigate the acoustic characteristics of Mandarin laryngeal and esophageal speech. Eight normal laryngeal and seven esophageal speakers participated in the acoustic experiments. Results from acoustic analyses of syllables /ma/and /ba/ indicated that, F0, intensity, and signal-to-noise ratio of laryngeal speech were significantly higher than those of esophageal speech. However, opposite results were found for vowel duration, jitter, and shimmer. Mean F0, intensity, and word per minute in reading were greater but number of pauses was smaller in laryngeal speech than those in esophageal speech. Similar patterns of F0 contours and vowel duration as a function of tone were found between laryngeal and esophageal speakers. Long-time spectra analysis indicated that higher first and second formant frequencies were associated with esophageal speech than that with normal laryngeal speech. 相似文献

14.

Comparing the effects of reverberation and of noise on speech recognition in simulated electric-acoustic listening

Helms Tillery K Brown CA Bacon SP 《The Journal of the Acoustical Society of America》2012,131(1):416-423

Cochlear implant users report difficulty understanding speech in both noisy and reverberant environments. Electric-acoustic stimulation (EAS) is known to improve speech intelligibility in noise. However, little is known about the potential benefits of EAS in reverberation, or about how such benefits relate to those observed in noise. The present study used EAS simulations to examine these questions. Sentences were convolved with impulse responses from a model of a room whose estimated reverberation times were varied from 0 to 1 sec. These reverberated stimuli were then vocoded to simulate electric stimulation, or presented as a combination of vocoder plus low-pass filtered speech to simulate EAS. Monaural sentence recognition scores were measured in two conditions: reverberated speech and speech in a reverberated noise. The long-term spectrum and amplitude modulations of the noise were equated to the reverberant energy, allowing a comparison of the effects of the interferer (speech vs noise). Results indicate that, at least in simulation, (1) EAS provides significant benefit in reverberation; (2) the benefits of EAS in reverberation may be underestimated by those in a comparable noise; and (3) the EAS benefit in reverberation likely arises from partially preserved cues in this background accessible via the low-frequency acoustic component. 相似文献

15.

Spectral and temporal changes to speech produced in the presence of energetic and informational maskers

Cooke M Lu Y 《The Journal of the Acoustical Society of America》2010,128(4):2059-2069

Talkers change the way they speak in noisy conditions. For energetic maskers, speech production changes are relatively well-understood, but less is known about how informational maskers such as competing speech affect speech production. The current study examines the effect of energetic and informational maskers on speech production by talkers speaking alone or in pairs. Talkers produced speech in quiet and in backgrounds of speech-shaped noise, speech-modulated noise, and competing speech. Relative to quiet, speech output level and fundamental frequency increased and spectral tilt flattened in proportion to the energetic masking capacity of the background. In response to modulated backgrounds, talkers were able to reduce substantially the degree of temporal overlap with the noise, with greater reduction for the competing speech background. Reduction in foreground-background overlap can be expected to lead to a release from both energetic and informational masking for listeners. Passive changes in speech rate, mean pause length or pause distribution cannot explain the overlap reduction, which appears instead to result from a purposeful process of listening while speaking. Talkers appear to monitor the background and exploit upcoming pauses, a strategy which is particularly effective for backgrounds containing intelligible speech. 相似文献

16.

Intelligibility of speech under nonexponential decay conditions.

B Yegnanarayana B S Ramakrishna 《The Journal of the Acoustical Society of America》1975,58(4):853-857

相似文献

17.

Choice of reference conditions for speech preference tests

M H Hecker C E Williams 《The Journal of the Acoustical Society of America》1966,39(5):946-952

相似文献

18.

Temporal properties of perceptual calibration to local and broad spectral characteristics of a listening context

Alexander JM Kluender KR 《The Journal of the Acoustical Society of America》2010,128(6):3597-3513

The auditory system calibrates to reliable properties of a listening environment in ways that enhance sensitivity to less predictable (more informative) aspects of sounds. These reliable properties may be spectrally local (e.g., peaks) or global (e.g., gross tilt), but the time course over which the auditory system registers and calibrates to these properties is unknown. Understanding temporal properties of this perceptual calibration is essential for revealing underlying mechanisms that serve to increase sensitivity to changing and informative properties of sounds. Relative influence of the second formant (F(2)) and spectral tilt was measured for identification of /u/ and /i/ following precursor contexts that were harmonic complexes with frequency-modulated resonances. Precursors filtered to match F(2) or tilt of following vowels induced perceptual calibration (diminished influence) to F(2) and tilt, respectively. Calibration to F(2) was greatest for shorter duration precursors (250 ms), which implicates physiologic and/or perceptual mechanisms that are sensitive to onsets. In contrast, calibration to tilt was greatest for precursors with longer durations and higher repetition rates because greater opportunities to sample the spectrum result in more stable estimates of long-term global spectral properties. Possible mechanisms that promote sensitivity to change are discussed. 相似文献

19.

Correlation characteristics and dimensionality of speech spetra

K P Li G W Hughes A S House 《The Journal of the Acoustical Society of America》1969,46(4):1019-1025

相似文献

20.

Effect of ambient noise on the vocal output and the preferred listening level of conversational speech

E. van Heusden R. Plomp L.C.W. Pols 《Applied Acoustics》1979,12(1):31-43

The effect of ambient noise on vocal output and the preferred listening level of conversational speech was investigated under conditions typical of everyday speech communication. For a speaker-listener distance of 1 m, vocal output and the preferred listening level in quiet were both about 50 dB(A). Deviations from this value were observed when the noise level exceeded a level of about 40 dB(A). The regression lines for the data points above this level showed a 3 dB rise for a 10 dB rise in noise level. The experiments further suggest that both speaker and listener (when the latter is able to control the playback level of recorded speech) try to compensate for the noise interference by raising the level of speech in order to keep the (subjective) loudness of speech in noise equal to the loudness of speech in quiet. 相似文献