期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

1.

R D Glave A C Rietveld 《The Journal of the Acoustical Society of America》1979,66(4):1018-1022

This paper presents a bimodal (audio-visual) study of speech loudness. The same acoustic stimuli (three sustained vowels of the articulatory qualities "effort" and "noneffort") are first presented in isolation, and then simultaneously together with an appropriate optical stimulus (the speaker's face on a video screen, synchronously producing the vowels). By the method of paired comparisons (law of comparative judgment) subjective loudness differences could be represented by different intervals between scale values. By this method previous results of effort-dependent speech loudness could be verified. In the bimodal study the optical cues have a measurable effect, but the acoustic cues are still dominant. Visual cues act most effectively if they are presented naturally, i.e., if acoustic and optical effort cues vary in the same direction. The experiments provide some evidence that speech loudness can be influenced by other than acoustic variables. 相似文献

2.

Acoustic level and vocal effort as cues for the loudness of speech

G D Allen 《The Journal of the Acoustical Society of America》1971,49(6):1831-1841

相似文献

3.

The role of visual speech cues in reducing energetic and informational masking

Helfer KS Freyman RL 《The Journal of the Acoustical Society of America》2005,117(2):842-849

Two experiments compared the effect of supplying visual speech information (e.g., lipreading cues) on the ability to hear one female talker's voice in the presence of steady-state noise or a masking complex consisting of two other female voices. In the first experiment intelligibility of sentences was measured in the presence of the two types of maskers with and without perceived spatial separation of target and masker. The second study tested detection of sentences in the same experimental conditions. Results showed that visual cues provided more benefit for both recognition and detection of speech when the masker consisted of other voices (versus steady-state noise). Moreover, visual cues provided greater benefit when the target speech and masker were spatially coincident versus when they appeared to arise from different spatial locations. The data obtained here are consistent with the hypothesis that lipreading cues help to segregate a target voice from competing voices, in addition to the established benefit of supplementing masked phonetic information. 相似文献

4.

Binaural release from masking for speech and gain in intelligibility 总被引：2，自引：0，他引：2

H Levitt L R Rabiner 《The Journal of the Acoustical Society of America》1967,42(3):601-608

相似文献

5.

Dependence of binaural loudness summation on interaural level difference and frequency for pure tones

ZHANG Jie & MAO DongXing Institute of Acoustics Tongji University Shanghai China 《中国科学:物理学力学天文学(英文版)》2010,(5)

Most of the existing loudness models are based on the diotic listening hypothesis,though human beings always hear in dichotic listening conditions.In this situation,the arithmetic mean of loudness at both ears is usually taken as the approximate value of overall perceived loudness,unaffected by the interaural level difference(ILD).The present work investigated the overall perceived loudness for pure tones in dichotic listening conditions through a subjective experiment.Two experimental procedures and system... 相似文献

6.

Most comfortable loudness for pure tones, noise, and speech

I M Ventry R W Woods 《The Journal of the Acoustical Society of America》1971,49(6):1805-1813

相似文献

7.

Binaural benefit for speech recognition with spectral mismatch across ears in simulated electric hearing

Yoon YS Liu A Fu QJ 《The Journal of the Acoustical Society of America》2011,130(2):EL94-E100

The present study investigated the effects of binaural spectral mismatch on binaural benefits in the context of bilateral cochlear implants using acoustic simulations. Binaural spectral mismatch was systematically manipulated by simulating changes in the relative insertion depths across ears. Sentence recognition, presented unilaterally and bilaterally, were measured in normal-hearing listeners in quiet and noise at +5 dB signal-to-noise ratio. Significant binaural benefits were observed when the interaural difference in insertion depth was 1 mm or less. This result suggests a dependence of the binaural benefit on redundant speech information, rather than on similarity in performance across ears. 相似文献

8.

Beneficial acoustic speech cues for cochlear implant users with residual acoustic hearing

Visram AS Azadpour M Kluk K McKay CM 《The Journal of the Acoustical Society of America》2012,131(5):4042-4050

This study investigated which acoustic cues within the speech signal are responsible for bimodal speech perception benefit. Seven cochlear implant (CI) users with usable residual hearing at low frequencies in the non-implanted ear participated. Sentence tests were performed in near-quiet (some noise on the CI side to reduce scores from ceiling) and in a modulated noise background, with the implant alone and with the addition, in the hearing ear, of one of four types of acoustic signals derived from the same sentences: (1) a complex tone modulated by the fundamental frequency (F0) and amplitude envelope contours; (2) a pure tone modulated by the F0 and amplitude contours; (3) a noise-vocoded signal; (4) unprocessed speech. The modulated tones provided F0 information without spectral shape information, whilst the vocoded signal presented spectral shape information without F0 information. For the group as a whole, only the unprocessed speech condition provided significant benefit over implant-alone scores, in both near-quiet and noise. This suggests that, on average, F0 or spectral cues in isolation provided limited benefit for these subjects in the tested listening conditions, and that the significant benefit observed in the full-signal condition was derived from implantees' use of a combination of these cues. 相似文献

9.

Binaural speech unmasking and localization in noise with bilateral cochlear implants using envelope and fine-timing based strategies

van Hoesel R Böhm M Pesch J Vandali A Battmer RD Lenarz T 《The Journal of the Acoustical Society of America》2008,123(4):2249-2263

Four adult bilateral cochlear implant users, with good open-set sentence recognition, were tested with three different sound coding strategies for binaural speech unmasking and their ability to localize 100 and 500 Hz click trains in noise. Two of the strategies tested were envelope-based strategies that are clinically widely used. The third was a research strategy that additionally preserved fine-timing cues at low frequencies. Speech reception thresholds were determined in diotic noise for diotic and interaurally time-delayed speech using direct audio input to a bilateral research processor. Localization in noise was assessed in the free field. Overall results, for both speech and localization tests, were similar with all three strategies. None provided a binaural speech unmasking advantage due to the application of 700 micros interaural time delay to the speech signal, and localization results showed similar response patterns across strategies that were well accounted for by the use of broadband interaural level cues. The data from both experiments combined indicate that, in contrast to normal hearing, timing cues available from natural head-width delays do not offer binaural advantages with present methods of electrical stimulation, even when fine-timing cues are explicitly coded. 相似文献

10.

Pulse-rate discrimination by cochlear-implant and normal-hearing listeners with and without binaural cues

Carlyon RP Long CJ Deeks JM 《The Journal of the Acoustical Society of America》2008,123(4):2276-2286

Experiment 1 measured rate discrimination of electric pulse trains by bilateral cochlear implant (CI) users, for standard rates of 100, 200, and 300 pps. In the diotic condition the pulses were presented simultaneously to the two ears. Consistent with previous results with unilateral stimulation, performance deteriorated at higher standard rates. In the signal interval of each trial in the dichotic condition, the standard rate was presented to the left ear and the (higher) signal rate was presented to the right ear; the non-signal intervals were the same as in the diotic condition. Performance in the dichotic condition was better for some listeners than in the diotic condition for standard rates of 100 and 200 pps, but not at 300 pps. It is concluded that the deterioration in rate discrimination observed for CI users at high rates cannot be alleviated by the introduction of a binaural cue, and is unlikely to be limited solely by central pitch processes. Experiment 2 performed an analogous experiment in which 300-pps acoustic pulse trains were bandpass filtered (3900-5400 Hz) and presented in a noise background to normal-hearing listeners. Unlike the results of experiment 1, performance was superior in the dichotic than in the diotic condition. 相似文献

11.

Use of high-rate envelope speech cues and their perceptually relevant dynamic range for the hearing impaired

MA Stone K Anton BC Moore 《The Journal of the Acoustical Society of America》2012,132(2):1141-1151

The ability of hearing-impaired (HI) listeners to use high-rate envelope information in a competing-talker situation was assessed. In experiment 1, signals were tone vocoded and the cutoff frequency (f(c)) of the envelope extraction filter was either 50?Hz (E filter) or 200?Hz (P filter). The channels for which the P or E filter was used were varied. Intelligibility was higher with the P filter regardless of whether it was used for low or high center frequencies. Performance was best when the P filter was used for all channels. Experiment 2 explored the dynamic range over which HI listeners made use of high-rate cues. In each channel of a vocoder, the envelope extracted using f(c)?=?16?Hz was replaced by the envelope extracted using f(c)?=?300?Hz, either at the peaks or valleys, with a parametrically varied "switching threshold." For a target-to-background ratio of +5?dB, changes in speech intelligibility occurred mainly when the switching threshold was between -8 and +8?dB relative to the channel root-mean-square level. This range is similar in width to, but about 3?dB higher in absolute level than, that found for normal-hearing listeners, despite the reduced dynamic range of the HI listeners. 相似文献

12.

Effects of low-pass filtering on the intelligibility of speech in quiet for people with and without dead regions at high frequencies

Vickers DA Moore BC Baer T 《The Journal of the Acoustical Society of America》2001,110(2):1164-1175

A dead region is a region of the cochlea where there are no functioning inner hair cells (IHCs) and/or neurons; it can be characterized in terms of the characteristic frequencies of the IHCs bordering that region. We examined the effect of high-frequency amplification on speech perception for subjects with high-frequency hearing loss with and without dead regions. The limits of any dead regions were defined by measuring psychophysical tuning curves and were confirmed using the TEN test described in Moore et al. [Br. J. Audiol. 34, 205-224 (2000)]. The speech stimuli were vowel-consonant-vowel (VCV) nonsense syllables, using one of three vowels (/i/, /a/, and /u/) and 21 different consonants. In a baseline condition, subjects were tested using broadband stimuli with a nominal input level of 65 dB SPL. Prior to presentation via Sennheiser HD580 earphones, the stimuli were subjected to the frequency-gain characteristic prescribed by the "Cambridge" formula, which is intended to give speech at 65 dB SPL the same overall loudness as for a normal listener, and to make the average loudness of the speech the same for each critical band over the frequency range important for speech intelligibility (in a listener without a dead region). The stimuli for all other conditions were initially subjected to this same frequency-gain characteristic. Then, the speech was low-pass filtered with various cutoff frequencies. For subjects without dead regions, performance generally improved progressively with increasing cutoff frequency. This indicates that they benefited from high-frequency information. For subjects with dead regions, two patterns of performance were observed. For most subjects, performance improved with increasing cutoff frequency until the cutoff frequency was somewhat above the estimated edge frequency of the dead region, but hardly changed with further increases. For a few subjects, performance initially improved with increasing cutoff frequency and then worsened with further increases, although the worsening was significant only for one subject. The results have important implications for the fitting of hearing aids. 相似文献

13.

Vocal Behavior and Vocal Loading Factors for Preschool Teachers at Work Studied with Binaural DAT Recordings

Maria Sdersten PhD Svante Granqvist Britta Hammarberg Annika Szabo 《Journal of voice》2002,16(3):356-371

Preschool teachers are at risk for developing voice problems such as vocal fatigue and vocal nodules. The purpose of this report was to study preschool teachers' voice use during work. Ten healthy female preschool teachers working at daycare centers (DCC) served as subjects. A binaural recording technique was used. Two microphones were placed on both sides of the subject's head, at equal distance from the mouth, and a portable DAT recorder was attached to the subject's waist. Recordings were made of a standard reading passage before work (baseline) and of spontaneous speech during work. The recording technique allowed separate analyses of the level of the background noise, and of the subjects' voice sound pressure level, mean fundamental frequency, and total phonation time. Among the results, mean background noise level for the ten DCCs was 76.1 dBA (range 73.0-78.2), which is more than 20 dB higher than what is recommended where speech communication is important (50-55 dBA). The subjects spoke on an average of 9.1 dB louder (p < 0.0001), and with higher mean fundamental frequency (247 Hz) during work as compared to the baseline (202 Hz) (p < 0.0001). Mean phonation time for the group was 17%, which was considered high. It was concluded that preschool teachers do have a highly vocally demanding profession. Important steps to reduce the vocal loading for this occupation would be to decrease the background noise levels and include pauses so that preschool teachers can rest their voices. 相似文献

14.

On the use of symmetrized dot patterns for the visual characterization of speech waveforms and other sampled data 总被引：1，自引：0，他引：1

C A Pickover 《The Journal of the Acoustical Society of America》1986,80(3):955-960

While the spectrogram (and related graphic analyses) have been invaluable in showing the general frequency content of an input signal, sometimes it is difficult for trained and untrained users to see on the spectrogram differences which are perceptible to the ear. In this paper, several demonstrations of a novel representation are presented which, in some cases, can make subtle differences in input signals obvious to the human analyst. The representation, a "symmetrized dot pattern" (SDP), provides a stimulus in which local visual correlations are integrated to form a global percept and can potentially be applied to the detection and characterization of significant features of any sampled data. 相似文献

15.

Faciliation of Mandarin tone perception by visual speech in clear and degraded audio: implications for cochlear implants

Smith D Burnham D 《The Journal of the Acoustical Society of America》2012,131(2):1480-1489

Cochlear implant (CI) users in tone language environments report great difficulty in perceiving lexical tone. This study investigated the augmentation of simulated cochlear implant audio by visual (facial) speech information for tone. Native speakers of Mandarin and Australian English were asked to discriminate between minimal pairs of Mandarin tones in five conditions: Auditory-Only, Auditory-Visual, CI-simulated Auditory-Only, CI-simulated Auditory-Visual, and Visual-Only (silent video). Discrimination in CI-simulated audio conditions was poor compared with normal audio, and varied according to tone pair, with tone pairs with strong non-F0 cues discriminated the most easily. The availability of visual speech information also improved discrimination in the CI-simulated audio conditions, particularly on tone pairs with strong durational cues. In the silent Visual-Only condition, both Mandarin and Australian English speakers discriminated tones above chance levels. Interestingly, tone-nai?ve listeners outperformed native listeners in the Visual-Only condition, suggesting firstly that visual speech information for tone is available, and may in fact be under-used by normal-hearing tone language perceivers, and secondly that the perception of such information may be language-general, rather than the product of language-specific learning. This may find application in the development of methods to improve tone perception in CI users in tone language environments. 相似文献

16.

Analysis of hidden-charm pentaquark molecular states with and without strangeness via the QCD sum rules

Zhi-Gang Wang Qi Xin 《中国物理C(英文版)》2021,45(12):123105-123105-11

In this study, we investigate the

\begin{document}$\bar{D}\Sigma_c$\end{document}

,

\begin{document}$\bar{D}\Xi^\prime_c$\end{document}

,

\begin{document}$\bar{D}\Sigma_c^*$\end{document}

,

\begin{document}$\bar{D}\Xi_c^*$\end{document}

,

\begin{document}$\bar{D}^{*}\Sigma_c$\end{document}

,

\begin{document}$\bar{D}^{*}\Xi^\prime_c$\end{document}

,

\begin{document}$\bar{D}^{*}\Sigma_c^*$\end{document}

, and

\begin{document}$\bar{D}^{*}\Xi_c^*$\end{document}

pentaquark molecular states with and without strangeness via the QCD sum rules in detail, focusing on the light flavor,

\begin{document}$SU(3)$\end{document}

, breaking effects, and make predictions for new pentaquark molecular states besides assigning

\begin{document}$P_c(4312)$\end{document}

,

\begin{document}$P_c(4380)$\end{document}

,

\begin{document}$P_c(4440)$\end{document}

,

\begin{document}$P_c(4457)$\end{document}

, and

\begin{document}$P_{cs}(4459)$\end{document}

self-consistently. In the future, we can search for these pentaquark molecular states in the decay of

\begin{document}$\Lambda_b^0$\end{document}

,

\begin{document}$\Xi_b^0$\end{document}

, and

\begin{document}$\Xi_b^-$\end{document}

. Furthermore, we discuss high-dimensional vacuum condensates in detail. 相似文献

17.

Spectral Action for Torsion with and without Boundaries

B.?Iochum Email author C.?Levy D.?Vassilevich 《Communications in Mathematical Physics》2012,310(2):367-382

We derive a commutative spectral triple and study the spectral action for a rather general geometric setting which includes the (skew-symmetric) torsion and the chiral bag conditions on the boundary. The spectral action splits into bulk and boundary parts. In the bulk, we clarify certain issues of the previous calculations, show that many terms in fact cancel out, and demonstrate that this cancellation is a result of the chiral symmetry of spectral action. On the boundary, we calculate several leading terms in the expansion of spectral action in four dimensions for vanishing chiral parameter θ of the boundary conditions, and show that θ = 0 is a critical point of the action in any dimension and at all orders of the expansion. 相似文献

18.

Audibility-based predictions of speech recognition for children and adults with normal hearing

McCreery RW Stelmachowicz PG 《The Journal of the Acoustical Society of America》2011,130(6):4070-4081

This study investigated the relationship between audibility and predictions of speech recognition for children and adults with normal hearing. The Speech Intelligibility Index (SII) is used to quantify the audibility of speech signals and can be applied to transfer functions to predict speech recognition scores. Although the SII is used clinically with children, relatively few studies have evaluated SII predictions of children's speech recognition directly. Children have required more audibility than adults to reach maximum levels of speech understanding in previous studies. Furthermore, children may require greater bandwidth than adults for optimal speech understanding, which could influence frequency-importance functions used to calculate the SII. Speech recognition was measured for 116 children and 19 adults with normal hearing. Stimulus bandwidth and background noise level were varied systematically in order to evaluate speech recognition as predicted by the SII and derive frequency-importance functions for children and adults. Results suggested that children required greater audibility to reach the same level of speech understanding as adults. However, differences in performance between adults and children did not vary across frequency bands. 相似文献

19.

A computer interface for psychophysical and speech research with the Nucleus cochlear implant 总被引：4，自引：0，他引：4

R V Shannon D D Adams R L Ferrel R L Palumbo M Grandgenett 《The Journal of the Acoustical Society of America》1990,87(2):905-907

A computer interface has been designed and implemented that allows presentation of biphasic pulse stimuli to patients with the Nucleus Ltd./Cochlear Corporation cochlear implant. The one version of the interface connects to a standard parallel output port of a PC or AT compatible computer, and another version plugs directly into a standard PC/XT bus slot. The host computer sends a stream of bytes to the parallel port that specifies the configuration of the desired output pulses. Upon receipt of the data, the interface generates the appropriate burst sequence that is delivered to the patient's external transmitter coil. The coded information is interpreted by the internal receiver that delivers the pulse to the specified electrodes at the specified amplitude and pulse width. This interface makes it possible to interleave pulses on two or more electrode pairs, to modulate the amplitude or timing of a pulse sequence, or to sweep a stimulus across the electrode array. Investigators can achieve stimulus control with this interface that allows them to conduct psychophysical, electrophysiological, and speech experiments not possible through the patient's speech processor or with available clinical interfaces. 相似文献

20.

Color appearance and visual measurements for color samples with gloss effect 总被引：2，自引：0，他引：2

马健徐海松 M.Ronnier Luo 《中国光学快报(英文版)》2009,7(9):869-872

We assess the color appearance of the samples with different inks on glossy substrates, five kinds of paper with different gloss levels. The color samples are measured using spectrophotometers under different illuminating/viewing geometries and visually estimated using the psychophysical method of magnitude estimation. The results of the two approaches are compared through the color appearance model of CIECAM02. The experimental data analysis indicates that the 0/45 and 15/0 geometries can be used to describe the two major aspects of gloss effect, the enlargement of color gamut, and the reduction of lightness. The agreement for hue attribute between instrumental measurement and visual assessment is better than those for colorfulness and lightness. 相似文献