期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

1.

Vertical-plane sound localization probed with ripple-spectrum noise

Macpherson EA Middlebrooks JC 《The Journal of the Acoustical Society of America》2003,114(1):430-445

Ripple-spectrum stimuli were used to investigate the scale of spectral detail used by listeners in interpreting spectral cues for vertical-plane localization. In three experiments, free-field localization judgments were obtained for 250-ms, 0.6-16-kHz noise bursts with log-ripple spectra that varied in ripple density, peak-to-trough depth, and phase. When ripple density was varied and depth was held constant at 40 dB, listeners' localization error rates increased most (relative to rates for flat-spectrum targets) for densities of 0.5-2 ripples/oct. When depth was varied and density was held constant at 1 ripple/oct, localization accuracy was degraded only for ripple depths > or = 20 dB. When phase was varied and density was held constant at 1 ripple/oct and depth at 40 dB, three of five listeners made errors at consistent locations unrelated to the ripple phase, whereas two listeners made errors at locations systematically modulated by ripple phase. Although the reported upper limit for ripple discrimination is 10 ripples/oct [Supin et al., J. Acoust. Soc. Am. 106, 2800-2804 (1999)], present results indicate that details finer than 2 ripples/oct or coarser than 0.5 ripples/oct do not strongly influence processing of spectral cues for sound localization. The low spectral-frequency limit suggests that broad-scale spectral variation is discounted, even though components at this scale are among those contributing the most to the shapes of directional transfer functions. 相似文献

2.

The role of spectral modulation cues in virtual sound localization

Qian J Eddins DA 《The Journal of the Acoustical Society of America》2008,123(1):302-314

Sound localization cues generally include interaural time difference, interaural intensity difference, and spectral cues. The purpose of this study is to investigate the important spectral cues involved in so-called head related transfer functions (HRTFs) using a combination of HRTF analyses and a virtual sound localization (VSL) experiment. Previous psychoacoustical and physiological studies have both suggested the existence of spectral modulation frequency (SMF) channels for analyzing spectral information (e.g., the spectral cues coded in HRTFs). SMFs are in a domain related to the Fourier transform of HRTFs. The relationship between various SMF regions and sound localization was tested here by filtering or enhancing HRTFs in the SMF domain under a series of conditions using a VSL experiment. Present results revealed that azimuth localization was not significantly affected by HRTF manipulation. Applying notch filters between 0.1 and 0.4 cyclesoctave or between 0.35 and 0.65 cyclesoctave resulted in significantly less accurate elevation responses at low elevations, while spectral enhancement in these two SMF regions did not produce a significant change in sound localization. Likewise, low-pass filtering at 2 cyclesoctave did not significantly influence localization accuracy, suggesting that the major cues for sound localization are in the SMF region below 2 cyclesoctave. 相似文献

3.

Median plane localization using a parametric model of the head-related transfer function based on spectral cues

Kazuhiro Iida Motokuni Itoh Masayuki Morimoto 《Applied Acoustics》2007,68(8):835-850

To examine a simulation method for vertical sound localization, and to clarify which peaks and notches in head-related transfer functions (HRTFs) play a role as spectral cues, localization tests in the median plane were carried out using a parametric HRTF model, which is recomposed only of extracted spectral peaks and notches. The results show that the parametric HRTF recomposed using the first and second notches (N1 and N2) and the first peak (P1) provides almost the same localization accuracy as the measured HRTFs. Observations of the spectral peaks and notches indicate that N1 and N2 change remarkably as the source elevation changes, whereas P1 does not depend on the source elevation. In conclusion, N1 and N2 can be regarded as spectral cues, and the hearing system could utilize P1 as the reference information to analyze N1 and N2. 相似文献

4.

Estimation of interaural level difference based on anthropometry and its effect on sound localization

Watanabe K Ozawa K Iwaya Y Suzuki YI Aso K 《The Journal of the Acoustical Society of America》2007,122(5):2832-2841

Individualization of head-related transfer functions (HRTFs) is important for highly accurate sound localization systems such as virtual auditory displays. A method to estimate interaural level differences (ILDs) from a listener's anthropometry is presented in this paper to avoid the burden of direct measurement of HRTFs. The main result presented in this paper is that localization is improved with nonindividualized HRTF if ILD is fitted to the listener. First, the relationship between ILDs and the anthropometric parameters was analyzed using multiple regression analysis. The azimuthal variation of the ILDs in each 1/3-octave band was then estimated from the listener's anthropometric parameters. A psychoacoustical experiment was carried out to evaluate its effectiveness. The experimental results show that the adjustment of the frequency characteristics of ILDs for a listener with the proposed method is effective for localization accuracy. 相似文献

5.

The behavioral response of mice to gaps in noise depends on its spectral components and its bandwidth

Ison JR Allen PD Rivoli PJ Moore JT 《The Journal of the Acoustical Society of America》2005,117(6):3944-3951

The purpose of these experiments was to determine whether detecting brief decrements in noise level ("gaps") varies with the spectral content and bandwidth of noise in mice as it does in humans. The behavioral effect of gaps was quantified by their inhibiting a subsequent acoustic startle reflex. Gap durations from 1 to 29 ms were presented in five adjacent 1-octave noise bands and one 5-octave band, their range being 2 kHz to 64 kHz. Gaps ended 60 ms before the startle stimulus (experiment 1) or at startle onset (experiment 2). Asymptotic inhibition was greater for higher-frequency 1-octave bands and highest for the 5-octave band in both experiments, but time constants were related to frequency only in experiment 1. For the lowest band (2-4 kHz) neither noise decrements (experiment 1 and 2) nor increments (experiment 3) had any behavioral consequence, but this band was effective when presented as a pulse in quiet (experiment 4). The lowest frequencies in the most effective 1-octave band were one octave above the spectral region where mice have their best absolute thresholds. These effects are similar to those obtained in humans, and reveal a special contribution of wide band, high-frequency stimulation to temporal acuity. 相似文献

6.

Perceptual recalibration in human sound localization: learning to remediate front-back reversals

Zahorik P Bangayan P Sundareswaran V Wang K Tam C 《The Journal of the Acoustical Society of America》2006,120(1):343-359

The efficacy of a sound localization training procedure that provided listeners with auditory, visual, and proprioceptive/vestibular feedback as to the correct sound-source position was evaluated using a virtual auditory display that used nonindividualized head-related transfer functions (HRTFs). Under these degraded stimulus conditions, in which the monaural spectral cues to sound-source direction were inappropriate, localization accuracy was initially poor with frequent front-back reversals (source localized to the incorrect front-back hemifield) for five of six listeners. Short periods of training (two 30-min sessions) were found to significantly reduce the rate of front-back reversal responses for four of five listeners that showed high initial reversal rates. Reversal rates remained unchanged for all listeners in a control group that did not participate in the training procedure. Because analyses of the HRTFs used in the display demonstrated a simple and robust front-back cue related to energy in the 3-7-kHz bandwidth, it is suggested that the reductions observed in reversal rates following the training procedure resulted from improved processing of this front-back cue, which is perhaps a form of rapid perceptual recalibration. Reversal rate reductions were found to generalize to untrained source locations, and persisted at least 4 months following the training procedure. 相似文献

7.

The effect of aging on horizontal plane sound localization

Abel SM Giguère C Consoli A Papsin BC 《The Journal of the Acoustical Society of America》2000,108(2):743-752

An experiment was conducted to determine the effect of aging on sound localization. Seven groups of 16 subjects, aged 10-81 years, were tested. Sound localization was assessed using six different arrays of four or eight loudspeakers that surrounded the subject in the horizontal plane, at a distance of 1 m. For two 4-speaker arrays, one loudspeaker was positioned in each spatial quadrant, on either side of the midline or the interaural axis, respectively. For four 8-speaker arrays, two loudspeakers were positioned in each quadrant, one close to the midline and the second separated from the first by 15 degrees, 30 degrees, 45 degrees, or 60 degrees. Three different 300-ms stimuli were localized: two one-third-octave noise bands, centered at 0.5 and 4 kHz, and broadband noise. The stimulus level (75 dB SPL) was well above hearing threshold for all subjects tested. Over the age range studied, percent-correct sound-source identification judgments decreased by 12%-15%. Performance decrements were apparent as early as the third decade of life. Broadband noise was easiest to localize (both binaural and spectral cues were available), and the 0.5-kHz noise band, the most difficult to localize (primarily interaural temporal difference cue available). Accuracy was relatively higher in front of than behind the head, and errors were largely front/back mirror image reversals. A left-sided superiority was evident until the fifth decade of life. The results support the conclusions that the processing of spectral information becomes progressively less efficient with aging, and is generally worse for sources on the right side of space. 相似文献

8.

The bat head-related transfer function reveals binaural cues for sound localization in azimuth and elevation 总被引：3，自引：0，他引：3

Aytekin M Grassi E Sahota M Moss CF 《The Journal of the Acoustical Society of America》2004,116(6):3594-3605

Directional properties of the sound transformation at the ear of four intact echolocating bats, Eptesicus fuscus, were investigated via measurements of the head-related transfer function (HRTF). Contributions of external ear structures to directional features of the transfer functions were examined by remeasuring the HRTF in the absence of the pinna and tragus. The investigation mainly focused on the interactions between the spatial and the spectral features in the bat HRTF. The pinna provides gain and shapes these features over a large frequency band (20-90 kHz), and the tragus contributes gain and directionality at the high frequencies (60 to 90 kHz). Analysis of the spatial and spectral characteristics of the bat HRTF reveals that both interaural level differences (ILD) and monaural spectral features are subject to changes in sound source azimuth and elevation. Consequently, localization cues for horizontal and vertical components of the sound source location interact. Availability of multiple cues about sound source azimuth and elevation should enhance information to support reliable sound localization. These findings stress the importance of the acoustic information received at the two ears for sound localization of sonar target position in both azimuth and elevation. 相似文献

9.

A performance adequate computational model for auditory localization

Chung W Carlile S Leong P 《The Journal of the Acoustical Society of America》2000,107(1):432-445

A computational model of auditory localization resulting in performance similar to humans is reported. The model incorporates both the monaural and binaural cues available to a human for sound localization. Essential elements used in the simulation of the processes of auditory cue generation and encoding by the nervous system include measured head-related transfer functions (HRTFs), minimum audible field (MAF), and the Patterson-Holdsworth cochlear model. A two-layer feed-forward back-propagation artificial neural network (ANN) was trained to transform the localization cues to a two-dimensional map that gives the direction of the sound source. The model results were compared with (i) the localization performance of the human listener who provided the HRTFs for the model and (ii) the localization performance of a group of 19 other human listeners. The localization accuracy and front-back confusion error rates exhibited by the model were similar to both the single listener and the group results. This suggests that the simulation of the cue generation and extraction processes as well as the model parameters were reasonable approximations to the overall biological processes. The amplitude resolution of the monaural spectral cues was varied and the influence on the model's performance was determined. The model with 128 cochlear channels required an amplitude resolution of approximately 20 discrete levels for encoding the spectral cue to deliver similar localization performance to the group of human listeners. 相似文献

10.

Parseihian G Katz BF 《The Journal of the Acoustical Society of America》2012,131(4):2948-2957

The paper reports on the ability of people to rapidly adapt in localizing virtual sound sources in both azimuth and elevation when listening to sounds synthesized using non-individualized head-related transfer functions (HRTFs). Participants were placed within an audio-kinesthetic Virtual Auditory Environment (VAE) platform that allows association of the physical position of a virtual sound source with an alternate set of acoustic spectral cues through the use of a tracked physical ball manipulated by the subject. This set-up offers a natural perception-action coupling, which is not limited to the visual field of view. The experiment consisted of three sessions: an initial localization test to evaluate participants' performance, an adaptation session, and a subsequent localization test. A reference control group was included using individual measured HRTFs. Results show significant improvement in localization performance. Relative to the control group, participants using non-individual HRTFs reduced localization errors in elevation by 10° with three sessions of 12 min. No significant improvement was found for azimuthal errors or for single session adaptation. 相似文献

11.

The influence of pinnae-based spectral cues on sound localization 总被引：1，自引：0，他引：1

A D Musicant R A Butler 《The Journal of the Acoustical Society of America》1984,75(4):1195-1200

The role of pinnae-based spectral cues was investigated by requiring listeners to locate sound, binaurally, in the horizontal plane with and without partial occlusion of their external ears. The main finding was that the high frequencies were necessary for optimal performance. When the stimulus contained the higher audio frequencies, e.g., broadband and 4.0-kHz high-pass noise, localization accuracy was significantly superior to that recorded for stimuli consisting only of the lower frequencies (4.0- and 1.0-kHz low-pass noise). This finding was attributed to the influence of the spectral cues furnished by the pinnae, for when the stimulus composition included high frequencies, pinnae occlusion resulted in a marked decline in localization accuracy. Numerous front-rear reversals occurred. Moreover, the ability to distinguish among sounds originating within the same quadrant also suffered. Performance proficiency for the low-pass stimuli was not further degraded under conditions of pinnae occlusion. In locating the 4.0-kHz high-pass noise when both, neither, or only one ear was occluded, the data demonstrated unequivocally that the pinna-based cues of the "near" ear contributed powerfully toward localization accuracy. 相似文献

12.

人工头近场头相关传输函数及其特性

下载免费PDF全文

余光正谢菠荪饶丹《声学学报》2012,37(4):378-385

采用球形正十二面体声源及其空间定位系统,测量并建立了KEMAR人工头的近场头相关传输函数(HRTF)数据库。基于数据库分析了近场HRTF在频域和时域随声源距离变化的规律;讨论了用近场HRTF算得的双耳声级差(TLD)和双耳时间差(ITD)所包含的声源距离定位信息。结果表明,测量系统和所得数据具有较好的重复性和准确性,保留了1 kHz以下的低频定位信息。并且,近场HRTF幅度谱和ILD随声源距离的变化明显;用相关法算得2 kHz以下频段的ITD随声源距离略有变化。本文数据库及其分析结果将为声源距离定位的应用提供基础。相似文献

13.

Transfer effects on sound localization performances from playing a virtual three-dimensional auditory game

Akio Honda Hiroshi Shibata Jiro Gyoba Kouji Saitou Yukio Iwaya Yôiti Suzuki 《Applied Acoustics》2007,68(8):885-896

Transfer effects of playing an auditory game with a virtual auditory display (VAD) were investigated. Furthermore, we analyzed the effects of playing the VAD game on sound localization performance under subjects’ own head-related transfer functions (HRTFs) and HRTFs fitted from those of 16 other adults. Participants performed sound localization tasks initially and 2 weeks later to show the effects. The VAD game players were of three groups, using own HRTFs, fitted HRTFs, and no playing (control). The VAD game-playing results revealed that: (1) the hit rate of the sound localization task for real sound sources increased approximately 20%; (2) the vertical and horizontal localization error decreased significantly; (3) sound localization performance using fitted HRTFs was similar to performance using own HRTFs. Follow-up tests revealed that transfer effects persisted more than 1 month, suggesting that the effects of playing the VAD game transfer to sound localization performance. 相似文献

14.

Spectral integration and wideband analysis in gap detection and overshoot paradigms

Hall JW Buss E Grose JH 《The Journal of the Acoustical Society of America》2007,122(6):3598-3608

Several listening conditions show that energy remote from a target frequency can deleteriously affect sensitivity. One interpretation of such results entails a wideband analysis involving a wide predetection filter. The present study tested the hypothesis that both temporal gap detection and overshoot results are consistent with a wideband analysis, as contrasted with statistical combination of information across independent channels. For gap detection, stimuli were random or comodulated 50-Hz-wide noise bands centered on 1000, 1932, 3569, and 6437 Hz. For overshoot, the masker was an 8-kHz low-pass filtered noise, with 5-ms tone bursts presented at the same center frequencies used for gap detection. Signals were presented with either 0- or 250-ms delay after masker onset. In each paradigm, the target was introduced at only one frequency or at all four frequencies. Results from gap detection conditions did not favor a wideband analysis interpretation: Results in the random condition were consistent with an optimal combination of cues across frequency. An across-channel interference effect was also evident when only one of the four bands contained the gap. Although results from the overshoot conditions were consistent with a wideband analysis interpretation, they were more parsimoniously accounted for in terms of statistical combination of information. 相似文献

15.

Narrow-band sound localization related to external ear acoustics. 总被引：3，自引：0，他引：3

J C Middlebrooks 《The Journal of the Acoustical Society of America》1992,92(5):2607-2624

Human subjects localized brief 1/6-oct bandpassed noise bursts that were centered at 6, 8, 10, and 12 kHz. All testing was done under binaural conditions. The horizontal component of subjects' responses was accurate, comparable to that for broadband localization, but the vertical and front/back components exhibited systematic errors. Specifically, responses tended to cluster within restricted ranges that were specific for each center frequency. The directional transfer functions of the subjects' external ears were measured for 360 horizontal and vertical locations. The spectra of the sounds that were present in the subjects' ear canals, the "proximal stimulus" spectra, were computed by combining the spectra of the narrow-band sound sources with the directional transfer functions for particular stimulus locations. Subjects consistently localized sounds to regions within which the associated directional transfer function correlated most closely with the proximal stimulus spectrum. A quantitative model was constructed that successfully predicted subjects' responses based on interaural level difference and spectral cues. A test of the model, using techniques adapted from signal detection theory, indicated that subjects tend to use interaural level difference and spectral shape cues independently, limited only by a slight spatial correlation of the two cues. A testing procedure is described that provides a quantitative comparison of various predictive models of sound localization. 相似文献

16.

The role of contrasting temporal amplitude patterns in the perception of speech

Healy EW Warren RM 《The Journal of the Acoustical Society of America》2003,113(3):1676-1688

Despite a lack of traditional speech features, novel sentences restricted to a narrow spectral slit can retain nearly perfect intelligibility [R. M. Warren et al., Percept. Psychophys. 57, 175-182 (1995)]. The current study employed 514 listeners to elucidate the cues allowing this high intelligibility, and to examine generally the use of narrow-band temporal speech patterns. When 1/3-octave sentences were processed to preserve the overall temporal pattern of amplitude fluctuation, but eliminate contrasting amplitude patterns within the band, sentence intelligibility dropped from values near 100% to values near zero (experiment 1). However, when a 1/3-octave speech band was partitioned to create a contrasting pair of independently amplitude-modulated 1/6-octave patterns, some intelligibility was restored (experiment 2). An additional experiment (3) showed that temporal patterns can also be integrated across wide frequency separations, or across the two ears. Despite the linguistic content of single temporal patterns, open-set intelligibility does not occur. Instead, a contrast between at least two temporal patterns is required for the comprehension of novel sentences and their component words. These contrasting patterns can reside together within a narrow range of frequencies, or they can be integrated across frequencies or ears. This view of speech perception, in which across-frequency changes in energy are seen as systematic changes in the temporal fluctuation patterns at two or more fixed loci, is more in line with the physiological encoding of complex signals. 相似文献

17.

Direction-dependent spectral properties of cat external ear: new data and cross-species comparisons 总被引：5，自引：0，他引：5

A D Musicant J C Chan J E Hind 《The Journal of the Acoustical Society of America》1990,87(2):757-781

Free-field to eardrum transfer functions were measured in anesthetized cats inside an anechoic chamber. Direction-dependent transformations were determined by measurement of sound-pressure levels using a small probe tube microphone surgically implanted in a ventral position near the tympanic membrane. Loudspeaker and probe microphone characteristics were eliminated by subtraction of the signal recorded in the free field with no animal present. Complexities of the transfer function, which include the presence of prominent spectral notches in the 8- to 18-kHz frequency region, are due primarily to the acoustical properties of the pinna. Differential amplification of frequency components within the broadband stimulus occurs as a function of source direction. Spectral features vary systematically with changes in both elevation (EL) and azimuth (AZ). The contrast between a notch and its shoulders is enhanced in the interaural spectral records. Spectral data from single source locations and spatial data for single frequencies at many locations are presented and comparisons with other species are drawn. It is suggested that spectral features in the 8- to 18-kHz region provide some of the necessary spectral information for sound localization and that the contrast in spectral energy between the frequencies at the notch and its shoulders is a potential directional cue. 相似文献

18.

Sound localization with monocular vision

Sharon M. Abel Christine Tikuisis 《Applied Acoustics》2005,66(8):932-944

An experiment was carried out to determine whether sudden loss of vision in one eye would result in a bias in sound localization in the direction of the viewing eye. Fifteen normal-sighted young adults were tested binocularly and with the right or left eye covered. Within each vision condition, sound localization was assessed using three different arrays of six loudspeakers, positioned frontally and on the right and left sides of space, in combination with two stimuli, a one-third octave noise band centred at 4 kHz and broadband noise. These assessed the utilization of mainly the interaural level difference cue and binaural and spectral cues in combination, respectively. One block of 90 speaker identification trials was presented for each of the 18 conditions. For the lateral arrays in combination with the broadband noise stimulus, monocular vision resulted in decreased accuracy on the contralateral side. Errors were in the direction of the viewing eye. While monocularity resulted in performance decrements with the 4-kHz stimulus, the error pattern was not consistent. These results support the hypothesis of visually guided auditory adaptation of binaural and spectral cues in combination in response to sudden deprivation of vision in one eye. 相似文献

19.

不同测量对头相关传输函数的听觉影响

下载免费PDF全文

钟小丽徐秀《声学学报》2018,43(1):83-90

头相关传输函数(HRTF)是虚拟听觉重放的核心·目前,HRTF的实验室测量缺乏统一的规范。本文研究了不同测量对HRTF的听觉影响。首先提出了扩散场均衡的预处理方法,并对来自5个不同数据库的KEMAR假人的HRTF数据进行了扩散场均衡;然后,采用谱差异评估了不同数据库HRTF测量的频谱差异;最后,采用HRTF合成的虚拟声信号开展了一系列的主观听音实验,包括定位实验和区分实验·结果表明,扩散场均衡是一种有效的HRTF预处理方法,可以减小不同测量对HRTF频谱的影响;不同测量基本上不影响HRTF在12 kHz以下的定位效果,但对音色的影响较大,从而导致听觉上的可区分. 相似文献

20.

Estimation of HRTFs on the horizontal plane using physical features

Takanori Nishino Naoya Inoue Fumitada Itakura 《Applied Acoustics》2007,68(8):897-908

Sound localization can be controlled by using head related transfer functions (HRTFs), which are related to the size of the head, the ears and so on. Since HRTFs are characterized by source directions and subjects, it is necessary to conduct measurements in all directions for all subjects. However, such measurement is expensive and time-consuming. In this paper, we propose a simpler and more useful method that investigates the relationship between HRTFs and physical size by multiple regression analysis. The estimated HRTFs are evaluated by objective and subjective measures. For objective results, the average spectral distortion score is 4.0 dB in a bandwidth ranging from 0 to 8 kHz. Subjective results indicate no significant difference between the measured and the estimated HRTFs in that frequency range. These results support the hypothesis that the proposed method is effective for estimating HRTFs. 相似文献