首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 31 毫秒
1.
谢菠荪  刘路路  江建亮 《声学学报》2021,46(6):1223-1233
双耳重放的目标之一是在耳机重放中产生不同方向和距离的虚拟源感知。本文研究了动态双耳Ambisonics重放自由场虚拟源方向和距离信息的简化信号处理方法。该信号处理方法包括两步:第1步是基于目标声场的球谐函数分解,合成采用扬声器的近场Ambisonics重放中逐级重构目标声场的信号;第2步是采用虚拟扬声器重放的方法,用动态头相关函数滤波处理将Ambisonics的扬声器重放信号转换为双耳重放信号并用耳机重放。进一步研究了动态双耳Ambisonics的阶数对定位效果的影响,为简化信号处理提供依据。对重放产生的双耳声压分析表明,5阶动态双耳Ambisonics重放足以提供听觉方向定位和距离感知的重要信息。同时心理声学的实验结果表明,结合声源距离相关的响度因素,5阶动态双耳Ambisonics重放可产生不同方向和1.0 m以下不同近场距离的自由场虚拟源的听觉感知。本文的方法仅需要固定距离的48个均匀空间方向的远场非个性化HRTF处理,实现了信号处理的简化。   相似文献   

2.
适当均衡耳机到鼓膜的传递函数可有效提高耳机声重放效果。耳廓与耳道滤波效应引起的幅度峰谷有助于人耳听觉感知,以平直幅频响应为目标的幅度均衡无法保持适当的峰谷。该文提出了基于roex滤波器与Mel频率倒谱系数的耳机到鼓膜的传递函数平滑方法,用于模拟人耳听觉感知特性和平滑耳机到鼓膜的传递函数,使均衡后的幅频响应保持相应的峰谷,避免了幅度峰谷过渡均衡。实验结果表明,进行耳机到鼓膜的传递函数平滑的幅度均衡对提高耳机的音色有显著作用,基于Mel频率倒谱系数平滑的幅度均衡对提高耳机的音色最为显著。  相似文献   

3.
Ambisonics is a series of spatial sound reproduction system based on spatial harmonics decomposition and each order approximation of sound field. Ambisonics signals are originally intended for loudspeakers reproduction. By using head-related transfer functions (HRTFs) filters, binaural Ambisonics converts the Ambisonics signals for static or dynamic headphone reproduction. In present work, the performances of static and dynamic binaural Ambisonics reproduction are evaluated and compared. The mean binaural pressure errors across target source directions are first analyzed. Then a virtual source localization experiment is conducted, and the localization performances are evaluated by analyzing the percentages of front-back and up-down confusion, the mean angle error and discreteness in the localization results. The results indicate that binaural Ambsonics reproduction with insufficiently high order (for example, 5-10 order) is unable to recreate correct high-frequency magnitude spectra in binaural pressures, resulting in degradation in localization for static reproduction. Because dynamic localization cue is included, dynamic binaural Ambisoncis reproduction yields obviously better localization performance than static reproduction with the same order. Even a 3-order dynamic binaural Ambisoncis reproduction exhibits appropriate localizations performance.  相似文献   

4.
均衡耳机到鼓膜的传递函数可有效提高耳机声重放效果。通过主客观实验对比研究幅度均衡方法、相位均衡方法和幅度相位同时均衡方法,结果表明相比于幅度均衡,相位均衡对耳机瞬态响应的影响更大:当系统相位为线性时,瞬态响应衰减越快,线性相位对耳机重放提升音色有显著作用;而以平直幅频响应为目标的幅度均衡对提升耳机重放效果有限。  相似文献   

5.
A scheme for analyzing the timbre in spatial sound with binaural auditory model is proposed and the Ambisonics is taken as an example for analysis.Ambisonics is a spatial sound system based on physical sound field reconstruction.The errors and timbre colorations in the final reconstructed sound field depend on the spatial aliasing errors on both the recording and reproducing stages of Ambisonics.The binaural loudness level spectra in Ambisonics reconstruction is calculated by using Moore's revised loudness model and then compared with the result of real sound source,so as to evaluate the timbre coloration in Ambisonics quantitatively.The results indicate that,in the case of ideal independent signals,the high-frequency limit and radius of region without perceived timbre coloration increase with the order of Ambisonics.On the other hand,in the case of recording by microphone array,once the high-frequency limit of microphone array exceeds that of sound field reconstruction,array recording influences little on the binaural loudness level spectra and thus timbre in final reconstruction up to the highfrequency limit of reproduction.Based on the binaural auditory model analysis,a scheme for optimizing design of Ambisonics recording and reproduction is also suggested.The subjective experiment yields consistent results with those of binaural model,thus verifies the effectiveness of the model analysis.  相似文献   

6.
刘阳  谢菠荪 《声学学报》2015,40(5):717-729
提出用双耳听觉模型对空间声音色进行分析的普遍方法,并以Ambisonics为例进行了分析。Ambisonics是基于物理声场重构的空间声系统,其最终重构声场误差以及音色改变是由传声器捡拾和重放空间混叠误差共同引起的。采用修正的Moore双耳响度模型计算了Ambisonics重构声场的双耳响度级谱并和目标声场的情况比较,从而定量评价重构声场的音色改变。结果表明,在理想捡拾信号的情况下,无音色改变重放的上限频率和区域大小随Ambisonics的阶数而增加。而对于传声器阵列捡拾的情况,只要阵列的上限频率大于Ambisonics重放的上限频率,在重放的上限频率以下,传声器阵列空间混叠误差对最终重构声场及其感知音色的影响就可以忽略。在此基础上,提出了一种综合考虑捡拾与重放性能的Ambisonics系统优化设计方法。心理声学实验得到了和双耳听觉模型一致的结果,从而也验证了模型分析的有效性。   相似文献   

7.
钟小丽  谢菠荪 《应用声学》2012,31(6):410-415
虚拟听觉重放采用头相关传输函数(HRTF)合成双耳声信号,并用耳机重放,以产生所需的空间听觉事件。理想的虚拟听觉重放需要个性化HRTF。个性化HRTF可通过实验测量或数值计算相对地准确获得。然而,测量每个潜在使用者的高空间分辨率HRTF是困难的,而数值计算HRTF的频段往往受限于计算机性能。近年发展了多种HRTF的近似获取方法,并成为热门研究课题,但效果有待验证和提高。本文评述了个性化HRTF近似的研究进展,指出了存在的问题和今后的方向。  相似文献   

8.
Auditory functional magnetic resonance imaging (fMRI) requires quantification of sound stimuli in the magnetic environment and adequate isolation of background noise. We report the development of two novel sound measurement systems that accurately measure the sound intensity inside the ear, which can simultaneously provide the similar or greater amount of scanner- noise protection than ear-muffs. First, we placed a 2.6 x 2.6-mm microphone in an insert phone that was connected to a headphone [microphone-integrated, foam-tipped insert-phone with a headphone (MIHP)]. This attenuated scanner noise by 37.8+/-4.6 dB, a level better than the reference amount obtained using earmuffs. The nonmetallic optical microphone was integrated with a headphone [optical microphone in a headphone (OMHP)] and it effectively detected the change of sound intensity caused by variable compression on the cushions of the headphone. Wearing the OMHP reduced the noise by 28.5+/-5.9 dB and did not affect echoplanar magnetic resonance images. We also performed an auditory fMRI study using the MIHP system and presented increase in the auditory cortical activation following 10-dB increment in the intensity of sound stimulation. These two newly developed sound measurement systems successfully achieved the accurate quantification of sound stimuli with maintaining the similar level of noise protection of wearing earmuffs in the auditory fMRI experiment.  相似文献   

9.
Animals live in cluttered auditory environments, where sounds arrive at the two ears through several paths. Reflections make sound localization difficult, and it is thought that the auditory system deals with this issue by isolating the first wavefront and suppressing later signals. However, in many situations, reflections arrive too early to be suppressed, for example, reflections from the ground in small animals. This paper examines the implications of these early reflections on binaural cues to sound localization, using realistic models of reflecting surfaces and a spherical model of diffraction by the head. The fusion of direct and reflected signals at each ear results in interference patterns in binaural cues as a function of frequency. These cues are maximally modified at frequencies related to the delay between direct and reflected signals, and therefore to the spatial location of the sound source. Thus, natural binaural cues differ from anechoic cues. In particular, the range of interaural time differences is substantially larger than in anechoic environments. Reflections may potentially contribute binaural cues to distance and polar angle when the properties of the reflecting surface are known and stable, for example, for reflections on the ground.  相似文献   

10.
在两扬声器虚拟声重放中,通过精确重构双耳声压而产生不同的空间听觉感知。其重放的定位性能应该是由双耳声压控制的代价和稳定性所共同决定的。过去研究主要对双耳声压控制的稳定性进行分析,并以此作为扬声器布置和信号处理的依据。该文研究表明仅对双耳声压的稳定性分析是不足以完全衡量扬声器虚拟声重放的定位性能的。进一步采用虚拟声信号处理滤波器响应平均功率对双耳声压控制的代价进行分析。结果表明,缩窄左右对称扬声器布置的张角或采用非对称扬声器布置会明显增加产生侧向目标虚拟源时的双耳声压控制代价。虚拟源(虚拟声像)定位实验表明,双耳声压控制代价增加会引起虚拟源定位缺陷。实际应用中,为了有效产生侧向虚拟源,应避免采用过窄张角(如立体声偶极)和非对称的扬声器布置。  相似文献   

11.
Major criteria for a successful binaural reproduction are not only a suitable localization performance, but also the authenticity and plausibility of the presented scene. It is therefore interesting to examine whether the binaural reproduction can be perceptually distinguished from a real source. The aim of the presented investigation is to compare the quality of the binaural reproduction via headphones with two different microphone setups (miniature microphone in Open-Dome and ear plug) for individual head-related-transfer-function (HRTF) and headphone-transfer-function (HpTF) measurements. Listening tests with a total of 80 subjects were carried out focusing on plausibility and authenticity. In the examination of plausibility detection rates showed that subjects were not able to match the reproduced pink noise to its reproduction system (real source vs. binaural reproduction via headphones). The authenticity of the static binaural reproduction was highly dependent on the stimulus. Pink noise could often be distinguished due to coloration in higher frequencies and small differences in location. A difference between microphone setups could not be found in neither of the listening tests.  相似文献   

12.
The potential of spherical-harmonics beamforming (SHB) techniques for the auralization of target sound sources in a background noise was investigated and contrasted with traditional head-related transfer function (HRTF)-based binaural synthesis. A scaling of SHB was theoretically derived to estimate the free-field pressure at the center of a spherical microphone array and verified by comparing simulated frequency response functions with directly measured ones. The results show that there is good agreement in the frequency range of interest. A listening experiment was conducted to evaluate the auralization method subjectively. A set of ten environmental and product sounds were processed for headphone presentation in three different ways: (1) binaural synthesis using dummy head measurements, (2) the same with background noise, and (3) SHB of the noisy condition in combination with binaural synthesis. Two levels of background noise (62, 72 dB SPL) were used and two independent groups of subjects (N=14) evaluated either the loudness or annoyance of the processed sounds. The results indicate that SHB almost entirely restored the loudness (or annoyance) of the target sounds to unmasked levels, even when presented with background noise, and thus may be a useful tool to psychoacoustically analyze composite sources.  相似文献   

13.
频率对环绕声声像定位的影响   总被引:3,自引:1,他引:2       下载免费PDF全文
本文考虑双耳相位差的高级近似,导出了中频情况下适用的具有更普遍意义的平面环绕声声像定位公式。在低频时该式将化为通常的环绕声声像定位公式,而随着声音频率的增加,声像位置将与频率有关。将新的公式用到方型排列和棱型排列的4-4-4环绕声系统,得到了同实验相一致的结果。文中着重指出,声像随频率而变化是导致环绕声重发中侧向声像不稳定的重要在而为今后改进环绕声系统提供了理论基础。  相似文献   

14.
张承云  谢菠荪 《应用声学》2016,35(4):283-287
为改善5.1通路环绕声的双耳重放性能,提出一种基于低价头踪迹跟踪模块的动态双耳重放方法。头踪迹跟踪模块通过单片机采集磁传感器、加速度传感器的输出数据,计算出倾听者头部水平方向信息,并将其经USB接口传给计算机进行动态双耳声信号合成。心理声学实验表明,本文提出的方法可以消除虚拟声源前后混乱和头中定位现象,提升5.1通路环绕声双耳重放的虚拟声源定位性能。  相似文献   

15.
The auditory system takes advantage of early reflections (ERs) in a room by integrating them with the direct sound (DS) and thereby increasing the effective speech level. In the present paper the benefit from realistic ERs on speech intelligibility in diffuse speech-shaped noise was investigated for normal-hearing and hearing-impaired listeners. Monaural and binaural speech intelligibility tests were performed in a virtual auditory environment where the spectral characteristics of ERs from a simulated room could be preserved. The useful ER energy was derived from the speech intelligibility results and the efficiency of the ERs was determined as the ratio of the useful ER energy to the total ER energy. Even though ER energy contributed to speech intelligibility, DS energy was always more efficient, leading to better speech intelligibility for both groups of listeners. The efficiency loss for the ERs was mainly ascribed to their altered spectrum compared to the DS and to the filtering by the torso, head, and pinna. No binaural processing other than a binaural summation effect could be observed.  相似文献   

16.
Headphone rendering of nearby virtual sound sources represents to date an open issue in 3-D audio, due to a number of technical challenges and temporal requirements involved in the measurement of individual Head-Related Transfer Functions (HRTFs). In order to tackle this problem, we propose a filter model of near-field effects based on the Distance Variation Function (Kan et al., 2009). Thanks to its simple structure and low order, the model can be applied to any far-field virtual auditory display to yield a realistic and computationally efficient near-field compensation of spectral and binaural effects. The model is subjectively evaluated in two psychophysical experiments where the relative distance of pairs of virtually rendered sound sources is judged. Results show that even though sound intensity overshadows subtler near-field effects when it is available as a cue for distance, the model is capable of offering relative distance information of near lateral virtual sources when intensity cues are removed. Furthermore, performances of the model in relative distance rendering are compared to those of alternative near-field rendering methods available in the literature.  相似文献   

17.
考虑头部转动带来的动态因素对听觉垂直定位的贡献,提出了前方空间环绕声的四扬声器虚拟重放方法。4个扬声器分别布置在水平面左前、右前以及高仰角的左前上、右前上方向,并采用听觉传输信号处理的方法将多通路空间环绕声信号转换为4个扬声器的重放信号。以9.1通路空间环绕声虚拟重放为例,采用头相关传输函数对双耳声压及其包含的定位因素进行分析表明,该方法可以产生正确的双耳时间差及其随头部转动的变化,从而产生合适的侧向定位双耳因素和垂直定位的动态因素。而心理声学实验结果表明,该方法可以重放稳定的前方空间的水平和垂直虚拟源。因此,四扬声器布置结合听觉传输处理足以重放前方空间环绕声的垂直定位信息,实现多通路空间环绕声的向下混合与简化。   相似文献   

18.
The auditory system encodes the timing of peaks in basilar-membrane motion with exquisite precision, and perceptual models of binaural processing indicate that the limit of temporal resolution in humans is as little as 10-20 microseconds. In these binaural studies, pairs of continuous sounds with microsecond differences are presented simultaneously, one sound to each ear. In this paper, a monaural masking experiment is described in which pairs of continuous sounds with microsecond time differences were combined and presented to both ears. The stimuli were matched in terms of the excitation patterns they produced, and a perceptual model of monaural processing indicates that the limit of temporal resolution in this case is similar to that in the binaural system.  相似文献   

19.
A commonly accepted physiological model for lateralization of low-frequency sounds by interaural time delay (ITD) stipulates that binaural comparison neurons receive input from frequency-matched channels from each ear. Here, the effects of hypothetical interaural frequency mismatches on this model are reported. For this study, the cat's auditory system peripheral to the binaural comparison neurons was represented by a neurophysiologically derived model, and binaural comparison neurons were represented by cross-correlators. The results of the study indicate that, for binaural comparison neurons receiving input from one cochlear channel from each ear, interaural CF mismatches may serve to either augment or diminish the effective difference in ipsilateral and contralateral axonal time delays from the periphery to the binaural comparison neuron. The magnitude of this increase or decrease in the effective time delay difference can be up to 400 microseconds for CF mismatches of 0.2 octaves or less for binaural neurons with CFs between 250 Hz and 2.5 kHz. For binaural comparison neurons with nominal CFs near 500 Hz, the 25-microsecond effective time delay difference caused by a 0.012-octave CF mismatch is equal to the ITD previously shown to be behaviorally sufficient for the cat to lateralize a low-frequency sound source.  相似文献   

20.
By analyzing the differences between binaural recording and real listening, it was deduced that there were some unrevealed auditory localization clues, and the sound pressure distribution pattern at the entrance of ear canal was probably a clue. It was proved through the listening test that the unrevealed auditory localization clues really exist with the reduction to absurdity. And the effective frequency bands of the unrevealed localization clues were induced and summed. The result of finite element based simulations showed that the pressure distribution at the entrance of ear canal was non-uniform, and the pattern was related to the direction of sound source. And it was proved that the sound pressure distribution pattern at the entrance of the ear canal carried the sound source direction information and could be used as an unrevealed localization clue. The frequency bands in which the sound pressure distribution patterns had significant differences between front and back sound source directions were roughly matched with the effective frequency bands of unrevealed localization clues obtained from the listening tests. To some extent, it supports the hypothesis that the sound pressure distribution pattern could be a kind of unrevealed auditory localization clues.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号