首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到10条相似文献,搜索用时 171 毫秒
1.
The reliability of algorithms for room acoustic simulations has often been confirmed on the basis of the verification of predicted room acoustical parameters. This paper presents a complementary perceptual validation procedure consisting of two experiments, respectively dealing with speech intelligibility, and with sound source front–back localisation.The evaluated simulation algorithm, implemented in software ODEON®, is a hybrid method that is based on an image source algorithm for the prediction of early sound reflection and on ray-tracing for the later part, using a stochastic scattering process with secondary sources. The binaural room impulse response (BRIR) is calculated from a simulated room impulse response where information about the arriving time, intensity and spatial direction of each sound reflection is collected and convolved with a measured Head Related Transfer Function (HRTF). The listening stimuli for the speech intelligibility and localisation tests are auralised convolutions of anechoic sound samples with measured and simulated BRIRs.Perception tests were performed with human subjects in two acoustical environments, i.e. an anechoic and reverberant room, by presenting the stimuli to subjects in a natural way, and via headphones by using two non-individualized HRTFs (artificial head and hearing aids placed on the ears of the artificial head) of both a simulated and a real room.Very good correspondence is found between the results obtained with simulated and measured BRIRs, both for speech intelligibility in the presence of noise and for sound source localisation tests. In the anechoic room an increase in speech intelligibility is observed when noise and signal are presented from sources located at different angles. This improvement is not so evident in the reverberant room, with the sound sources at 1-m distance from the listener. Interestingly, the performance of people for front–back localisation is better in the reverberant room than in the anechoic room.The correlation between people’s ability for sound source localisation on one hand, and their ability for recognition of binaurally received speech in reverberation on the other hand, is found to be weak.  相似文献   

2.
This paper presents the results of new studies based on speech intelligibility tests in simulated sound fields and analyses of impulse response measurements in rooms used for speech communication. The speech intelligibility test results confirm the importance of early reflections for achieving good conditions for speech in rooms. The addition of early reflections increased the effective signal-to-noise ratio and related speech intelligibility scores for both impaired and nonimpaired listeners. The new results also show that for common conditions where the direct sound is reduced, it is only possible to understand speech because of the presence of early reflections. Analyses of measured impulse responses in rooms intended for speech show that early reflections can increase the effective signal-to-noise ratio by up to 9 dB. A room acoustics computer model is used to demonstrate that the relative importance of early reflections can be influenced by the room acoustics design.  相似文献   

3.
The speech intelligibility in classroom can be influenced by background-noise levels, speech sound pressure level (SSPL), reverberation time and signal-to-noise ratio (SNR). The relationship between SSPL and subjective Chinese Mandarin speech intelligibility and the effect of different SNRs on Chinese Mandarin speech intelligibility in the simulated classroom were investigated through room acoustical simulation, auralisation technique and subjective evaluation. Chinese speech intelligibility test signals recorded in anechoic chamber were convolved with the simulated binaural room impulse responses, and then reproduced through the headphone by different SSPLs and SNRs. The results show that Chinese Mandarin speech intelligibility scores increase with increasing of SSPLs and SNRs within a certain range in simulated classrooms. Chinese Mandarin speech intelligibility scores have no significant difference with SNRs of no less than 15 dBA under the same reverberation time condition.  相似文献   

4.
A number of objective evaluation methods are currently used to quantify the speech intelligibility in a built environment, including the speech transmission index (STI), rapid speech transmission index (RASTI), articulation index (AI), and the percent articulation loss of consonants (%ALCons). Certain software programs can quickly evaluate STI, RASTI, and %ALCons from a measured room impulse response. In this project, two impulse-response-based software packages (WinMLS and SIA-Smaart Acoustic Tools) were evaluated for their ability to determine intelligibility accurately. In four different spaces with background noise levels less than NC 45, speech intelligibility was measured via three methods: (1) with WinMLS 2000; (2) with SIA-Smaart Acoustic Tools (v4.0.2); and (3) from listening tests with humans. The study found that WinMLS measurements of speech intelligibility based on STI, RASTI, and %ALCons corresponded well with performance on the listening tests. SIA-Smaart results were correlated to human responses, but tended to under-predict intelligibility based on STI and RASTI, and over-predict intelligibility based on %ALCons.  相似文献   

5.
The binaural auralization of a 3D sound field using spherical-harmonics beamforming (SHB) techniques was investigated and compared with the traditional method using a head-and-torso simulator (HATS). The new procedure was verified by comparing simulated room impulse responses with measured ones binaurally. The objective comparisons show that there is good agreement in the frequency range between 0.1 and 6.4 kHz. A listening experiment was performed to validate the SHB method subjectively and to compare it to the HATS method. Two musical excerpts, pop and classical, were used. Subjective responses were collected in two head rotation conditions (fixed and rotating) and six spatial reproduction modes, including phantom mono, stereo, and surround sound. The results show that subjective scales of width, spaciousness, and preference based on the SHB method were similar to those obtained for the HATS method, although the width and spaciousness of the stimuli processed by the SHB method were judged slightly higher than the ones using the HATS method in general. Thus, binaural synthesis using SHB may be a useful tool to reproduce a 3D sound field binaurally, while saving considerably on measurement time because head rotation can be simulated based on a single recording.  相似文献   

6.
When speech is in competition with interfering sources in rooms, monaural indicators of intelligibility fail to take account of the listener's abilities to separate target speech from interfering sounds using the binaural system. In order to incorporate these segregation abilities and their susceptibility to reverberation, Lavandier and Culling [J. Acoust. Soc. Am. 127, 387-399 (2010)] proposed a model which combines effects of better-ear listening and binaural unmasking. A computationally efficient version of this model is evaluated here under more realistic conditions that include head shadow, multiple stationary noise sources, and real-room acoustics. Three experiments are presented in which speech reception thresholds were measured in the presence of one to three interferers using real-room listening over headphones, simulated by convolving anechoic stimuli with binaural room impulse-responses measured with dummy-head transducers in five rooms. Without fitting any parameter of the model, there was close correspondence between measured and predicted differences in threshold across all tested conditions. The model's components of better-ear listening and binaural unmasking were validated both in isolation and in combination. The computational efficiency of this prediction method allows the generation of complex "intelligibility maps" from room designs.  相似文献   

7.
J.H. Wang  C.S. Pai 《Applied Acoustics》2003,64(12):1141-1158
The binaural room impulse responses (BRIRs) can be applied to 3-D sound field reconstruction, virtual reality, noise control, et al. Because the BRIRs are non-minimum phase functions, it is difficult to find the exact inverse functions of the BRIRs, especially when there are two or more sources in a reverberant space. In this work, a method was proposed to find the inverse functions of BRIRs with two sound sources in a reverberant space. The concept of time delays and the method of weighted least squares were used to find the causal, however, approximate inverse functions. The accuracy of the inverse functions was first evaluated objectively by a dummy head system. The result shows that the distortion due to crosstalk and room reverberation can be improved by 16∼18 dB. The inverse functions were also verified subjectively by 20 students. The result of subjective evaluation also shows that the inverse functions can be used successfully to reduce the crosstalk effect and the room reverberation.  相似文献   

8.
Speech reception thresholds were measured to investigate the influence of a room on speech segregation between a spatially separated target and interferer. The listening tests were realized under headphones. A room simulation allowed selected positioning of the interferer and target, as well as varying the absorption coefficient of the room internal surfaces. The measurements involved target sentences and speech-shaped noise or 2-voice interferers. Four experiments revealed that speech segregation in rooms was not only dependent on the azimuth separation of sound sources, but also on their direct-to-reverberant energy ratio at the listening position. This parameter was varied for interferer and target independently. Speech intelligibility decreased as the direct-to-reverberant ratio of sources was degraded by sound reflections in the room. The influence of the direct-to-reverberant ratio of the interferer was in agreement with binaural unmasking theories, through its effect on interaural coherence. The effect on the target occurred at higher levels of reverberation and was explained by the intrinsic degradation of speech intelligibility in reverberation.  相似文献   

9.
Inaccuracies in computation and auralization of room impulse responses are related in part to inadequate modeling of edge diffraction, i.e., the scattering from edges of finite surfaces. A validated time-domain model (based on analytical extensions to the Biot-Tolstoy-Medwin technique) is thus employed here to compute early room impulse responses with edge diffraction. Furthermore, the computations are extended to include combinations of specular and diffracted paths in the example problem of a stage-house. These combinations constitute a significant component of the total nonspecular scattering and also help to identify edge diffraction in measured impulse responses. The computed impulse responses are then convolved with anechoic signals with a variety of time-frequency characteristics. Initial listening tests with varying orders and combinations of diffraction suggest that (1) depending on the input signal, the diffraction contributions can be clearly audible even in nonshadow zones for this conservative open geometry and (2) second-order diffraction to nonshadowed receivers can often be neglected. Finally, a practical implementation for binaural simulation is proposed, based on the singular behavior of edge diffraction along the least-time path for a given source-edge-receiver orientation. This study thus provides a first major step toward computing edge diffraction for more accurate room acoustics auralization.  相似文献   

10.
The previous work [Morimoto et al., J. Acoust. Soc. Am. 116, 1607-1613] showed that listening difficulty ratings can be used to evaluate speech transmission performance more exactly and sensitively than intelligibility. Meanwhile, speech transmission performance is usually evaluated using acoustical objective measures, which are directly associated with physical parameters of room acoustic design. However, the relationship between listening difficulty ratings and acoustical objective measures was not minutely investigated. In the present study, a total of 96 impulse responses were used to investigate the relationship between listening difficulty ratings and several objective measures in unidirectional sound fields. The result of the listening test showed that (1) the correlation between listening difficulty ratings and speech transmission index (STI) is the strongest of all tested objective measures, and (2) A-weighted D(50), C(50), and center time, which are obtained from the impulse responses passed through an A-weighted filter, also strongly correlate with listening difficulty ratings, and their correlations with listening difficulty ratings are not statistically different from the correlation between listening difficulty ratings and STI.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号