Similar literature
1.
2.
A number of objective evaluation methods are currently used to quantify speech intelligibility in a built environment, including the speech transmission index (STI), rapid speech transmission index (RASTI), articulation index (AI), and the percent articulation loss of consonants (%ALCons). Certain software programs can quickly evaluate STI, RASTI, and %ALCons from a measured room impulse response. In this project, two impulse-response-based software packages (WinMLS and SIA-Smaart Acoustic Tools) were evaluated for their ability to determine intelligibility accurately. In four different spaces with background noise levels less than NC 45, speech intelligibility was measured via three methods: (1) with WinMLS 2000; (2) with SIA-Smaart Acoustic Tools (v4.0.2); and (3) from listening tests with humans. The study found that WinMLS measurements of speech intelligibility based on STI, RASTI, and %ALCons corresponded well with performance on the listening tests. SIA-Smaart results were correlated to human responses, but tended to under-predict intelligibility based on STI and RASTI, and over-predict intelligibility based on %ALCons.

3.
Although the speech transmission index (STI) is a well-accepted and standardized method for objective prediction of speech intelligibility in a wide range of environments and applications, it is essentially a monaural model. Advantages of binaural hearing in speech intelligibility are disregarded. In specific conditions, this leads to considerable mismatches between subjective intelligibility and the STI. A binaural version of the STI was developed based on interaural cross correlograms, which shows a considerably improved correspondence with subjective intelligibility in dichotic listening conditions. The new binaural STI is designed to be a relatively simple model, which adds only a few parameters to the original standardized STI and changes none of the existing model parameters. For monaural conditions, the outcome is identical to the standardized STI. The new model was validated on a set of 39 dichotic listening conditions, featuring anechoic, classroom, listening room, and strongly echoic environments. For these 39 conditions, speech intelligibility [consonant-vowel-consonant (CVC) word score] and binaural STI were measured. On the basis of these conditions, the relation between binaural STI and CVC word scores closely matches the STI reference curve (standardized relation between STI and CVC word score) for monaural listening. A better-ear STI appears to perform quite well in relation to the binaural STI model; the monaural STI performs poorly in these cases.

4.
During the acoustical design of, e.g., auditoria or open-plan offices, it is important to know how speech can be perceived in various parts of the room. Different objective methods have been developed to measure and predict speech intelligibility, and these have been extensively used in various spaces. In this study, two such methods were compared, the speech transmission index (STI) and the speech intelligibility index (SII). Also the simplification of the STI, the room acoustics speech transmission index (RASTI), was considered. These quantities are all based on determining an apparent speech-to-noise ratio on selected frequency bands and summing them using a specific weighting. For comparison, some data were needed on the possible differences of these methods resulting from the calculation scheme and also measuring equipment. Their prediction accuracy was also of interest. Measurements were made in a laboratory having adjustable noise level and absorption, and in a real auditorium. It was found that the measurement equipment, especially the selection of the loudspeaker, can greatly affect the accuracy of the results. The prediction accuracy of the RASTI was found acceptable, if the input values for the prediction are accurately known, even though the studied space was not ideally diffuse.

5.
Zhu Peisheng (祝培生), Zhu Guofeng (朱国风), Zhu Tong (朱彤). 《应用声学》 (Applied Acoustics), 2014, 33(5): 419-425
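In recent years, the rapid development of room acoustic simulation techniques has offered a potential solution for predicting the speech transmission index (STI). How valid this approach is, however, should be considered before the technique is used. This paper compares simulated and measured STI in three rooms. The results show that: when the simulation model is a reasonably accurate acoustic equivalent of the real space, the error in the STI computed with the room acoustic simulation software ODEON is small; changes in reverberation time may significantly affect the accuracy of the simulated STI when the background noise is high, and the difference between simulated and measured STI may grow as the reverberation time increases; changes in signal-to-noise ratio do not significantly affect the accuracy of the simulated STI; the simulated and measured impulse responses differ somewhat in spectrum, and their reflection sequences in the time domain also differ, but these differences have little effect on the simulated STI; signal-to-noise ratio errors, which arise rather easily during simulation, have a considerable effect on the simulated STI. Because many factors influence room acoustic simulation results, and establishing acoustic equivalence between the simulation model and the real space is rather complex, obtaining accurate STI predictions is difficult, especially for rooms without a real reference for calibration.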

6.
By systematically varying the amount of sound absorption, and the location of the sound-absorbing material in a simulated classroom, it was possible to assess the accuracy of the prediction of speech metrics in quite simple acoustical environments. Predictions of speech level, early-to-late sound ratios (C50) and speech transmission index (STI) values were obtained analytically and with two hybrid ray-based computer programs, RAYNOISE 3.0 and ODEON 4.1. The RAYNOISE predictions were accomplished with a purely specular reflection model and also with a calibrated diffuse reflection model. ODEON uses a parameter called transition order, TO, to change the reflection procedure from purely specular to diffuse for reflections that have orders higher than TO. A parametric study was conducted to determine the best transition order for the ODEON prediction of speech metrics. It was found that the analytical predictions of speech level and C50 were on average accurate to about 1 just-noticeable difference (jnd), whereas the analytical predictions of STI were on average within 2 jnd's. ODEON predictions of speech level, C50 and STI were on average within 2 jnd's. RAYNOISE predictions of C50 and STI with the specular model were on average within 2 jnd's. However, the RAYNOISE predictions of speech level, with both types of reflection models, and the RAYNOISE predictions of C50 and STI with the diffuse model had average errors greater than 2 jnd's. The effects of the sound-absorption treatments on the measured speech metric values are also discussed.

7.
Unattended background speech is a known source of cognitive and subjective distraction in open-plan offices. This study investigated whether the deleterious effects of background speech can be affected by room acoustic design that decreases speech intelligibility, as measured by the Speech Transmission Index (STI). The experiment was conducted in an open-plan office laboratory (84 m²) in which four acoustic conditions were physically built. Three conditions contained background speech. A quiet condition was included for comparison. The speech conditions differed in terms of the degree of absorption, screen height, desk isolation, and the level of masking sound. The speech sounds simulated an environment where phone conversations are heard from different locations varying in distance. Ninety-eight volunteers were tested. The presence of background speech had detrimental effects on the subjective perceptions of noise effects and on cognitive performance in short-term memory and working memory tasks. These effects were neither attenuated nor amplified within a three-hour working period. The reduction of the STI by room acoustic means decreased subjective disturbance, whereas the effects on cognitive performance were somewhat smaller than expected. The effects of room acoustic design on subjective distraction were stronger among noise-sensitive subjects, suggesting that they benefited more from acoustic improvements than non-sensitive subjects. The results imply that reducing the STI is beneficial for performance and acoustic satisfaction especially regarding speech coming from more distant desks. However, acoustic design does not sufficiently decrease the distraction caused by speech from adjacent desks.

8.
Ambient sound can impair verbal short-term memory performance. This finding is relevant to the acoustic optimization of open-plan offices. Two algorithmic approaches claim to model the impairment during a given sound condition. One model is based on the Speech Transmission Index (STI). The other approach relies on the hearing sensation fluctuation strength (F). Within the scope of our consulting activities, the approach based on F can hardly be applied, and the model based on the STI is often misinterpreted in terms of semanticity. Therefore, we put the two models to the test and elucidate the relevance of temporal–spectral variability and semanticity of background sound with regard to impairment of performance. A group of 24 subjects performed a short-term memory task and rated perceived annoyance during eight different speech and speech-like noise conditions, which varied with regard to STI and F. The empirical data are compared to the model predictions, which only partly cover the experimental results. Speech impairs performance more than all other sound conditions, and variable speech-like noise is more impairing than continuous speech-like noise. Sound masking with continuous speech-like noise provides relief from the negative effect of background speech. This positive effect is more pronounced if the signal-to-noise ratio is −3 dB(A) or even lower.

9.
A method for computing the speech transmission index (STI) using real speech stimuli is presented and evaluated. The method reduces the effects of some of the artifacts that can be encountered when speech waveforms are used as probe stimuli. Speech-based STIs are computed for conversational and clearly articulated speech in several noisy, reverberant, and noisy-reverberant environments and compared with speech intelligibility scores. The results indicate that, for each speaking style, the speech-based STI values are monotonically related to intelligibility scores for the degraded speech conditions tested. Therefore, the STI can be computed using speech probe waveforms and the values of the resulting indices are as good predictors of intelligibility scores as those derived from MTFs by theoretical methods.

10.
Previous work [Morimoto et al., J. Acoust. Soc. Am. 116, 1607-1613] showed that listening difficulty ratings can be used to evaluate speech transmission performance more exactly and sensitively than intelligibility. Meanwhile, speech transmission performance is usually evaluated using acoustical objective measures, which are directly associated with physical parameters of room acoustic design. However, the relationship between listening difficulty ratings and acoustical objective measures has not been investigated in detail. In the present study, a total of 96 impulse responses were used to investigate the relationship between listening difficulty ratings and several objective measures in unidirectional sound fields. The results of the listening test showed that (1) the correlation between listening difficulty ratings and speech transmission index (STI) is the strongest of all tested objective measures, and (2) A-weighted D(50), C(50), and center time, which are obtained from the impulse responses passed through an A-weighted filter, also strongly correlate with listening difficulty ratings, and their correlations with listening difficulty ratings are not statistically different from the correlation between listening difficulty ratings and STI.
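
The measures D(50), C(50), and center time named in this abstract can all be derived from the energy of a room impulse response. The following is a minimal sketch using the common textbook definitions, not the authors' procedure: the function name `clarity_metrics` is hypothetical, the impulse response is assumed to start at the arrival of the direct sound, and the A-weighting filter mentioned in the abstract is not applied.

```python
import numpy as np

def clarity_metrics(h, fs, split_ms=50.0):
    """Return (D50, C50 in dB, center time in s) for an impulse response.

    Assumptions (not from the abstract): h is a 1-D array sampled at fs Hz
    whose first sample coincides with the direct sound, and it contains
    energy both before and after the 50 ms split point.
    """
    energy = np.asarray(h, dtype=float) ** 2
    n_split = int(round(fs * split_ms / 1000.0))  # sample index of the 50 ms boundary
    early = energy[:n_split].sum()   # energy arriving in the first 50 ms
    late = energy[n_split:].sum()    # energy arriving after 50 ms
    total = early + late
    d50 = early / total                    # definition: early-to-total energy ratio
    c50 = 10.0 * np.log10(early / late)    # clarity: early-to-late ratio in dB
    t = np.arange(energy.size) / fs
    ts = (t * energy).sum() / total        # center time: first moment of the energy
    return d50, c50, ts
```

For an ideal exponential decay with reverberation time T, this sketch gives C(50) close to 10·log10(exp(13.8 × 0.05 / T) − 1), which is a convenient sanity check before applying it to measured data.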

11.
This paper examines the accuracy of the speech transmission index (STI) calculated from the reverberation time (T) and signal-to-noise ratio (LSN) of enclosed spaces. Differences between measured and predicted STIs have been analysed in two rooms (reverberant vs. absorbent), for a wide range of absorption conditions and signal-to-noise ratios (sixteen tests). The STI was measured using maximum length sequence analysis and predictions were calculated using either measured or predicted values of T and LSN, the latter assuming diffuse sound field conditions. The results obtained for all the conditions tested showed that STI predictions based on T and LSN tend to underestimate the STI, with differences between measured and predicted STIs always lower than 0.1 (on a 0.0–1.0 scale), and on average lower than 0.06. According to previous research, these differences are noticeable and therefore non-negligible, as 0.03 is the just noticeable difference in STI. The use of either measured or predicted values of T and LSN provided similar STI predictions (i.e. non-noticeable changes), with differences between predictions that are on average lower than 0.03 for the absorbent room, and lower than 0.01 for the reverberant room.
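
A prediction of the STI from T and LSN is commonly based on the diffuse-field modulation transfer function, m(F) = [1 + (2πFT/13.8)²]^(−1/2) · [1 + 10^(−LSN/10)]^(−1). The sketch below illustrates that textbook calculation and is not necessarily the exact scheme used in the paper. It assumes a single T and LSN for all octave bands, so the band weighting, auditory masking, and threshold corrections of the standardized method are omitted; the function name `sti_from_t_and_snr` is hypothetical.

```python
import numpy as np

# The 14 one-third-octave modulation frequencies (Hz) conventionally used for the STI.
MOD_FREQS_HZ = np.array([0.63, 0.8, 1.0, 1.25, 1.6, 2.0, 2.5,
                         3.15, 4.0, 5.0, 6.3, 8.0, 10.0, 12.5])

def sti_from_t_and_snr(t_s, snr_db):
    """Simplified single-band STI estimate from reverberation time and SNR.

    Assumptions: diffuse sound field, the same t_s (seconds) and snr_db (dB)
    in every octave band, no masking or hearing-threshold corrections.
    """
    # Modulation reduction caused by reverberation (exponential decay).
    m_reverb = 1.0 / np.sqrt(1.0 + (2.0 * np.pi * MOD_FREQS_HZ * t_s / 13.8) ** 2)
    # Modulation reduction caused by stationary background noise.
    m_noise = 1.0 / (1.0 + 10.0 ** (-snr_db / 10.0))
    # Keep m strictly inside (0, 1) so the logarithm below stays finite.
    m = np.clip(m_reverb * m_noise, 1e-6, 1.0 - 1e-6)
    # Apparent signal-to-noise ratio, limited to the range -15..+15 dB.
    snr_app = np.clip(10.0 * np.log10(m / (1.0 - m)), -15.0, 15.0)
    # Transmission index per modulation frequency, averaged into one value.
    return float(np.mean((snr_app + 15.0) / 30.0))
```

With no reverberation and LSN = 0 dB, every modulation index is 0.5, the apparent signal-to-noise ratio is 0 dB, and the sketch returns 0.5, the midpoint of the 0.0–1.0 scale.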

12.
The Speech Transmission Index (STI) is a physical metric that is well correlated with the intelligibility of speech degraded by additive noise and reverberation. The traditional STI uses modulated noise as a probe signal and is valid for assessing degradations that result from linear operations on the speech signal. Researchers have attempted to extend the STI to predict the intelligibility of nonlinearly processed speech by proposing variations that use speech as a probe signal. This work considers four previously proposed speech-based STI methods and four novel methods, studied under conditions of additive noise, reverberation, and two nonlinear operations (envelope thresholding and spectral subtraction). Analyzing intermediate metrics in the STI calculation reveals why some methods fail for nonlinear operations. Results indicate that none of the previously proposed methods is adequate for all of the conditions considered, while four proposed methods produce qualitatively reasonable results and warrant further study. The discussion considers the relevance of this work to predicting the intelligibility of cochlear-implant processed speech.

13.
The reliability of algorithms for room acoustic simulations has often been confirmed on the basis of the verification of predicted room acoustical parameters. This paper presents a complementary perceptual validation procedure consisting of two experiments, dealing respectively with speech intelligibility and with sound source front–back localisation. The evaluated simulation algorithm, implemented in the ODEON® software, is a hybrid method that is based on an image source algorithm for the prediction of early sound reflections and on ray-tracing for the later part, using a stochastic scattering process with secondary sources. The binaural room impulse response (BRIR) is calculated from a simulated room impulse response where information about the arrival time, intensity and spatial direction of each sound reflection is collected and convolved with a measured Head Related Transfer Function (HRTF). The listening stimuli for the speech intelligibility and localisation tests are auralised convolutions of anechoic sound samples with measured and simulated BRIRs. Perception tests were performed with human subjects in two acoustical environments, i.e. an anechoic and a reverberant room, by presenting the stimuli to subjects in a natural way, and via headphones by using two non-individualized HRTFs (artificial head and hearing aids placed on the ears of the artificial head) of both a simulated and a real room. Very good correspondence is found between the results obtained with simulated and measured BRIRs, both for speech intelligibility in the presence of noise and for sound source localisation tests. In the anechoic room an increase in speech intelligibility is observed when noise and signal are presented from sources located at different angles. This improvement is not so evident in the reverberant room, with the sound sources at 1-m distance from the listener. Interestingly, people's front–back localisation performance is better in the reverberant room than in the anechoic room. The correlation between people’s ability for sound source localisation on the one hand, and their ability to recognise binaurally received speech in reverberation on the other hand, is found to be weak.

14.
While the Speech Transmission Index (STI) is widely applied for prediction of speech intelligibility in room acoustics and telecommunication engineering, it is unclear how to interpret STI values when non-native talkers or listeners are involved. Based on subjectively measured psychometric functions for sentence intelligibility in noise, for populations of native and non-native communicators, a correction function for the interpretation of the STI is derived. This function is applied to determine the appropriate STI ranges with qualification labels ("bad"-"excellent"), for specific populations of non-natives. The correction function is derived by relating the non-native psychometric function to the native psychometric function by a single parameter (nu). For listeners, the nu parameter is found to be highly correlated with linguistic entropy. It is shown that the proposed correction function is also valid for conditions featuring bandwidth limiting and reverberation.

15.
The study of mosque acoustics, with regard to acoustical characteristics, sound quality for speech intelligibility, and other applicable acoustic criteria, has been largely neglected. This study gives background on why mosques are designed as they are and how mosque design is influenced by worship considerations. The acoustical characteristics of typically constructed contemporary mosques in Saudi Arabia have been investigated, employing a well-known impulse response. Extensive field measurements were taken in 21 representative mosques of different sizes and architectural features in order to characterize their acoustical quality and to identify the impact of air conditioning, ceiling fans, and sound reinforcement systems on their acoustics. Objective room-acoustic indicators such as reverberation time (RT) and clarity (C50) were measured. Background noise (BN) was assessed with and without the operation of air conditioning and fans. The speech transmission index (STI) was also evaluated with and without the operation of existing sound reinforcement systems. The existence of acoustical deficiencies was confirmed and quantified. The study, in addition to describing mosque acoustics, compares design goals to results obtained in practice and suggests acoustical target values for mosque design. The results show that acoustical quality in the investigated mosques deviates from optimum conditions when unoccupied, but is much better in the occupied condition.

16.
The present study is concerned with the convolutive Blind Source Separation (BSS) of sound sources that leads to a significant speech intelligibility enhancement. Two experiments were conducted. In the first experiment, two different algorithms of convolutive BSS were compared. Both methods are based on second-order statistics, since such an approach is simple and gives satisfactory performance. The data resulting from this experiment suggested that different approaches yield different speech intelligibility improvements. In the second experiment, the influence of the spatial configuration of the cardioid microphones on the BSS performance was measured. It was revealed that the best separation for a considered spatial configuration can be obtained when microphones are directed alternately.

17.
The mathematical formulation of the continuum approach to radiative transfer modeling in two-phase semi-transparent media is numerically validated by comparing radiative fluxes computed by (i) direct, discrete-scale and (ii) continuum-scale approaches. The analysis is based on geometrical optics. The discrete-scale approach uses the Monte Carlo ray-tracing applied directly to real 3D geometry measured by computed tomography. The continuum-scale approach is based on a set of continuum-scale radiative transfer equations and associated radiative properties, and employs the Monte Carlo ray-tracing for computations of radiative fluxes and for computations of the radiative properties. The model two-phase media are reticulate porous ceramics and a particle packed bed, each composed of semitransparent solid and fluid phases. The results obtained by the two approaches are in good agreement within the limits of statistical uncertainty. The continuum-scale approach leads to a reduction in computational time by approximately one order of magnitude, and is therefore suited to treat radiative transfer problems in two-phase media in a wide range of engineering applications.

18.
Speech transmission index (STI) is an objective measure of the acoustic properties of office environments and is used to specify norms for acceptable acoustic work conditions. Yet, the tasks used to evaluate the effects of varying STIs on work performance have often focused on memory (such as memory of visually presented words) and reading tasks and may not give a complete view of the severity even of low STI values (i.e., when speech intelligibility is low). Against this background, we used a more typical office-work task in the present study. The participants were asked to write short essays (5 min per essay) in 5 different STI conditions (0.08; 0.23; 0.34; 0.50; and 0.71). Writing fluency dropped drastically and the number of pauses longer than 5 s increased at STI values above 0.23. This study shows that realistic work-related performance drops even at low STI values and has implications for how to evaluate acoustic conditions in school and office environments.

19.
20.
The purpose of the present study was to compare the speech performance of four types of alaryngeal phonation, namely electrolaryngeal (EL), pneumatic artificial laryngeal (PA), tracheoesophageal (TE), and standard esophageal (SE) speech, by adult Cantonese-speaking laryngectomees. Subjective ratings of (1) voice quality, (2) articulation proficiency, (3) quietness of speech, (4) pitch variability, and (5) overall speech intelligibility were given by eight naive individuals who had no prior experience with any form of alaryngeal speech. Results indicated that SE and TE speech was perceived to be more hoarse than PA and EL speech. EL speech was associated with significantly less pitch variability, and PA speakers produced speech with the least amount of perceived noise. However, articulation proficiency and overall speech intelligibility were found to be comparable in all four types of alaryngeal speakers.
