Similar literature
 20 similar documents found
1.
A number of objective evaluation methods are currently used to quantify the speech intelligibility in a built environment, including the speech transmission index (STI), rapid speech transmission index (RASTI), articulation index (AI), and the percent articulation loss of consonants (%ALCons). Certain software programs can quickly evaluate STI, RASTI, and %ALCons from a measured room impulse response. In this project, two impulse-response-based software packages (WinMLS and SIA-Smaart Acoustic Tools) were evaluated for their ability to determine intelligibility accurately. In four different spaces with background noise levels less than NC 45, speech intelligibility was measured via three methods: (1) with WinMLS 2000; (2) with SIA-Smaart Acoustic Tools (v4.0.2); and (3) from listening tests with humans. The study found that WinMLS measurements of speech intelligibility based on STI, RASTI, and %ALCons corresponded well with performance on the listening tests. SIA-Smaart results were correlated to human responses, but tended to under-predict intelligibility based on STI and RASTI, and over-predict intelligibility based on %ALCons.

2.
Although the speech transmission index (STI) is a well-accepted and standardized method for objective prediction of speech intelligibility in a wide range of environments and applications, it is essentially a monaural model. Advantages of binaural hearing in speech intelligibility are disregarded. In specific conditions, this leads to considerable mismatches between subjective intelligibility and the STI. A binaural version of the STI was developed based on interaural cross correlograms, which shows a considerably improved correspondence with subjective intelligibility in dichotic listening conditions. The new binaural STI is designed to be a relatively simple model, which adds only a few parameters to the original standardized STI and changes none of the existing model parameters. For monaural conditions, the outcome is identical to the standardized STI. The new model was validated on a set of 39 dichotic listening conditions, featuring anechoic, classroom, listening room, and strongly echoic environments. For these 39 conditions, speech intelligibility [consonant-vowel-consonant (CVC) word score] and binaural STI were measured. On the basis of these conditions, the relation between binaural STI and CVC word scores closely matches the STI reference curve (standardized relation between STI and CVC word score) for monaural listening. A better-ear STI appears to perform quite well in relation to the binaural STI model; the monaural STI performs poorly in these cases.

3.
A method for computing the speech transmission index (STI) using real speech stimuli is presented and evaluated. The method reduces the effects of some of the artifacts that can be encountered when speech waveforms are used as probe stimuli. Speech-based STIs are computed for conversational and clearly articulated speech in several noisy, reverberant, and noisy-reverberant environments and compared with speech intelligibility scores. The results indicate that, for each speaking style, the speech-based STI values are monotonically related to intelligibility scores for the degraded speech conditions tested. Therefore, the STI can be computed using speech probe waveforms and the values of the resulting indices are as good predictors of intelligibility scores as those derived from MTFs by theoretical methods.

4.
The paper presents the STI as a function of reverberation time. With this function, the speech transmission index can be estimated quickly from the room reverberation time alone. For that purpose we applied a known method that estimates speech intelligibility physically from the modulation transfer function (MTF) determined in a room. The STI was then described by a logarithmic function whose argument is the room reverberation time. To verify the model, the reverberation times of six rooms were measured. The rooms were deliberately chosen to differ widely in volume and shape; the selection included a small cuboid room, lecture halls and a church. The same rooms were then modeled in ODEON version 11.23 and their reverberation times were determined. Furthermore, the STI was determined in ODEON and compared with the STI obtained by fast estimation from the reverberation time. Statistical verification using the correlation index and a regression equation showed that the fast estimation yields results close to those obtained from the computer simulation in ODEON. The correlation index was close to 1, and the test probability below 0.05 indicates a statistically significant linear relation at the 95% confidence level.
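For orientation, the MTF-based chain that estimates of this kind usually rest on can be written out. This is the textbook diffuse-field relation for an ideal exponential decay with reverberation time T, not the fitted logarithmic function of the paper, whose coefficients are not given in the abstract. The unweighted mean over the N modulation frequencies F is a simplifying assumption; the standardized procedure additionally weights octave bands.

```latex
m(F) = \frac{1}{\sqrt{1 + \left( \dfrac{2\pi F\,T}{13.8} \right)^{2}}},
\qquad
\mathrm{SNR}_{\mathrm{app}}(F) = 10\log_{10}\frac{m(F)}{1 - m(F)}
\quad \text{(limited to } \pm 15\ \mathrm{dB}\text{)},
\qquad
\mathrm{STI} \approx \frac{1}{N}\sum_{F}\frac{\mathrm{SNR}_{\mathrm{app}}(F) + 15}{30}
```

Because m(F) falls as T grows, the resulting STI decreases monotonically with reverberation time, which is consistent with describing it by a single decreasing function of T.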

5.
The characteristics of sound propagation and speech transmission along a tunnel with a "T" intersection were investigated. At receivers within sight of the sound source, low frequencies were attenuated more than high frequencies around the intersection. At receivers out of sight of the source, high frequencies were extensively attenuated. The overall pattern of sound attenuation along the different sections of tunnel, which was calculated by the conical beam method, agreed well with the measurements in this study. Numerical calculations of reflected and diffracted waves with minimum transmission paths in a two-dimensional plane showed that reflected waves were the primary contributors to sound fields out of sight of the source. The articulation scores measured at receivers within sight of the source were high, and most of the confusion concerned syllables that could easily be misheard even at a high signal-to-noise ratio. The types of syllable confusions observed at the receivers out of sight of the source appeared to have been caused by the greater deterioration in speech signals along this part of the tunnel, especially at high frequencies. The rapid speech transmission index (RASTI) appeared to overestimate intelligibility at the receivers out of sight of the source. Taking into account the early decay times of impulsive sound and the calculation procedures used in RASTI, it is concluded that speech intelligibility may not have been evaluated correctly by RASTI.

6.
Speech transmission index (STI) is an objective measure of the acoustic properties of office environments and is used to specify norms for acceptable acoustic work conditions. Yet the tasks used to evaluate the effects of varying STIs on work performance have often focused on memory (e.g., memory of visually presented words) and reading, and may not give a complete view of how severe the effects are even at low STI values (i.e., when speech intelligibility is low). Against this background, we used a more typical office-work task in the present study. The participants were asked to write short essays (5 min per essay) in 5 different STI conditions (0.08; 0.23; 0.34; 0.50; and 0.71). Writing fluency dropped drastically and the number of pauses longer than 5 s increased at STI values above 0.23. This study shows that realistic work-related performance drops even at low STI values and has implications for how to evaluate acoustic conditions in school and office environments.

7.
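ZHU Peisheng  ZHU Guofeng  ZHU Tong 《应用声学》(Journal of Applied Acoustics) 2014,33(5):419-425
In recent years the rapid development of room-acoustic simulation has offered a potential solution for predicting the speech transmission index (STI). How valid this approach is, however, should be considered before the technique is used. This paper compares simulated and measured STI in three rooms. The study shows the following. When the simulation model is a reasonably accurate acoustic equivalent of the real space, the error of the STI computed with the room-acoustic simulation software ODEON is small. Changes in reverberation time may significantly affect the accuracy of the simulated STI when background noise is high, and the difference between simulated and measured STI may grow as reverberation time increases. Changes in signal-to-noise ratio do not significantly affect the accuracy of the simulated STI. The simulated and measured impulse responses differ somewhat in spectrum, and their reflection sequences in the time domain also differ, but these differences have little effect on the simulated STI. Signal-to-noise ratio errors, which arise rather easily during simulation, had a large effect on the simulated STI. Because many factors influence simulation results and the acoustic equivalence between the model and the real space is complex, an accurate STI prediction is difficult to obtain, especially for rooms that have no real reference for calibration.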

8.
This paper examines the accuracy of the speech transmission index (STI) calculated from the reverberation time (T) and signal-to-noise ratio (LSN) of enclosed spaces. Differences between measured and predicted STIs have been analysed in two rooms (reverberant vs. absorbent), for a wide range of absorption conditions and signal-to-noise ratios (sixteen tests). The STI was measured using maximum length sequence analysis and predictions were calculated using either measured or predicted values of T and LSN, the latter assuming diffuse sound field conditions. The results obtained for all the conditions tested showed that STI predictions based on T and LSN tend to underestimate the STI, with differences between measured and predicted STIs always lower than 0.1 (on a 0.0–1.0 scale), and on average lower than 0.06. According to previous research, these differences are noticeable and therefore non-negligible, as 0.03 is the just noticeable difference in STI. The use of either measured or predicted values of T and LSN provided similar STI predictions (i.e. non-noticeable changes), with differences between predictions that are on average lower than 0.03 for the absorbent room, and lower than 0.01 for the reverberant room.
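A prediction of this kind can be sketched in a few lines. The sketch below is a simplified illustration, not the procedure used in the paper: it assumes a single reverberation time and a single signal-to-noise ratio for all bands, the textbook diffuse-field modulation transfer function, and an unweighted mean over 14 modulation frequencies. The function names `mtf`, `sti_estimate` and `is_noticeable` are hypothetical; only the 0.03 just noticeable difference comes from the abstract.

```python
import math

# Modulation frequencies in Hz (one-third-octave steps from 0.63 to 12.5 Hz).
MOD_FREQS = [0.63, 0.8, 1.0, 1.25, 1.6, 2.0, 2.5,
             3.15, 4.0, 5.0, 6.3, 8.0, 10.0, 12.5]

JND_STI = 0.03  # just noticeable difference in STI quoted in the abstract


def mtf(f_mod, t_rev, snr_db):
    """Modulation transfer value for diffuse-field reverberation plus steady noise.

    Assumption: ideal exponential decay with reverberation time t_rev (s).
    """
    reverb = 1.0 / math.sqrt(1.0 + (2.0 * math.pi * f_mod * t_rev / 13.8) ** 2)
    noise = 1.0 / (1.0 + 10.0 ** (-snr_db / 10.0))
    return reverb * noise


def sti_estimate(t_rev, snr_db):
    """Simplified single-band STI: unweighted mean of the transmission indices."""
    indices = []
    for f_mod in MOD_FREQS:
        # Keep m strictly inside (0, 1) so the logarithm stays finite.
        m = min(max(mtf(f_mod, t_rev, snr_db), 1e-6), 1.0 - 1e-6)
        snr_app = 10.0 * math.log10(m / (1.0 - m))   # apparent SNR in dB
        snr_app = max(-15.0, min(15.0, snr_app))     # limit to +/- 15 dB
        indices.append((snr_app + 15.0) / 30.0)      # map to 0..1
    return sum(indices) / len(indices)


def is_noticeable(sti_a, sti_b):
    """True if two STI values differ by at least one just noticeable difference."""
    return abs(sti_a - sti_b) >= JND_STI
```

A call such as `is_noticeable(sti_estimate(0.6, 15.0), measured_sti)` would then flag whether a prediction error is large enough to matter, where `measured_sti` stands for a value obtained from a measurement.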

9.
Traffic tunnels are generally hostile acoustic environments, both in terms of reverberation and ambient noise levels. Public address (PA) systems used to convey spoken warnings must meet stringent design requirements in order to produce sufficiently intelligible speech. To be able to predict PA system performance at tunnel design time, two different speech transmission index (STI) calculation procedures were implemented. The first procedure predicts the STI based on ray-tracing simulations. Comparison with measured STI data showed that this simulation approach yields accurate intelligibility estimates. However, the procedure is time-consuming and too complex to be used by non-specialists. For this reason, a second (simpler and more efficient) procedure was developed, based on fixed non-linear regression, statistically deriving prediction functions from measured data and ray-tracing results. This procedure was compared to the approach based on ray tracing, and found to yield STI predictions closely matching those of the ray-tracing model.

10.
J Kang 《Applied Acoustics》2002,63(12):1315-1333
The objective of this paper is to study the basic characteristics of conversation intelligibility in dining spaces where the seat number and occupancy level are relatively high, and to investigate the effectiveness of strategic architectural acoustic treatments on improving the intelligibility. A radiosity-based computer model has been developed and a parametric study has been carried out using the model. Computation in a typical dining hall shows that a design merely based on the current guidelines for space use may lead to very poor conversation intelligibility. Increasing boundary absorption can typically increase the speech transmission index (STI) by 0.2-0.4. For a given amount of absorption, in a regularly-shaped dining hall the difference in intelligibility between various absorber arrangements is generally negligible, whereas in a flat or long dining hall it is important to strategically arrange the absorbers. The improvement in intelligibility by enlarging the area per diner, changing the ceiling height, and increasing the length/width ratio has also been investigated. For a given room condition, the model can give the maximum number of seats according to the requirement in intelligibility.

11.
For the purpose of improving speech transmission performance in a dome space, the acoustical properties in a dome having a diameter of 20 m were examined. The acoustical properties measured evenly on the floor of the dome were evaluated both objectively and subjectively, and the interrelationship of the objective and subjective measures was also examined. Then, on the basis of the results of the study, simplified acoustical remedies were applied to the dome to improve speech intelligibility and the effect of the remedies was also examined. The following findings were obtained from this investigation. (1) The speech transmission performance in the dome space without treatment by absorptive materials varies greatly with the locations of sound sources and observation points: a range of 0.17-0.59 for RASTI value and a range of 30-97% for speech intelligibility test results. (2) There are peculiar observation points at which speech transmission quality is very high due to a considerable sum of the energy arriving in the first 0.06 s after the arrival of the direct sound. (3) Of all the measured acoustical parameters, RASTI, EDT in the 1 kHz band, early-to-late arriving sound energy ratio, and Ts corresponded well to the speech intelligibility test scores. (4) Rubber tiles, cotton canvas 12 m in length, and glass wool board are remarkably effective in improving speech intelligibility owing to increased sound absorption and the diffusion effect.

12.
While the Speech Transmission Index (STI) is widely applied for prediction of speech intelligibility in room acoustics and telecommunication engineering, it is unclear how to interpret STI values when non-native talkers or listeners are involved. Based on subjectively measured psychometric functions for sentence intelligibility in noise, for populations of native and non-native communicators, a correction function for the interpretation of the STI is derived. This function is applied to determine the appropriate STI ranges with qualification labels ("bad"-"excellent"), for specific populations of non-natives. The correction function is derived by relating the non-native psychometric function to the native psychometric function by a single parameter (nu). For listeners, the nu parameter is found to be highly correlated with linguistic entropy. It is shown that the proposed correction function is also valid for conditions featuring bandwidth limiting and reverberation.

13.
14.
By systematically varying the amount of sound absorption, and the location of the sound-absorbing material in a simulated classroom, it was possible to assess the accuracy of the prediction of speech metrics in quite simple acoustical environments. Predictions of speech level, early-to-late sound ratios (C50) and speech transmission index (STI) values were obtained analytically and with two hybrid ray-based computer programs, RAYNOISE 3.0 and ODEON 4.1. The RAYNOISE predictions were accomplished with a purely specular reflection model and also with a calibrated diffuse reflection model. ODEON uses a parameter called transition order, TO, to change the reflection procedure from purely specular to diffuse for reflections that have orders higher than TO. A parametric study was conducted to determine the best transition order for the ODEON prediction of speech metrics. It was found that the analytical predictions of speech level and C50 were on average accurate to about 1 just-noticeable difference (jnd), whereas the analytical predictions of STI were on average within 2 jnd's. ODEON predictions of speech level, C50 and STI were on average within 2 jnd's. RAYNOISE predictions of C50 and STI with the specular model were on average within 2 jnd's. However, the RAYNOISE predictions of speech level, with both types of reflection models, and the RAYNOISE predictions of C50 and STI with the diffuse model had average errors greater than 2 jnd's. The effects of the sound-absorption treatments on the measured speech metric values are also discussed.
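The early-to-late ratio C50 mentioned above is straightforward to compute once an impulse response is available. The following is a minimal sketch under stated assumptions: the impulse response `ir` is a plain list of samples that starts at the arrival of the direct sound, and the hypothetical helper `clarity_db` is an illustration rather than the routine used by RAYNOISE or ODEON.

```python
import math


def clarity_db(ir, fs, t_early=0.050):
    """Early-to-late energy ratio in dB; t_early = 0.050 s gives C50.

    ir: impulse response samples, assumed to start at the direct sound.
    fs: sampling rate in Hz.
    """
    split = int(round(t_early * fs))          # first sample of the late part
    early = sum(x * x for x in ir[:split])    # energy arriving before t_early
    late = sum(x * x for x in ir[split:])     # energy arriving afterwards
    if early == 0.0 or late == 0.0:
        raise ValueError("need energy on both sides of the split")
    return 10.0 * math.log10(early / late)
```

Changing `t_early` to 0.080 gives the 80 ms variant of the same ratio, so a single function covers both time limits.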

15.
Subjective speech intelligibility can be assessed by speech recorded in an anechoic chamber and then convolved with room impulse responses that can be created by acoustic simulation. The speech intelligibility (SI) assessment based on auralization was validated in three rooms. The articulation scores obtained from the simulated sound field were compared with those from the measured sound field and from direct listening in the rooms. Results show that the speech intelligibility prediction based on the auralization technique with simulated binaural room impulse responses (BRIRs) agrees with reality and with results from measured BRIRs. When this technique is used with simulated and measured monaural room impulse responses (MRIRs), the predicted results underestimate reality. It has been shown that the auralization technique with simulated BRIRs is capable of assessing subjective speech intelligibility at listening positions in the room.
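The convolution step at the heart of auralization can be illustrated as follows. This is a minimal sketch, assuming the anechoic recording and the two ear responses are plain lists of samples at the same sampling rate; `convolve` and `auralize_binaural` are hypothetical helper names, and the direct-form loop is chosen for clarity (FFT-based convolution is the usual choice for signals of realistic length).

```python
def convolve(signal, impulse_response):
    """Direct-form linear convolution of two sample lists."""
    out = [0.0] * (len(signal) + len(impulse_response) - 1)
    for i, s in enumerate(signal):
        for j, h in enumerate(impulse_response):
            out[i + j] += s * h
    return out


def auralize_binaural(dry_speech, brir_left, brir_right):
    """Convolve dry (anechoic) speech with the left- and right-ear responses."""
    return convolve(dry_speech, brir_left), convolve(dry_speech, brir_right)
```

Using a single impulse response for both ears reduces this to the monaural case described in the abstract.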

16.
The Signal-to-Noise Ratio devised by Lochner and Burger contributed an objective design index for predicting speech intelligibility. Their index provided a measure of useful and detrimental reflected speech energy according to the integration and masking characteristics of hearing, and enabled predictions to be made from impulse measurements in models. However, it was found necessary to extend the Signal-to-Noise Ratio theory to account for the effect of fluctuating ambient background noise on speech intelligibility. A modified Signal-to-Noise Ratio was derived from a best-fitting empirical correlation with speech intelligibility in a series of measurements in existing auditoria. In the modified Signal-to-Noise Ratio, ambient background noise is no longer considered in terms of its steady state characteristics but more specifically in terms of its transient and spectral characteristics given by the concept of the L10 PNC level. The index has been applied as a design criterion in prediction and evaluation techniques.

17.
It is known that the sound field in a long space is not diffuse, and that the classic theory of room acoustics is not applicable. A theoretical model is developed for the prediction of reverberation time and speech transmission index in rectangular long enclosures, such as corridors and train stations, where the acoustic quality is important for speech. The model is based on an image-source method, and both acoustically hard and impedance boundaries are investigated. An approximate analytical solution is used to predict the frequency response of the sound field. The reverberation time is determined from the decay curve which is computed by a reverse-time integration of the squared impulse response. The angle-dependence of reflection coefficients of the boundaries and the change of phase upon reflection are incorporated in this model. Due to the relatively long distance of sound propagation, the effect of atmospheric absorption is also considered. Measurements of reverberation time and speech transmission index taken from a real tunnel, a corridor, and a model tunnel are presented. The theoretical predictions are found to agree well with the experimental data. An application of the proposed model has been suggested.
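The reverse-time integration step can be sketched as follows. The code is an illustration under stated assumptions, not the authors' implementation: the impulse response is a plain list of samples, the evaluation range of -5 to -35 dB is a common convention chosen here rather than one stated in the abstract, and `schroeder_decay_db` and `reverberation_time` are hypothetical names.

```python
import math


def schroeder_decay_db(ir):
    """Decay curve in dB from reverse-time integration of the squared response."""
    running = 0.0
    curve = [0.0] * len(ir)
    for i in range(len(ir) - 1, -1, -1):   # integrate from the end backwards
        running += ir[i] * ir[i]
        curve[i] = running
    if not curve or curve[0] == 0.0:
        raise ValueError("impulse response contains no energy")
    ref = curve[0]
    return [10.0 * math.log10(e / ref) if e > 0.0 else -math.inf for e in curve]


def reverberation_time(ir, fs, upper_db=-5.0, lower_db=-35.0):
    """Fit a line to the decay curve between upper_db and lower_db and
    extrapolate to a 60 dB decay (evaluation range is an assumption)."""
    decay = schroeder_decay_db(ir)
    points = [(i / fs, d) for i, d in enumerate(decay) if lower_db <= d <= upper_db]
    if len(points) < 2:
        raise ValueError("decay range too short for a line fit")
    mean_t = sum(t for t, _ in points) / len(points)
    mean_d = sum(d for _, d in points) / len(points)
    num = sum((t - mean_t) * (d - mean_d) for t, d in points)
    den = sum((t - mean_t) ** 2 for t, _ in points)
    slope = num / den                       # decay rate in dB per second (negative)
    return -60.0 / slope
```

Integrating backwards smooths the fluctuations of the squared response, which is why the fitted slope is far more stable than one taken from the raw envelope.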

18.
Predictors of speech intelligibility in rooms   (total citations: 6; self-citations: 0; citations by others: 6)
Three different types of acoustical measures were compared as predictors of speech intelligibility in rooms of varied size and acoustical conditions. These included signal-to-noise measures, the speech transmission index derived from modulation transfer functions, and useful/detrimental sound ratios obtained from early/late sound ratios, speech, and background levels. The most successful forms of each type of measure were of similar prediction accuracy, but the useful/detrimental ratios based on a 0.08-s early time interval were most accurate. Several physical measures, although based on very different calculation procedures, were quite strongly related to each other.
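One common way to form a useful/detrimental ratio from an early/late ratio and the speech and background levels is sketched below. It assumes the definition U = 10 log10(E_early / (E_late + E_noise)), with the signal-to-noise ratio taken as total speech energy over noise energy; the abstract does not spell out the exact form used, so both the definition and the name `useful_detrimental_db` are assumptions made for illustration.

```python
import math


def useful_detrimental_db(c_early_db, snr_db):
    """Useful/detrimental ratio in dB.

    c_early_db: early/late energy ratio in dB (e.g. C80 for a 0.08 s limit).
    snr_db: speech level minus background noise level in dB.
    Assumed definition: early / (late + noise), with speech = early + late.
    """
    c = 10.0 ** (c_early_db / 10.0)     # early/late ratio, linear
    n = 10.0 ** (-snr_db / 10.0)        # noise/speech ratio, linear
    # early / (late + noise) = c / (1 + (c + 1) * n) after dividing by late energy
    return 10.0 * math.log10(c / (1.0 + (c + 1.0) * n))
```

With negligible noise the result tends to the early/late ratio itself, and it falls below that value as the background level rises.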

19.
This paper discusses the prediction of verbal-communication quality in eating establishments (EEs). EEs contain talkers and listeners who require high speech intelligibility at their tables, and high speech privacy between tables. Using CATT-Acoustic, verbal-communication quality, quantified by speech transmission index (STI), was predicted in models of three existing EEs. Talker voice-output levels were predicted using an existing empirical model accounting for the Lombard effect. With these, CATT-Acoustic predicted impulse responses, speech levels and noise levels at primary and secondary listener positions, and the corresponding STIs. The untreated EEs were first modeled for various talker and listener positions, and occupancies. Then various treated configurations, involving reduced volume, increased absorption and barriers, were studied to determine the effectiveness of the treatments. The results suggest that placing barriers around tables can be an effective way to achieve good verbal-communication quality. Increasing the absorption of the room surfaces or decreasing the ceiling height to control reverberation may not be effective. However, increasing the surface absorption and putting barriers around tables may achieve optimal speech conditions in EEs. Subdividing large EEs into smaller ones can also be effective.

20.
This paper examines the impact of room acoustic conditions on the speech intelligibility of four languages (English, Polish, Arabic and Mandarin). Listening test scores (diagnostic rhyme tests, phonemically balanced word tests and phonemically balanced sentence tests) of the four languages were compared under four room acoustic conditions defined by their speech transmission index (STI = 0.2, 0.4, 0.6 and 0.8). The results obtained indicated that there was a statistically significant difference between the word intelligibility scores of languages under all room acoustic conditions, apart from the STI = 0.8 condition. English was the most intelligible language under all conditions, and differences with other languages were larger when conditions were poor (maximum difference of 29% at STI = 0.2, 33% at STI = 0.4 and 14% at STI = 0.6). Results also showed that Arabic and Polish were particularly sensitive to background noise, and that Mandarin was significantly more intelligible than those languages at STI = 0.4. Consonant-to-vowel ratios and languages’ distinctive features and acoustical properties explained some of the scores obtained. Sentence intelligibility scores confirmed variations between languages, but these variations were statistically significant only at the STI = 0.4 condition (sentence tests being less sensitive to very good and very poor room acoustic conditions). Overall, the results indicate that large variations between the speech intelligibility of different languages can occur, especially for spaces that are expected to be challenging in terms of room acoustic conditions. Recommendations solely based on room acoustic parameters (e.g. STI) might then prove to be insufficient for designing a multilingual environment.
