共查询到20条相似文献,搜索用时 15 毫秒
1.
2.
为了提高发音质量判别精度,提出了一种基于音素模型感知度的发音质量评价方法。它采用不同语音样本集合下样本声学特征的对数后验概率期望差作为音素模型对变异发音的感知度,并以此为基础,生成各音素对应的识别模型候选集。实验表明,所提出的方法使语音识别网络候选音素模型集合尺寸减少约95%;在非母语语音数据库上,该方法评分与人工专家打分相关性为0.828,基于该方法得到的声韵母错误检出率为70.8%,声调错误检出率为42.5%,均优于其它方法。 相似文献
3.
4.
5.
6.
针对腭裂患者易出现塞音弱化或消失的现象,提出了一种基于塞音段爆破能量检测的腭裂康复手术客观评价方法。该方法采用类听觉的滤波器组作为处理前端,并对处理后得到的信号在其各子带内分别计算塞音除阻过程中的能量变化率。对腭裂组和术后对照组的平均子带能量变化率进行了比对,结果表明腭裂组在高频段(子带中心频率从209.8 Hz至8000 Hz)具有较小的除阻能量变化率。对不送气清塞音/d/、/b/进行了实验,Logistic回归表明提出的方法与主观判听一致性在音节/di/和/bu/上分别达到88.9%和90.27%。 相似文献
7.
8.
9.
Study on the acoustical characteristic is important to speech and speaker recognition in Chinese whispered speech. In this paper, the characteristics of whispered speech are introduced and the acoustical characteristics in Chinese whispered speech are discussed. There is no fundamental frequency in the whispered speech, so other characteristics such as the duration and frequency of formant are extracted and analyzed. From experiments with six simple Chinese whispered vowels, it is proved that the duration and the frequency of formant can be used as the main acoustical characteristics in the Chinese whispered recognition. 相似文献
10.
An objective visual performance evaluation with visual evoked potential(VEP) measurements was first integrated into an adaptive optics(AO) system. The optical and neural limits to vision can be bypassed through this system. Visual performance can be measured electrophysiologically with VEP, which reflects the objective function from the retina to the primary visual cortex. The VEP measurements without and with AO correction were preliminarily carried out using this system, demonstrating the great potential of this system in the objective visual performance evaluation. The new system will provide the necessary technique and equipment support for the further study of human visual function. 相似文献
11.
研究汉语自然口语识别中的建模单元选择问题。在HMM三状态模型中,声韵母单元与音素单元作为两种最流行的建模单元各有优劣。一方面从自然口语音变严重的问题出发,倾向采用粗粒度的声韵母单元以概括各种音变;另一方面从三状态结构可能无法有效描述复杂单元的问题出发,又倾向采用细粒度的音素单元。本文在实验语音学理论研究成果与声韵母时长分析实验结果的基础上,主张对扩展声韵母单元进行有选择的拆分,提出了基于鼻韵尾分离的声韵母拆分方法。实验结果表明本文的方法与扩展声韵母单元、音素单元相比,识别性能有了明显改善,其字错误率分别降低2.23%和9.45%。 相似文献
12.
T Gay 《The Journal of the Acoustical Society of America》1978,63(1):223-230
The purpose of this experiment was to study the effects of changes in speaking rate on both the attainment of acoustic vowel targets and the relative time and speed of movements toward these presumed targets. Four speakers produced a number of different CVC and CVCVC utterances at slow and fast speaking rates. Spectrographic measurements showed that the midpoint format frequencies of the different vowels did not vary as a function of rate. However, for fast speech the onset frequencies of second formant transitions were closer to their target frequencies while CV transition rates remained essentially unchanged, indicating that movement toward the vowel simply began earlier for fast speech. Changes in both speaking rate and lexical stress had different effects. For stressed vowels, an increase in speaking rate was accompanied primarily by a decrease in duration. However, destressed vowels, even if they were of the same duration as quickly produced stressed vowels, were reduced in overall amplitude, fundamental frequency, and to some extent, vowel color. These results suggest that speaking rate and lexical stress are controlled by two different mechanisms. 相似文献
13.
Hillenbrand JM Clark MJ Nearey TM 《The Journal of the Acoustical Society of America》2001,109(2):748-763
A significant body of evidence has accumulated indicating that vowel identification is influenced by spectral change patterns. For example, a large-scale study of vowel formant patterns showed substantial improvements in category separability when a pattern classifier was trained on multiple samples of the formant pattern rather than a single sample at steady state [J. Hillenbrand et al., J. Acoust. Soc. Am. 97, 3099-3111 (1995)]. However, in the earlier study all utterances were recorded in a constant /hVd/ environment. The main purpose of the present study was to determine whether a close relationship between vowel identity and spectral change patterns is maintained when the consonant environment is allowed to vary. Recordings were made of six men and six women producing eight vowels (see text) in isolation and in CVC syllables. The CVC utterances consisted of all combinations of seven initial consonants (/h,b,d,g,p,t,k/) and six final consonants (/b,d,g,p,t,k/). Formant frequencies for F1-F3 were measured every 5 ms during the vowel using an interactive editing tool. Results showed highly significant effects of phonetic environment. As with an earlier study of this type, particularly large shifts in formant patterns were seen for rounded vowels in alveolar environments [K. Stevens and A. House, J. Speech Hear. Res. 6, 111-128 (1963)]. Despite these context effects, substantial improvements in category separability were observed when a pattern classifier incorporated spectral change information. Modeling work showed that many aspects of listener behavior could be accounted for by a fairly simple pattern classifier incorporating F0, duration, and two discrete samples of the formant pattern. 相似文献
14.
15.
本文在评述低温绝对辐射计和SIRCUS发展的基础上,讨论了基于探测器标准的光谱可调谐自校准标准光源的工作原理、发展与应用前景。在探测器型光谱辐射标准研究方面,工作在液氦温度的低温绝对辐射计不确定度达0.01%。美国国家标准与技术研究院(NIST)建立的均匀光源光谱辐照度和光谱辐亮度响应度定标装置(SIRCUS)采用一系列激光器,由低温绝对辐射计传递的硅陷阱探测器定标,不确定度已达到0.1%,成功应用于空间遥感仪器高精度辐射定标。分析认为,发展中的基于探测器标准的光谱可调谐自校准标准光源,定标精度高,自行校正老化、衰减,保证了定标精度长期稳定。 相似文献
16.
17.
18.
Liu C 《The Journal of the Acoustical Society of America》2008,123(4):EL52-EL58
The goal of this study was to measure the ability of normal-hearing listeners to discriminate formant frequency for vowels in isolation and sentences at three signal levels. Results showed significant elevation in formant thresholds as formant frequency and linguistic context increased. The signal level indicated a rollover effect, especially for F2, in which formant thresholds at 85 dB SPL were lower than thresholds at 70 or 100 dB SPL in both isolated vowels and sentences. This rollover level effect could be due to reduced frequency selectivity and forward/backward masking in sentence at high signal levels for normal-hearing listeners. 相似文献
19.
The room acoustical parameters reverberation time, RT; early decay time, EDT; clarity, C80; time gravity, Tg; bass ratio, BR; strength, G; initial time delay gap, ITDG; interaural cross-correlation coefficient, IACC(E), the where binaural quality index BQI equals [1-IACC(E3)]; and stage support, ST1 were measured in 18 major chamber-music halls in Austria, Germany, the Netherlands, Czech Republic, Switzerland, and Japan, employing procedures in accordance with ISO 3382 (1997). In combination with the architectural data, the intrinsic objective parameters for the acoustics of chamber-music halls and their variation range were examined. The results of these studies reveal four pertinent orthogonal parameters: RT, G, ITDG, BQI. General design guidelines for a chamber-music hall are presented. 相似文献
20.
普通话孤立字四声的一种模式识别方法 总被引:4,自引:0,他引:4
普通话孤立字的声调识别是普通话语音识别中的一项重要任务.本文提出一种新的模式识别算法进行普通话四声调的识别.在大量统计实验基础上,定义了四个参数做为基音频率轨迹的描述.并且,在假设其服从高维正态分布(统计实验表明,这一假设是合理的)的基础上,根据最小错误概率准则推导出参数矢量与每一声调类型的距离公式,实现了统计意义上的最佳识别效果.对于非特定人的四声识别实验表明,这一算法取得了十分满意的结果。 相似文献