Similar literature
 20 similar documents found (search time: 15 ms)
1.
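Dong Bin, Zhao Qingwei, Yan Yonghong. 《声学学报》 (Acta Acustica), 2007, 32(2): 122-128
A classification-based evaluation method is proposed that uses vowel formant patterns as features and support vector machines as the classifier, for objective testing of the pronunciation quality of finals in Mandarin Chinese. For each final, the algorithm trains a full classification model, a sub-classification model, and an evaluation model, and scores pronunciation quality on the basis of the two-level classification. Experimental results show that the full classification model achieves a classification accuracy above 90% and that the agreement between the objective test and subjective expert evaluation reaches 82%, outperforming the traditional hidden Markov model approach based on cepstral coefficient features.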

2.
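A pronunciation quality evaluation method based on the perceptibility of phoneme models
Zhang Ru, Han Jiqing. 《声学学报》 (Acta Acustica), 2013, 38(2): 201-207
To improve the accuracy of pronunciation quality assessment, a pronunciation quality evaluation method based on the perceptibility of phoneme models is proposed. It takes the difference between the expected log posterior probabilities of the acoustic features of samples under different speech sample sets as a phoneme model's perceptibility of variant pronunciations and, on that basis, generates the candidate set of recognition models for each phoneme. Experiments show that the proposed method reduces the size of the candidate phoneme model set of the speech recognition network by about 95%. On a non-native speech database, the correlation between the method's scores and human expert scores is 0.828, the detection rate for initial and final errors is 70.8%, and the detection rate for tone errors is 42.5%, all better than the other methods compared.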

3.
4.
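Zhao Yi, Yin Xuefei, Chen Ke'an. 《应用声学》 (Applied Acoustics), 2010, 29(6): 416-424
The formant frequency is an important parameter of the speech signal. Traditional formant detection algorithms based on linear prediction are limited by their computational cost and are difficult to run in real time. This paper proposes a formant frequency detection algorithm based on the cepstral transform. In a post-processing step, it compares the results detected from the second derivative of the log-magnitude frequency response of the vocal tract impulse response with those detected from the first derivative of its phase response, removes spurious peaks, and identifies merged formants, thereby improving detection accuracy. Simulation results show that the algorithm is computationally efficient and maintains good detection performance at low signal-to-noise ratios.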

5.
6.
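Hu Qi, Zhao Qingwei, Ma Lian, Yan Yonghong. 《声学学报》 (Acta Acustica), 2014, 39(6): 757-763
To address the weakening or loss of stops that often occurs in cleft palate patients, an objective method for evaluating cleft palate repair surgery is proposed, based on detecting burst energy in the stop segment. The method uses an auditory-like filter bank as the front end and computes, within each subband of the processed signal, the rate of energy change during stop release. The average subband energy change rates of the cleft palate group and the postoperative control group were compared; the results show that the cleft palate group has a smaller release energy change rate in the high-frequency range (subband center frequencies from 209.8 Hz to 8000 Hz). Experiments were carried out on the unaspirated voiceless stops /d/ and /b/; logistic regression shows that the agreement between the proposed method and subjective listening judgments reaches 88.9% on the syllable /di/ and 90.27% on the syllable /bu/.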

7.
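Initial-final segmentation of whispered speech based on an auditory model
Ding Hui, Li Xueli, Xu Boling. 《应用声学》 (Applied Acoustics), 2004, 23(2): 20-25, 44
This paper analyzes the characteristics of whispered speech and, drawing on basic theory and experimental data from physiological and psychological acoustics, proposes a method for initial-final segmentation of whispered speech using an auditory model. The auditory perception model suited to this task is described as having four main levels: the mechanism by which the cochlea decomposes sound by frequency; the nonlinear time-domain and frequency-domain transformations of the auditory system; and the lateral inhibition mechanism of the central nervous system. The model reflects how humans perceive low-energy speech in noisy environments and is therefore suited to whispered speech recognition; it gave satisfactory results in initial-final segmentation experiments on whispered speech.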

8.
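Wang Xiaohong, Sun Ping, Xu Zhuo, Lü Zhaofeng. 《光学技术》 (Optical Technique), 2012, 38(5): 573-578
Using the Munsell renotation data set, four indices based on pattern recognition techniques (circularity, hue-angle deviation, lightness linearity, and the clustering of points projected in space) are used to evaluate eight typical current color appearance models in terms of chroma uniformity, hue prediction accuracy, lightness uniformity, and spatial color reproduction, respectively. The results show that different color appearance models have different strengths across these four aspects of color reproduction. Of the eight models, RLAB predicts chroma best; every model predicts red and green relatively well, and CAM02 has the overall advantage in hue prediction; lightness uniformity is predicted well by all models, especially CIELAB; and LLAB has the best spatial color reproduction.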

9.
Studying acoustic characteristics is important for speech and speaker recognition in Chinese whispered speech. This paper introduces the characteristics of whispered speech and discusses the acoustic characteristics of Chinese whispered speech. Whispered speech has no fundamental frequency, so other characteristics, such as duration and formant frequency, are extracted and analyzed. Experiments with six simple Chinese whispered vowels show that duration and formant frequency can serve as the main acoustic characteristics for Chinese whispered speech recognition.

10.
An objective visual performance evaluation based on visual evoked potential (VEP) measurements was integrated into an adaptive optics (AO) system for the first time. The optical and neural limits to vision can be bypassed through this system. Visual performance can be measured electrophysiologically with VEP, which reflects the objective function from the retina to the primary visual cortex. Preliminary VEP measurements with and without AO correction were carried out using this system, demonstrating its great potential for objective visual performance evaluation. The new system will provide the necessary technical and equipment support for further study of human visual function.

11.
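Shao Jian, Zhao Qingwei, Yan Yonghong. 《声学学报》 (Acta Acustica), 2010, 35(5): 587-592
This paper studies the choice of modeling units for spontaneous Mandarin speech recognition. In three-state HMMs, initial/final units and phoneme units are the two most popular modeling units, each with advantages and drawbacks. On the one hand, because pronunciation variation is severe in spontaneous speech, coarse-grained initial/final units are preferred to cover the variants; on the other hand, because the three-state structure may not describe complex units effectively, fine-grained phoneme units are preferred. Building on findings from experimental phonetics and on a duration analysis of initials and finals, this paper argues for selectively splitting extended initial/final units and proposes a splitting method based on separating the nasal coda. Experimental results show that, compared with extended initial/final units and phoneme units, the proposed method clearly improves recognition performance, reducing the character error rate by 2.23% and 9.45%, respectively.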

12.
The purpose of this experiment was to study the effects of changes in speaking rate on both the attainment of acoustic vowel targets and the relative time and speed of movements toward these presumed targets. Four speakers produced a number of different CVC and CVCVC utterances at slow and fast speaking rates. Spectrographic measurements showed that the midpoint formant frequencies of the different vowels did not vary as a function of rate. However, for fast speech the onset frequencies of second formant transitions were closer to their target frequencies while CV transition rates remained essentially unchanged, indicating that movement toward the vowel simply began earlier for fast speech. Changes in speaking rate and changes in lexical stress had different effects. For stressed vowels, an increase in speaking rate was accompanied primarily by a decrease in duration. However, destressed vowels, even if they were of the same duration as quickly produced stressed vowels, were reduced in overall amplitude, fundamental frequency, and to some extent, vowel color. These results suggest that speaking rate and lexical stress are controlled by two different mechanisms.

13.
A significant body of evidence has accumulated indicating that vowel identification is influenced by spectral change patterns. For example, a large-scale study of vowel formant patterns showed substantial improvements in category separability when a pattern classifier was trained on multiple samples of the formant pattern rather than a single sample at steady state [J. Hillenbrand et al., J. Acoust. Soc. Am. 97, 3099-3111 (1995)]. However, in the earlier study all utterances were recorded in a constant /hVd/ environment. The main purpose of the present study was to determine whether a close relationship between vowel identity and spectral change patterns is maintained when the consonant environment is allowed to vary. Recordings were made of six men and six women producing eight vowels (see text) in isolation and in CVC syllables. The CVC utterances consisted of all combinations of seven initial consonants (/h,b,d,g,p,t,k/) and six final consonants (/b,d,g,p,t,k/). Formant frequencies for F1-F3 were measured every 5 ms during the vowel using an interactive editing tool. Results showed highly significant effects of phonetic environment. As with an earlier study of this type, particularly large shifts in formant patterns were seen for rounded vowels in alveolar environments [K. Stevens and A. House, J. Speech Hear. Res. 6, 111-128 (1963)]. Despite these context effects, substantial improvements in category separability were observed when a pattern classifier incorporated spectral change information. Modeling work showed that many aspects of listener behavior could be accounted for by a fairly simple pattern classifier incorporating F0, duration, and two discrete samples of the formant pattern.

14.
15.
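Li Zhigang. 《中国光学》 (Chinese Optics), 2015, 8(6): 909-918
Building on a review of the development of cryogenic absolute radiometers and SIRCUS, this paper discusses the working principle, development, and application prospects of a spectrally tunable, self-calibrating standard light source based on detector standards. In research on detector-based spectral radiometric standards, cryogenic absolute radiometers operating at liquid-helium temperature reach an uncertainty of 0.01%. The facility for Spectral Irradiance and Radiance Responsivity Calibrations using Uniform Sources (SIRCUS), built by the US National Institute of Standards and Technology (NIST), uses a series of lasers and is calibrated by silicon trap detectors traceable to the cryogenic absolute radiometer; its uncertainty has reached 0.1%, and it has been applied successfully to high-accuracy radiometric calibration of space remote sensing instruments. The analysis concludes that the spectrally tunable, self-calibrating standard light source based on detector standards, now under development, offers high calibration accuracy and corrects for aging and degradation on its own, keeping the calibration accuracy stable over the long term.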

16.
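A spectral recognition method for pharmaceutical color standards
A new spectral recognition method for pharmaceutical color standards is proposed that departs from the traditional tristimulus colorimetry approach. Spectral features and a non-fixed background mode are used in place of tristimulus values and the CIE measurement mode, and a good discriminant function is established. The method is suited to objective inspection and grading against pharmaceutical color standards, and the results are satisfactory.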

17.
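《光学技术》 (Optical Technique), 2015, (5): 396-399
Image sharpness is one of the most commonly used indices for evaluating image quality. Existing sharpness evaluation models do not take sufficient account of the luminance masking property of human vision. To address this, a no-reference objective image sharpness model is constructed on the basis of root-mean-square contrast: taking luminance masking into account, it computes the perceived contrast of the regions of the image that interest the human eye (those containing detail, edges, and texture). The model was validated on the IVC database. The results show that, compared with four existing sharpness (blur) evaluation models, its results are closer to human subjective perception, and it has low computational cost and short running time, making it a simple and effective image sharpness evaluation model.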
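The root-mean-square contrast that this abstract takes as its starting point can be sketched as follows. This is only the plain, textbook RMS contrast (the standard deviation of normalized luminance values); the luminance-masking weighting and region-of-interest selection described in the abstract are not reproduced, since the abstract does not give their formulas. The function name `rms_contrast` and the sample pixel lists are illustrative assumptions.

```python
import math


def rms_contrast(pixels):
    """Plain RMS contrast: population standard deviation of luminance values.

    `pixels` is assumed to be a flat, non-empty sequence of luminance
    values normalized to the range [0, 1]. This is the textbook definition
    only, not the perceptual model of the paper.
    """
    n = len(pixels)
    mean = sum(pixels) / n
    return math.sqrt(sum((p - mean) ** 2 for p in pixels) / n)


# Hypothetical sample data: a uniform patch and a patch of alternating edges.
flat_patch = [0.5, 0.5, 0.5, 0.5]
edge_patch = [0.0, 1.0, 0.0, 1.0]

print(rms_contrast(flat_patch))  # no luminance variation, so zero contrast
print(rms_contrast(edge_patch))  # strong variation gives a larger value
```

A uniform patch yields zero contrast, while a patch with strong edges yields a larger value, which is why a contrast measure of this kind can serve as a basis for a sharpness score.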

18.
The goal of this study was to measure the ability of normal-hearing listeners to discriminate formant frequency for vowels in isolation and in sentences at three signal levels. Results showed significant elevation in formant thresholds as formant frequency and linguistic context increased. Signal level showed a rollover effect, especially for F2, in which formant thresholds at 85 dB SPL were lower than thresholds at 70 or 100 dB SPL in both isolated vowels and sentences. This rollover level effect could be due to reduced frequency selectivity and forward/backward masking in sentences at high signal levels for normal-hearing listeners.

19.
The room acoustical parameters reverberation time, RT; early decay time, EDT; clarity, C80; time gravity, Tg; bass ratio, BR; strength, G; initial time delay gap, ITDG; interaural cross-correlation coefficient, IACC(E), where the binaural quality index BQI equals [1-IACC(E3)]; and stage support, ST1, were measured in 18 major chamber-music halls in Austria, Germany, the Netherlands, the Czech Republic, Switzerland, and Japan, employing procedures in accordance with ISO 3382 (1997). In combination with the architectural data, the intrinsic objective parameters for the acoustics of chamber-music halls and their range of variation were examined. The results of these studies reveal four pertinent orthogonal parameters: RT, G, ITDG, and BQI. General design guidelines for a chamber-music hall are presented.

20.
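A pattern recognition method for the four tones of isolated Mandarin syllables
Tone recognition for isolated Mandarin syllables is an important task in Mandarin speech recognition. This paper proposes a new pattern recognition algorithm for recognizing the four Mandarin tones. On the basis of extensive statistical experiments, four parameters are defined to describe the fundamental frequency contour. Assuming that these parameters follow a multivariate normal distribution (statistical experiments indicate that the assumption is reasonable), a formula for the distance between the parameter vector and each tone class is derived from the minimum-error-probability criterion, giving the best recognition performance in the statistical sense. Speaker-independent four-tone recognition experiments show that the algorithm achieves very satisfactory results.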
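For orientation, the standard textbook form of a minimum-error-probability distance under a multivariate normal assumption is given below. The abstract does not state the paper's own formula, so this is a sketch under assumed notation rather than the authors' result: $\mathbf{x}$ is the four-dimensional parameter vector, $\omega_i$ ($i = 1, \dots, 4$) are the tone classes, and $\boldsymbol{\mu}_i$, $\boldsymbol{\Sigma}_i$, and $P(\omega_i)$ are the class mean, covariance matrix, and prior probability.

```latex
% Class-conditional density assumed multivariate normal with dimension d = 4:
%   p(x | \omega_i) = (2\pi)^{-d/2} |\Sigma_i|^{-1/2}
%                     \exp\{ -\tfrac{1}{2} (x - \mu_i)^T \Sigma_i^{-1} (x - \mu_i) \}
% Maximizing the posterior P(\omega_i | x) is equivalent to minimizing
% -2 \ln [ p(x | \omega_i) P(\omega_i) ], dropping the class-independent constant:
\[
d_i(\mathbf{x}) =
  (\mathbf{x} - \boldsymbol{\mu}_i)^{\mathsf{T}} \boldsymbol{\Sigma}_i^{-1} (\mathbf{x} - \boldsymbol{\mu}_i)
  + \ln \lvert \boldsymbol{\Sigma}_i \rvert
  - 2 \ln P(\omega_i),
\qquad
\hat{\imath} = \arg\min_{i} \, d_i(\mathbf{x}).
\]
```

The first term is the squared Mahalanobis distance to the class mean; the remaining two terms correct for differences in class covariance and prior probability. With equal priors and equal covariances the rule reduces to choosing the class with the nearest mean in the Mahalanobis sense.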
