首页 | 本学科首页   官方微博 | 高级检索  
     

汉语发音质量评估的实验研究
引用本文:葛凤培, 潘复平, 董滨, 颜永红. 汉语发音质量评估的实验研究[J]. 声学学报, 2010, 35(2): 261-266. DOI: 10.15949/j.cnki.0371-0025.2010.02.027
作者姓名:葛凤培  潘复平  董滨  颜永红
作者单位:1.中国科学院声学研究所 中科信利语音实验室 北京 100190
基金项目:国家高技术研究发展计划(,国家科技支撑计划,国家重点基础研究发展规划项目计划,国家自然科学基金 
摘    要:研究了发音评估系统中通用的置信度测度——后验概率算法,针对它存在的不足,提出了两种改进方案。首先,为了降低计算复杂度,传统算法采用了求最大值算法代替求和算法,在被测发音偏离目标音素集的情况下,这会严重降低后验概率的计算精度,本文提出基于扩展的音素混淆网络的后验概率算法。其次,为使置信度能评估不同语音段长的发音质量优劣,传统算法采用了后验概率的段长规整策略,研究分析发现声学似然值与时间的关系更为紧密,所以本文提出了基于声学似然值的时间规整方案。试验结果表明:与传统算法相比,采用改进的置信度算法能使平均打分错误率相对降低35%左右,有效地改善了计算机辅助语言学习系统的性能。

收稿时间:2009-08-27
修稿时间:2010-01-04

Experimental investigation of Putonghua pronunciation quality assessment system
GE Fengpei, PAN Fuping, DONG Bin, YAN Yonghong. Experimental investigation of Putonghua pronunciation quality assessment system[J]. ACTA ACUSTICA, 2010, 35(2): 261-266. DOI: 10.15949/j.cnki.0371-0025.2010.02.027
Authors:GE Fengpei  PAN Fuping  DONG Bin  YAN Yonghong
Affiliation:1.ThinkIT Speech Lab, Institute of Acoustics, Chinese Academy of Sciences Beijing 100190
Abstract:As the most effective confidence measure in computer assisted language learning system, the posterior probability is used widely, in which some tricks are applied to reduce the computation complexity. It analyzes the defect of the traditional algorithm and proposes some improvements. First, the traditional algorithm adopts the method of maximum instead of sum in the calculation of the denominator, which seriously reduces the accuracy of posterior probability. Therefore, taking into account both computation complexity and system performance, it proposes an algorithm based on phoneme confusion extended network. Second, in the traditional algorithm, the posterior probability is normalized by its segment time. In fact, the acoustic likelihood is more related with time and grows with the frame number. So, it proposes the acoustic likelihood based normalization algorithm. The experimental results show that compared to traditional algorithm, the proposed algorithm can improve system performance significantly, about 35% average score error rate relatively, and the computation complexity does not increased.
Keywords:
本文献已被 CNKI 万方数据 等数据库收录!
点击此处可从《声学学报》浏览原始摘要信息
点击此处可从《声学学报》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号