首页 | 本学科首页   官方微博 | 高级检索  
     

融合引导概率的语音识别解码算法研究
引用本文:杨占磊, 刘文举, 晁浩. 融合引导概率的语音识别解码算法研究[J]. 声学学报, 2012, 37(2): 209-217. DOI: 10.15949/j.cnki.0371-0025.2012.02.010
作者姓名:杨占磊  刘文举  晁浩
作者单位:1.中国科学院自动化研究所模式识别国家重点实验室 北京 100190
基金项目:国家重点基础研究发展计划(973计划)(2004CB318105)、国家高技术研究发展计划(863计划)(20060101Z4073,2006AA01Z194)和国家自然科学基金(90820011,60675026,90820303)资助项目。
摘    要:语音帧在声学特征空间中的位置信息可以辅助解码器对潜在路径进行筛选。传统的语音识别系统缺乏利用这种位置信息。针对这种不足,本文提出一种引导概率模型,用于描述语音帧属于声学特征空间不同局部的概率,并将其用于识别。使用引导概率后,解码器更强调对声学特征空间中最有希望的局部进行搜索,保留并扩展通过此局部空间的路径,同时弱化不经过此局部空间的路径。实验结果显示,融合引导概率的解码算法在不显著增加解码复杂度的情形下,使汉字相对错误率下降10.95%。结果分析表明,融合了语音帧声学位置信息的解码方法能够更有效地鉴别潜在路径,从而降低误识率。

收稿时间:2011-03-23
修稿时间:2011-06-09

Integrating induced probability into decoding for large vocabulary continuous speech recognition
YANG Zhanlei, LIU Wenju, CHAO Hao. Integrating induced probability into decoding for large vocabulary continuous speech recognition[J]. ACTA ACUSTICA, 2012, 37(2): 209-217. DOI: 10.15949/j.cnki.0371-0025.2012.02.010
Authors:YANG Zhanlei  LIU Wenju  CHAO Hao
Affiliation:1.National Laboratory of Pattern Recognition, Institute of Automation, Chinese Academy of Sciences Beijing 100190
Abstract:This paper integrates location information of frames into conventional acoustic model (AM) and language model (LM) likelihoods, in order to distinguish potential path candidates more precisely at decoding stage. This paper proposes an induced probability, which represents location information of frames within the whole acoustic space. By integrating the induced probability, the decoder is directed to search within the most promising regions of acoustic space. Promising paths are enhanced and unlikely paths are weakened. Experiments conducted on Chinese Putonghua show that the character error rate is reduced by 10.95% relatively without increasing decoding complexity significantly. Finally, pruning analysis shows that integrating location information of frames into traditional decoding framework is helpful for improving system performance. 
Keywords:
点击此处可从《声学学报》浏览原始摘要信息
点击此处可从《声学学报》下载免费的PDF全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号