首页 | 本学科首页   官方微博 | 高级检索  
     

综合非语境因素的语音数据分类与声学建模研究
引用本文:丁鹏, 徐波. 综合非语境因素的语音数据分类与声学建模研究[J]. 声学学报, 2004, 29(1): 23-28. DOI: 10.15949/j.cnki.0371-0025.2004.01.005
作者姓名:丁鹏  徐波
作者单位:1.中国科学院自动化研究所 模式识别国家重点实验室 北京 100080
摘    要:分别采用基于数据聚类和基于先验知识的两种研究方法,深入探讨了性别、口音、语速、信道等非语境因素对语音数据分类与建模的影响。为了综合考虑语境、非语境因素在统一框架下建模的问题,采用非语境因素扩展决策树方法。而对于这种方法生成的多套非语境因素相关的高精度声学模型,提出一种依据最大似然准则,动态组合生成测试人相关声学模型的算法。这种方法可以使系统相对误识率平均降低8%~10%。实验结果说明为非语境因素分类建模可以提高声学模型的建模能力,而且模型组合算法可以有效解决统一建模所带来的模型选择问题。

收稿时间:2002-03-05
修稿时间:2002-06-05

Integrating non-context features in speech data classification and modeling
DING Peng, XU Bo. Integrating non-context features in speech data classification and modeling[J]. ACTA ACUSTICA, 2004, 29(1): 23-28. DOI: 10.15949/j.cnki.0371-0025.2004.01.005
Authors:DING Peng  XU Bo
Affiliation:1.National Laboratory of Pattern Recognition, Institute of Automation, The Chinese Academy of Sciences Beijing 100080
Abstract:Effects of the non-context features, such as gender, speaker group identity, speaking rate and channel, for the classification and modeling of the speech data are studied based on data clustering and pre-classification knowledge methods. In order to incorporate non-context features with the context ones in the modeling process, generalized feature decision tree scheme is adopted and extended for the building of multiple high resolution acoustic models. Maximum likelihood model combination is then advanced to solve the subsequent model selection problem. Experimental results on two sets indicated that 8.
Keywords:
本文献已被 CNKI 等数据库收录!
点击此处可从《声学学报》浏览原始摘要信息
点击此处可从《声学学报》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号