首页 | 本学科首页   官方微博 | 高级检索  
     

自动发音错误检测中基于F1值最大化的声学模型训练方法
引用本文:黄浩,王建明,哈利旦·阿布都热依木,吾守尔·斯拉木. 自动发音错误检测中基于F1值最大化的声学模型训练方法[J]. 声学学报, 2013, 38(6): 751-758. DOI: 10.15949/j.cnki.0371-0025.2013.06.010
作者姓名:黄浩  王建明  哈利旦·阿布都热依木  吾守尔·斯拉木
作者单位:1. 新疆大学信息科学与工程学院 乌鲁木齐 830046;
基金项目:国家自然科学基金(60965002,60865001,61163026)、新疆高校科研计划培育基金(XJEDU2008S15)和新疆大学博士科研启动基金(BS090143)资助
摘    要:摘要为了提高计算机辅助语言学习中自动发音错误检测系统的性能,提出一种声学模型的区分性训练方法。该方法将经过正确度标注的非母语语音数据库上的发音错误检测的F1值的最大化作为模型参数的训练准则。采用Sigmoid 函数对F1值函数进行平滑构造目标函数,并利用构造弱意义辅助函数的方法以及扩展Baum-Welch 形式的参数更新公式进行优化。提出在模型参数更新与音素门限同时优化的策略保证目标函数增长的单调性。发音错误检测实验表明该方法能够有效地增大训练和测试数据检错的F1值。同时训练数据和测试数据上的精确度、召回率以及检测正确度都有明显改进。 

收稿时间:2012-06-26

Maximum F1-score acoustic model training for automatic mispronunciation detection
Affiliation:1. Department of Information Science and Engineering, Xinjiang University Urumqi 830046;2. Department of Electrical Engineering, Xinjiang University Urumqi 830046
Abstract:To improve the performance of automatic mispronunciation detection in computer-assisted language learning, a discriminative acoustic model training method is proposed. The method aims at maximizing the F1-score of mispronunciation detection results on the annotated non-native speech database. The training objective function is formulated as a smooth form of the F1-score by using the sigmoid function, and is optimized by using the extended Baum-Welch form like updating equations based on the weak-sense auxiliary function method. Simultaneous updating strategy of acoustic models and phone threshold parameters is proposed to ensure monotonicity of the objective function improvement. Mispronunciation detection experiments show that the method is effective in increasing the F1-score,precision, recall and detection accuracy on both the training and evaluation data set. 
Keywords:
点击此处可从《声学学报》浏览原始摘要信息
点击此处可从《声学学报》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号