首页 | 本学科首页   官方微博 | 高级检索  
     检索      

基于VEMAP的说话人识别鲁棒性研究
引用本文:黄文娜,彭亚雄.基于VEMAP的说话人识别鲁棒性研究[J].电声技术,2016,40(11):44-47.
作者姓名:黄文娜  彭亚雄
作者单位:贵州大学大数据与信息工程学院,贵州 贵阳,550025
基金项目:基于社区司法矫正的声纹识别系统研究;贵州省数字健康管理工程技术研究中心
摘    要:为了改善发声力度变化对说话人识别系统性能的影响.针对不同发声力度下语音信号的分析,提出了使用发声力度最大后验概率(Vocal Effort Maximum A Posteriori,VEMAP)自适应方法更新基于高斯混合模型-通用背景模型(Gaussian Mixture Model-Universal Background Model,GMM-UBM)的说话人识别系统模型.实验表明,所提出的方法使不同发声力度下系统EER%降低了88.45%与85.16%,有效解决了因发声力度变化引起的训练语音与测试语音音量失配,从而导致说话人识别性能降低的问题,改善说话人识别系统性能效果显著.

关 键 词:说话人识别  发声力度  发声力度最大后验概率自适应  高斯混合模型-通用背景模型
收稿时间:2016/5/17 0:00:00
修稿时间:2016/5/17 0:00:00

Research on robustness of Speaker Recognition Based on VEMAP
Huang Wenna and Peng Yaxiong.Research on robustness of Speaker Recognition Based on VEMAP[J].Audio Engineering,2016,40(11):44-47.
Authors:Huang Wenna and Peng Yaxiong
Abstract:In order to improve the performance of recognition system caused which is influenced by the changes of vocal efforts. In this paper, based on the analysis of the speech signals under different vocal efforts ,Vocal Effort Maximum a posteriori(VEMAP) adaptive method has been proposed to update the speaker recognition model which based on Gaussian Mixture Model-Universal Background Model (GMM-UBM). From the results of this experiment, VEMAP adaptive method can make the EER% of recognition system which under different vocal efforts reduce by 88.45% and 85.16%. Effectively solve the vocal mismatch of training speech and test speech caused by vocal effort that lead to the Speaker recognition performance degradation problem, Improving the speaker recognition system performance obviously.
Keywords:Speaker recognition  Vocal effort  Vocal Effort Maximum a posteriori  Gaussian Mixture Model-Universal Background Model
本文献已被 万方数据 等数据库收录!
点击此处可从《电声技术》浏览原始摘要信息
点击此处可从《电声技术》下载免费的PDF全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号