首页 | 本学科首页   官方微博 | 高级检索  
     检索      


Speaker Identification in a Shouted Talking Environment Based on Novel Third-Order Circular Suprasegmental Hidden Markov Models
Authors:Ismail M A Shahin
Institution:1.Department of Electrical and Computer Engineering,University of Sharjah,Sharjah,UAE
Abstract:It is well known that speaker identification yields very high performance in a neutral talking environment; on the other hand, the performance has been sharply declined in a shouted talking environment. This work aims at proposing, implementing, and evaluating novel third-order circular suprasegmental hidden Markov models (CSPHMM3s) to improve the low performance of text-independent speaker identification in a shouted talking environment. CSPHMM3s possess combined characteristics of: Circular Hidden Markov models, third-order hidden Markov models (HMM3s), and suprasegmental hidden Markov models (SPHMMs). Our results show that CSPHMM3s are superior to each of: first-order left-to-right suprasegmental hidden Markov models (LTRSPHMM1s), second-order left-to-right suprasegmental hidden Markov models (LTRSPHMM2s), third-order left-to-right suprasegmental hidden Markov models (LTRSPHMM3s), first-order circular suprasegmental hidden Markov models (CSPHMM1s), and second-order circular suprasegmental hidden Markov models (CSPHMM2s) in a shouted talking environment. Using our collected speech database, average speaker identification performance in a shouted talking environment based on LTRSPHMM1s, LTRSPHMM2s, LTRSPHMM3s, CSPHMM1s, CSPHMM2s, and CSPHMM3s is 74.6, 78.4, 81.7, 78.7, 83.4, and 85.8 %, respectively. Speaker identification performance that has been achieved based on CSPHMM3s is close to that attained based on subjective assessment by human listeners.
Keywords:
本文献已被 SpringerLink 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号