首页 | 本学科首页   官方微博 | 高级检索  
     检索      


Statistical feature of pitch frequency distributions for robust speaker identification
Authors:Linghua Zhang  Baoyu Zheng  Zhen Yang
Institution:Dept of Info. Eng., Nanjing University of Posts & Telecom., Nanjing 210003, China
Abstract:This letter proposes an effective and robust speech feature extraction method based on statistical analysis of Pitch Frequency Distributions (PFD) for speaker identification. Compared with the conventional cepstrum, PFD is relatively insensitive to Additive White Gaussian Noise (AWGN), but it does not show good performance for speaker identification, even if under clean environments. To compensate this shortcoming, PFD and conventional cepstrum are combined to make the ultimate decision, instead of simple taking one kind of features into account Experimental results indicate that the hybrid approach can give outstanding improvement for text-independent speaker identification under noisy environments corrupted by AWGN. Communication author: Zhang Linghua, born in 1964, female, associate professor. Department of Information Engineering, Nanjing University of Posts & Telecommunications, Nanjing 210003, China.
Keywords:Speaker identification  Feature extraction  Pitch frequency  Gaussian Mixture Model (GMM)
本文献已被 维普 万方数据 SpringerLink 等数据库收录!
点击此处可从《电子科学学刊(英文版)》浏览原始摘要信息
点击此处可从《电子科学学刊(英文版)》下载免费的PDF全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号