首页 | 本学科首页   官方微博 | 高级检索  
     检索      

基于Fisher线性判别分析的语音信号端点检测方法
引用本文:王明合,张二华,唐振民,许昊.基于Fisher线性判别分析的语音信号端点检测方法[J].电子与信息学报,2015,37(6):1343-1349.
作者姓名:王明合  张二华  唐振民  许昊
作者单位:南京理工大学计算机科学与工程学院 南京 210094
摘    要:传统的语音端点检测方法对辅音,特别是受到噪声污染的清音部分与背景噪声之间分离能力不足。针对上述问题,该文提出一种基于Fisher线性判别分析的梅尔频率倒谱系数(F-MFCC)端点检测方法。将清音信号和背景噪声视为两类分类问题,采用Fisher准则求解具有判别信息的最佳投影方向,使得投影后的特征参数具有最小类内散度和最大类间散度,从而增大清音与背景噪声的可分离性。在不同语音库上的实验结果表明,F-MFCC能够在不同信噪比和背景噪声条件下提高语音端点检测的准确率。

关 键 词:语音处理    语音端点检测    梅尔频率倒谱系数    Fisher线性判别分析
收稿时间:2014-08-29

Voice Activity Detection Based on Fisher Linear Discriminant Analysis
Wang Ming-he,Zhang Er-hua,Tang Zhen-min,Xu Hao.Voice Activity Detection Based on Fisher Linear Discriminant Analysis[J].Journal of Electronics & Information Technology,2015,37(6):1343-1349.
Authors:Wang Ming-he  Zhang Er-hua  Tang Zhen-min  Xu Hao
Abstract:Traditional Voice Activity Detection (VAD) approaches can not effectively detect consonant as well as noisy unvoiced consonant. To address this problem, this paper proposes a VAD approach Mel Frequency Cepstrum Coefficient (F-MFCC) based on Fisher linear discriminant analysis, in consideration of two-class issue regarding to consonant and background noise. Fisher criterion rule is used to solve the optimal projection vector, building upon which we can minimize the within-class scatter can be minimized and the between-class scatter can be maximized, as a result to enhance separability between consonant and background noise. Extensive experiments are conducted to evaluate the F-MFCC performance. The results demonstrate that, under different SNR and noise conditions, the proposed approach achieves higher VAD accuracy.
Keywords:Speech processing  Voice Activity Detection (VAD)  Mel Frequency Cepstrum Coefficient (MFCC)  Fisher linear discriminant analysis
本文献已被 万方数据 等数据库收录!
点击此处可从《电子与信息学报》浏览原始摘要信息
点击此处可从《电子与信息学报》下载免费的PDF全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号