首页 | 本学科首页   官方微博 | 高级检索  
     检索      

嗓音多频带非线性分析的声带病变识别
引用本文:周强,张晓俊,顾济华,赵鹤鸣,朱俊杰,陶智.嗓音多频带非线性分析的声带病变识别[J].声学学报,2014,39(1):111-118.
作者姓名:周强  张晓俊  顾济华  赵鹤鸣  朱俊杰  陶智
作者单位:1. 苏州大学物理科学与技术学院 苏州 215006;
基金项目:国家自然科学基金(61271359,61071215);苏州大学捷美生物医学工程仪器联合实验室项目资助
摘    要:提出了一种嗓音多频带非线性分析的声带病变识别方法,以提高声带病变嗓音的识别率。首先采用Gammatone听觉滤波器组对嗓音信号进行滤波,求取每个频带下的最大李雅普诺夫指数;对映射到核空间的数据采用高斯最大似然度准则优化核函数,然后采用优化核主成分分析算法实现特征抽取。识别实验表明,多频带最大李雅普诺夫指数的识别率比传统的MFCC和最大李雅普诺夫指数分别有6.52%和8.45%的提高,且采用优化核主成分分析算法比传统核主成分分析算法有更好的抽取效果.将多频带非线性分析和优化核主成分分析算法结合,识别率提升至97.82%。 

关 键 词:声带病变  非线性分析  最大李雅普诺夫指数  核函数  核主成分分析  滤波器组  多频带  特征抽取  识别率  嗓音
收稿时间:2012-09-21

Vocal cords diseases detection by multi-band nonlinear analysis of voice
Institution:1. School of Physical Science and technology, Soochow university Suzhou 215006;2. School of Electronics and Information Engineering, Soochow University Suzhou 215006
Abstract:In order to improve the recognition rate of pathological voices caused by disease of vocal cords, multi-band nonlinear analysis is proposed. Gammatone filter bank is applied to voice signal for front-end time-domain filtering, and then calculate the largest Lyapunov exponent of every band. Data is first mapped into kernel space and use Gaussian maximum likelihood rule to get the best parameter for kernel, which is used for kernel principal component analysis to extract feature. The proposed feature achieves higher recognition rate of 6.25% and 8.45% than MFCC and the largest Lyapunov exponent respectively. When the proposed kernel function is used for kernel principal component analysis, it achieves better performance than traditional function. Ultimately, we get recognition rate of 97.82% by combing them. 
Keywords:
本文献已被 CNKI 等数据库收录!
点击此处可从《声学学报》浏览原始摘要信息
点击此处可从《声学学报》下载免费的PDF全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号