听觉计算模型在鲁棒性语音识别中的应用 Computational auditory model and its application in robust speech signal recognition期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

听觉计算模型在鲁棒性语音识别中的应用

引用本文：	卢绪刚, 陈道文. 听觉计算模型在鲁棒性语音识别中的应用[J]. 声学学报, 2000, 25(6): 492-498. DOI: 10.15949/j.cnki.0371-0025.2000.06.003

作者姓名：	卢绪刚陈道文

作者单位：	中国科学院自动化研究所模式识别实验室!北京，100080，中国科学院自动化研究所模式识别实验室!北京，100080

基金项目：	自然科学基金!69635020

摘要：	利用听觉感知机理,建立一个基于听觉感知机理的语音信号特征提取模型。本文由两部分组成,一部分是在传统听觉计算模型基础上提出听觉倒谱特征AFCC(AnditoryFrequencyCepstralCoefficient)的提取方法,这样既压缩了特征维数,减小计算量,又使各个特征维之间相互独立,满足HMM模型的要求。并且根据听觉神经中枢的长时整合特性,文中提出了用低通滤波模型来模拟这种功能。结合该低通模型,提取的语音信号的听觉倒谱特征在HMM框架下取得较好的鲁律性。另一部分在研究听觉侧抑制机理的基础上,提出一个简单有效的听觉侧抑制处理模型。美尔倒谱特征MFCC谱特征经过该侧抑制模型处理,得到侧抑制美倒谱特征MFCCI,实验表明,该新特征MFCCI鲁棒性能比MFCC有大大提高。听觉倒谱特征AFCC经过该侧抑制处理得到侧抑制听觉倒谱特征AFCCI,实验表明,该新特征AFCCI鲁律性能比AFCC有大大提高。
收稿时间：	1999-06-09
修稿时间：	2000-05-19
Computational auditory model and its application in robust speech signal recognition

LU Xugang, CHEN Daowen. Computational auditory model and its application in robust speech signal recognition[J]. ACTA ACUSTICA, 2000, 25(6): 492-498. DOI: 10.15949/j.cnki.0371-0025.2000.06.003

Authors:	LU Xugang CHEN Daowen

Abstract:	A computational auditory model is designed based on hearing perception mechanisms,and speech feattire can be extracted by this basic model.This paper is made up of two parts,in the first part,based on the model,an auditory representation as AFCC (Auditory Frequency Cepstral Coefficient) is proposed,thus the vector dimensions can be compressed by this method,also decorrelate each dimensions of the vector.Also a low pass filter is used to simulate the long temporal integration effect of auditory central system.With this low pass filter,new auditory spectrum is extracted,experiments show,the new feature AFCC have high robustness than traditional feature MFCC under HMM recognition system.In the second part,A very simple but efficient model of lateral inhibition is proposed based on lateral inhibition mechanism of auditory perception.MFCCI can be gotten if MFCC is processed by this model,the new representation shows high robustness in speech recognition experiment.AFCCI can be gotten if AFCC is processed by this model,the new represelltation also shows high robustness in speech recognition experiment.

Keywords:
本文献已被 CNKI 等数据库收录！
	点击此处可从《声学学报》浏览原始摘要信息
	点击此处可从《声学学报》下载全文

设为首页 | 免责声明 | 关于勤云 | 加入收藏