Dysarthric speaker identification with different degrees of dysarthria severity using deep belief networks期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

按检索

Dysarthric speaker identification with different degrees of dysarthria severity using deep belief networks

Authors:	Aref Farhadipour Hadi Veisi Mohammad Asgari Mohammad Ali Keyvanrad

Institution:	1. Department of Media Engineering, IRI Broadcast University, Tehran, Iran;2. Faculty of New Sciences and Technologies, University of Tehran, Tehran, Iran;3. Department of Computer Engineering and Information Technology, Amirkabir University of Technology, Tehran, Iran

Abstract:	Dysarthria is a degenerative disorder of the central nervous system that affects the control of articulation and pitch; therefore, it affects the uniqueness of sound produced by the speaker. Hence, dysarthric speaker recognition is a challenging task. In this paper, a feature-extraction method based on deep belief networks is presented for the task of identifying a speaker suffering from dysarthria. The effectiveness of the proposed method is demonstrated and compared with well-known Mel-frequency cepstral coefficient features. For classification purposes, the use of a multi-layer perceptron neural network is proposed with two structures. Our evaluations using the universal access speech database produced promising results and outperformed other baseline methods. In addition, speaker identification under both text-dependent and text-independent conditions are explored. The highest accuracy achieved using the proposed system is 97.3%.

Keywords:	deep belief network deep neural network dysarthria MFCC speaker identification

设为首页 | 免责声明 | 关于勤云 | 加入收藏