A frequency bin-wise nonlinear masking algorithm in convolutive mixtures for speech segregation
Authors:Chi Tai-Shih  Huang Ching-Wen  Chou Wen-Sheng
Institution:Department of Electrical Engineering, National Chiao Tung University, Hsinchu 300, Taiwan. tschi@mail.nctu.edu.tw
Abstract:A frequency bin-wise nonlinear masking algorithm operating in the spectrogram domain is proposed for speech segregation in convolutive mixtures. The contribution of each speech source to a time-frequency unit of the mixture spectrogram is estimated as a weight by a nonlinear function of location cues. For each sound source, a non-binary mask is formed from the estimated weights and multiplied with the mixture spectrogram to extract that source. Head-related transfer functions (HRTFs) are used to simulate the convolutive sound mixtures perceived by listeners. Simulation results show that the proposed method outperforms convolutive independent component analysis (ICA) and the degenerate unmixing estimation technique (DUET) in almost all test conditions.
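The masking step described in the abstract can be illustrated with a minimal NumPy sketch. The abstract does not specify the nonlinear function or the exact location cues, so the Gaussian weighting, the `sigma` parameter, and the function names `soft_masks` and `apply_masks` below are all assumptions made for illustration, not the authors' method.

```python
import numpy as np

def soft_masks(observed_cue, source_cues, sigma=0.5):
    """Form non-binary masks from a location cue, one frequency bin at a time.

    observed_cue: (F, T) location cue measured at each time-frequency unit.
    source_cues:  (N, F) cue expected from each of N sources in each bin.
    sigma:        width of the weighting function (hypothetical parameter).
    Returns an (N, F, T) array of masks that sum to ~1 across sources.
    """
    # Deviation of the observed cue from each source's expected cue, per bin.
    dist = observed_cue[None, :, :] - source_cues[:, :, None]
    # Assumed nonlinear function: Gaussian of the deviation (not from the paper).
    weights = np.exp(-0.5 * (dist / sigma) ** 2)
    # Normalize so each unit's weights act as contribution fractions.
    return weights / (weights.sum(axis=0, keepdims=True) + 1e-12)

def apply_masks(mixture_spec, masks):
    """Multiply each source's mask with the (F, T) mixture spectrogram."""
    return masks * mixture_spec[None, :, :]
```

In this sketch a time-frequency unit whose cue matches one source closely is assigned mostly to that source, while ambiguous units are shared between sources rather than being forced into a binary decision.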
This article is indexed in PubMed and other databases.