首页 | 本学科首页   官方微博 | 高级检索  
     检索      

多反复结构模型的精确音乐分离方法
引用本文:张天骐,徐昕,吴旺军,刘瑜.多反复结构模型的精确音乐分离方法[J].声学学报,2016,41(1):135-142.
作者姓名:张天骐  徐昕  吴旺军  刘瑜
作者单位:1.重庆邮电大学 信号与信息处理重庆市重点实验室 重庆 400065
基金项目:国家自然科学基金(61371164,61275099,61102131)、信号与信息处理重庆市市级重点实验室建设项目(CSTC2009CA2003)、重庆市杰出青年基金(CSTC2011jjjq40002)、重庆市自然科学基金(CSTC2012JJA40008)、重庆市教育委员会科研项目(KJ120525,KJ130524)和重庆市研究生科研创新项目(CYS14140)资助
摘    要:针对基本反复模型音乐分离方法自适应性差的问题,提出一种基于美标度倒谱系数(MFCC)的多反复结构模型的音乐分离方法。首先,提取出音乐信号的MFCC系数矩阵(39维的数据构成);然后利用余弦特性得到其相似矩阵,进而将相似度一致的片段划分到一起,建立不同的反复结构模型;之后结合理想二元掩蔽(]BM)分离出背景音乐及歌声的频谱,相应的时域信号则由傅里叶逆变换获得;最后,在不同类型、长度的音乐文件上测试了算法性能,将提出的算法与Rafii的反复算法和Ozerov的灵活窗非负矩阵分解方法进行对比。实验结果表明,改进方法在分离性能上最高提高3 dB左右,并且对于曲调变换大的音乐提高效果更为明显,从而证实了改进方法是一种有效的音乐分离方法,并且更具稳定性。

收稿时间:2014-07-14
修稿时间:2014-10-29

Music/voice separation based on the multi-repeating structure of Mel-frequency cepstrum coefficients
ZHANG Tianqi,XU Xin,WU Wangjun,LIU Yu.Music/voice separation based on the multi-repeating structure of Mel-frequency cepstrum coefficients[J].Acta Acustica,2016,41(1):135-142.
Authors:ZHANG Tianqi  XU Xin  WU Wangjun  LIU Yu
Institution:1.Chongqing University of Posts and Telecommunications, Chongqing Key Laboratory of Signal and Information Processing Chongqing 400065
Abstract:For the poor adaptability of the original repeating pattern,an improved music separation method of multirepeating structure of Mel-Frequency Cepstrum Coefficient(MFCC) was proposed.Firstly,the MFCC coefficient matrix(39-dimensional data) of the music signal was extracted;then the cosine characteristic was applied to the count of similarity matrix of MFCC,and putted the fragments with consistent similarity together,next built different repeating patterns for groups with different,thereby the spectrums of the background music and vocal were separated combined with ideal binary masking(IBM),the corresponding time domain signals were obtained by inverse Fourier transform;finally,the improved method was tested on the music database of different types and length,and the separation results were compared with repeating method of Rafii and the non-negative matrix factorization based on flexible framework method of Ozerov.The experimental results showed that the separation performance of improved method was improved about 3 dB,the performance of music with melody changed larger was significantly improved,thus verifying that that the improved method was an effective music separation algorithm and more 
Keywords:
本文献已被 CNKI 等数据库收录!
点击此处可从《声学学报》浏览原始摘要信息
点击此处可从《声学学报》下载免费的PDF全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号