首页 | 本学科首页   官方微博 | 高级检索  
     

基于自组织聚类和改进粒子群算法的语音转换方法
引用本文:解伟超, 张玲华. 基于自组织聚类和改进粒子群算法的语音转换方法[J]. 声学学报, 2014, 39(1): 130-136. DOI: 10.15949/j.cnki.0371-0025.2014.01.012
作者姓名:解伟超  张玲华
作者单位:1.南京邮电大学通信与信息工程学院 南京 210003
基金项目:国家自然科学基金(60872105);江苏省“青蓝工程”中青年学术带头人课题;江苏高校优势学科建设工程项目(PAPD)资助
摘    要:提出一种基于自组织聚类,并且利用改进粒子群算法确定转换模型参数的语音转换方法.该方法首先基于自组织特征映射网络对特征参数进行聚类,再对每个聚类分别建立转换规则,并且利用柯西变异的粒子群算法确定每个转换规则中的模型参数.与传统的单一转换规则相比,聚类后建立的多转换规则以及利用改进粒子群算法确定参数能够提高映射关系的准确度,避免参数陷入局部最优点。以女声转男声为例,主观测试表明该方法得到的转换语音与目标的相似度提高了27.6%,平均主观意见分(Mean Opinion Score,MOS)提高了0.6,客观测试也表明该方法谱失真最小,与目标的包络更接近.

关 键 词:柯西变异  改进粒子群算法  转换规则  转换方法  自组织聚类  神经元  语音质量  特征参数  竞争层  模型参数
收稿时间:2012-05-04
修稿时间:2012-09-18

Voice conversion based on self organization clustering and modified particle swarm optimization
JIE Weichao, ZHANG Linghua. Voice conversion based on self organization clustering and modified particle swarm optimization[J]. ACTA ACUSTICA, 2014, 39(1): 130-136. DOI: 10.15949/j.cnki.0371-0025.2014.01.012
Authors:JIE Weichao  ZHANG Linghua
Affiliation:1.College of Telecommunication&Information Engineering, Nanjing Univ. of Posts and Telecommunications Nanjing 210003
Abstract:A method of voice conversion based on self organization clustering and determining parameters of conversion model by modified Particle Swarm Optimization (PSO) is proposed. Firstly, Self Organization Feature Mapping (SOFM) Network is used to cluster the characteristic parameters, and then the conversion rule for each cluster is established, where the parameters of conversion model in each conversion rule are determined by modified PSO with Cauchy mutation. Compared with the single conversion rule in conventional method, the multiple rules established by using clustering and parameters determined by modified PSO can improve the accuracy of mapping function and avoid the model parameters being trapped in the local optimum. The experiments take the conversion from female to male as example, by subjective evaluation, the proposed method increases the similarity by 27.6% through ABX test, and increases the Mean Opinion Score (MOS) by 0.6, and by objective evaluation, the spectral distortion with proposed method is the least. 
Keywords:
本文献已被 CNKI 等数据库收录!
点击此处可从《声学学报》浏览原始摘要信息
点击此处可从《声学学报》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号