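Prominence analysis and synthesis of emphasis in Putonghua (汉语重音的凸显度分析与合成)
Cite this article:MENG Fanbo, WU Zhiyong, JIA Jia, CAI Lianhong. 汉语重音的凸显度分析与合成[J]. 声学学报 (Acta Acustica), 2015, 40(1): 1-11. DOI: 10.15949/j.cnki.0371-0025.2015.01.001
Authors:MENG Fanbo  WU Zhiyong  JIA Jia  CAI Lianhong
Affiliation:1. Key Laboratory of Pervasive Computing (Ministry of Education), Tsinghua National Laboratory for Information Science and Technology, Department of Computer Science and Technology, Tsinghua University, Beijing 100084
Funding:Supported by the National Basic Research Program of China (973 Program, 2013CB329304), the National Natural Science Foundation of China (61375027, 61370023), the Research Grants Council of the Hong Kong Government (N-CUHK414/09), and the National Social Science Foundation of China (13&ZD189)
Abstract:Emphasis is an important intonational feature, and emphasis synthesis can improve the naturalness and expressiveness of speech. To capture the local prominence of emphasis, this paper proposes a representation of the prominence of acoustic features and analyzes the acoustic-feature prominence of emphasized syllables at different prosodic positions (initial, medial, and final in a prosodic word; initial, medial, and final in a prosodic phrase; and so on). The analysis finds that for emphasis at the end of a prosodic unit (the final syllable of a prosodic word, or the final prosodic word of a prosodic phrase), the prominence of the maximum F0 is lower than for emphasis that is not at the end of a prosodic unit. On this basis, the paper proposes a nonlinear algorithm that generates the acoustic parameters of emphasis from acoustic-feature prominence, which addresses the insufficient or excessive modification produced by traditional linear modification algorithms. Using this algorithm, a hidden Markov model (HMM) based speech synthesis system that supports emphasis synthesis was built. Experiments show that the system can effectively synthesize emphasized speech and improves the naturalness and expressiveness of the synthesized speech.

Keywords:speech synthesis  emphasis synthesis  prominence of acoustic features  prosodic position
Received:2013-07-21
Revised:2014-03-20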

The prominence analysis and synthesis of emphasis in Putonghua
MENG Fanbo, WU Zhiyong, JIA Jia, CAI Lianhong. The prominence analysis and synthesis of emphasis in Putonghua[J]. ACTA ACUSTICA, 2015, 40(1): 1-11. DOI: 10.15949/j.cnki.0371-0025.2015.01.001
Authors:MENG Fanbo  WU Zhiyong  JIA Jia  CAI Lianhong
Affiliation:1. Tsinghua National Laboratory for Information Science and Technology, Department of Computer Science and Technology, Tsinghua University, Beijing 100084; 2. Tsinghua-CUHK Joint Research Center for Media Sciences, Technologies and Systems, Graduate School at Shenzhen, Tsinghua University, Shenzhen 518055; 3. Department of Systems Engineering and Engineering Management, The Chinese University of Hong Kong, Hong Kong
Abstract:Emphasis is an important feature of intonation, and emphatic speech synthesis can improve the naturalness and expressiveness of synthesized speech. This paper defines the prominence of acoustic features and analyzes the acoustic-feature prominence of emphasized syllables at different prosodic locations, e.g. the head, body, and tail of prosodic words and prosodic phrases. It finds that the prominence of the maximum F0 of emphasized syllables at the end of a prosodic unit (prosodic word or prosodic phrase) is lower than that of other emphasized syllables. Based on this analysis, a nonlinear parameter generation algorithm for emphasized syllables, driven by the prominence of the acoustic features, is proposed. It avoids the insufficient or excessive modification produced by the traditional linear modification algorithm. An emphatic speech synthesis system based on the hidden Markov model (HMM) was built with the proposed algorithm. Experiments demonstrate that the system can synthesize emphasized speech and improve the naturalness and expressiveness of the synthesized speech.
Keywords:speech synthesis  emphasis synthesis  prominence of acoustic feature  prosody location
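
The abstract does not give the formula for acoustic-feature prominence or for the parameter generation algorithm, so the following is only an illustrative sketch under stated assumptions, not the authors' method. It assumes prominence is the ratio of a syllable's feature value (here maximum F0 in Hz) to the mean of its neighbouring syllables, and it rescales an emphasized syllable to reach a target prominence instead of adding a fixed offset. The function names, the `window` parameter, and all numeric values are hypothetical.

```python
from statistics import mean


def neighbour_mean(values, i, window=1):
    """Mean feature value of the syllables within `window` positions of i,
    excluding syllable i itself."""
    lo = max(0, i - window)
    hi = min(len(values), i + window + 1)
    neighbours = [v for j, v in enumerate(values[lo:hi], start=lo) if j != i]
    if not neighbours:
        raise ValueError("need at least one neighbouring syllable")
    return mean(neighbours)


def prominence(values, i, window=1):
    """ASSUMED definition: syllable i's value relative to its local context."""
    return values[i] / neighbour_mean(values, i, window)


def emphasize(values, i, target, window=1):
    """Return a copy of `values` in which syllable i is rescaled so that its
    prominence (as defined above) equals `target`. The size of the change
    therefore depends on the local context, unlike a fixed additive offset."""
    out = list(values)
    out[i] = target * neighbour_mean(values, i, window)
    return out


if __name__ == "__main__":
    # Hypothetical per-syllable maximum F0 values in Hz.
    max_f0 = [200.0, 220.0, 210.0, 190.0]
    print(prominence(max_f0, 1))
    print(emphasize(max_f0, 1, target=1.5))
```

Under this sketch, the abstract's finding that emphasis at the end of a prosodic unit shows lower maximum-F0 prominence would correspond to passing a smaller `target` for unit-final syllables than for others; the actual target values would have to come from the paper's corpus analysis.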