首页 | 本学科首页   官方微博 | 高级检索  
     检索      

基于伪标签和迁移学习的双关语识别方法
引用本文:姜思羽,张智恒,姜立标,马乐,陈博远,王连喜,赵亮.基于伪标签和迁移学习的双关语识别方法[J].重庆大学学报(自然科学版),2024,47(2):51-61.
作者姓名:姜思羽  张智恒  姜立标  马乐  陈博远  王连喜  赵亮
作者单位:1.广东外语外贸大学 信息科学与技术学院,广州 510006;2.2a华南理工大学,软件学院,广州 510000;3.2b华南理工大学,机械与汽车工程学院,广州 510000;4.3a广州城市理工学院,机械工程学院,广州510800;5.3b广州城市理工学院,工程研究院,广州510800;6.广东轻工职业技术学院 继续教育学院, 广州 510300
基金项目:广州市科技计划资助项目( 202102020637,202002030227);广东外语外贸大学师生合作资助项目(21SS10)。
摘    要:针对双关语样本短缺问题,研究提出了基于伪标签和迁移学习的双关语识别模型(pun detection based on Pseudo-label and transfer learning)。该模型利用上下文语义、音素向量和注意力机制生成伪标签;然后,迁移学习和置信度结合挑选可用的伪标签;最后,将伪标签数据和真实数据混合到网络中进行训练,重复伪标签标记和混合训练过程。一定程度上解决了双关语样本量少且获取困难的问题。使用该模型在SemEval 2017 shared task 7以及Pun of the Day数据集上进行双关语检测实验,结果表明模型性能均优于现有主流双关语识别方法。

关 键 词:双关语检测  伪标签  迁移学习
收稿时间:2021/6/25 0:00:00

Pun detection basd on pseudo-label and transfer learning
JIANG Siyu,ZHANG Zhiheng,JIANG Libiao,MA Le,CHEN Boyuan,WANG Lianxi,ZHAO Liang.Pun detection basd on pseudo-label and transfer learning[J].Journal of Chongqing University(Natural Science Edition),2024,47(2):51-61.
Authors:JIANG Siyu  ZHANG Zhiheng  JIANG Libiao  MA Le  CHEN Boyuan  WANG Lianxi  ZHAO Liang
Institution:1.School of Information Science and Technology, Guangdong University of Foreign Studies,Guangzhou 510006, P. R. China;2.2a School of SoftwareGuangzhou 510000;3.2bSchool of Mechanical and Automotive Engineering, South China University of Technology, Guangzhou 510000, P. R. China;4.3a School of Mechanical EngineeringGuangzhou;5.3bEngineering Research Institute, Guangzhou City University of Technology, Guangzhou 510800, P. R. China;6.College of Further Education, Guangdong Industry Polytechnic, Guangzhou 510300, P. R. China
Abstract:To address the problem of shortage of the pun samples, this paper proposes a pun recognition model based on pseudo-label speech-focused context (pun detection based on pseudo-label and transfer learning). Firstly, the model uses contextual semantics, phoneme vector and attention mechanism to generate pseudo-labels. Then, it combines transfer learning and confidence to select useful pseudo-labels. Finally, the pseudo-label data and real data are used for network theory and training, and the pseudo-label labeling and mixed training procedures are repeated. To a certain extent, the problem of small sample size and difficulty in obtaining puns has been solved. By this model, we carry out pun detection experiments on both the SemEval 2017 shared task 7 dataset and the Pun of the Day dataset. The results show that the performance of this model is better than that of the existing mainstream pun recognition methods.
Keywords:pun detection  pseudo-label  transfer learning
点击此处可从《重庆大学学报(自然科学版)》浏览原始摘要信息
点击此处可从《重庆大学学报(自然科学版)》下载免费的PDF全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号