
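Hierarchical Domain Assignment Based on Word-Gloss (基于词汇注释的层次化领域标注)
Cite this article: 朱朝勇, 黄河燕, 史树敏. 基于词汇注释的层次化领域标注[J]. 中国通信学报, 2012, 9(3): 19-27.
Authors: 朱朝勇 (Zhu Chaoyong), 黄河燕 (Huang Heyan), 史树敏 (Shi Shumin)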
Received: 2012-04-20

Hierarchical Domain Assignment Based on Word-Gloss
Zhu Chaoyong, Huang Heyan, Shi Shumin. Hierarchical Domain Assignment Based on Word-Gloss[J]. China communications magazine, 2012, 9(3): 19-27.
Authors: Zhu Chaoyong, Huang Heyan, Shi Shumin
Institution:
1. School of Computer Science and Technology, University of Science and Technology of China, Hefei 230027, P. R. China
2. School of Computer Science and Technology, Beijing Institute of Technology, Beijing 100081, P. R. China
3. Beijing Engineering Research Center of High Volume Language Information Processing & Cloud Computing Applications, Beijing Institute of Technology, Beijing 100081, P. R. China
Abstract: This paper proposes a hierarchical word domain assignment algorithm that automatically builds domain dictionaries from a Machine-Readable Dictionary (MRD). Word domain assignment proceeds in three steps: 1) constructing the hierarchical structure; 2) training the classifiers; 3) assigning domains to words. Compared with traditional methods, the hierarchical algorithm improves the accuracy of word domain assignment while reducing the human effort needed to collect a corpus. Experiments on WordNet 2.0 show that, with gloss-based word domain assignment, 62.53% of the first domain labels match those in WordNet Domains 3.0, and that performance can be further improved by exploiting the hierarchical relationships among the domain sets.
Keywords: natural language processing; domain dictionary; hierarchical classification; domain assignment; WordNet; MRD
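
The three steps named in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not say which classifier or features are used, so this sketch assumes a bag-of-words scorer with Laplace smoothing and a top-down descent through the hierarchy. The domain tree `HIERARCHY`, the labelled glosses in `TRAIN`, and the function names are all hypothetical and chosen only for illustration.

```python
import math
from collections import Counter, defaultdict

# Step 1 (assumed): a toy domain hierarchy, parent -> children.
# These domains are hypothetical and are not taken from the paper.
HIERARCHY = {
    "ROOT": ["science", "sport"],
    "science": ["biology", "physics"],
    "sport": ["football", "tennis"],
}

# Hypothetical labelled glosses: (gloss text, leaf domain).
TRAIN = [
    ("study of living organisms and cells", "biology"),
    ("organ of a plant or animal cells tissue", "biology"),
    ("study of matter energy and force", "physics"),
    ("unit of energy or force in mechanics", "physics"),
    ("game played with a ball and goal by two teams", "football"),
    ("player who kicks the ball toward the goal", "football"),
    ("game played with a racket and a net", "tennis"),
    ("stroke that hits the ball with a racket over the net", "tennis"),
]


def tokenize(text):
    """Lowercase and split on whitespace."""
    return text.lower().split()


def leaves_under(node):
    """Return all leaf domains below a node (the node itself if it is a leaf)."""
    children = HIERARCHY.get(node)
    if not children:
        return [node]
    leaves = []
    for child in children:
        leaves.extend(leaves_under(child))
    return leaves


def train(examples):
    """Step 2 (assumed): collect word counts per leaf domain from the glosses."""
    counts = defaultdict(Counter)
    for gloss, leaf in examples:
        counts[leaf].update(tokenize(gloss))
    vocab = {word for counter in counts.values() for word in counter}
    return counts, vocab


def score(node, tokens, counts, vocab):
    """Smoothed log-likelihood of the tokens under the node's pooled counts."""
    pooled = Counter()
    for leaf in leaves_under(node):
        pooled.update(counts[leaf])
    total = sum(pooled.values())
    return sum(
        math.log((pooled[t] + 1) / (total + len(vocab)))
        for t in tokens
        if t in vocab  # words never seen in training are ignored
    )


def assign_domain(gloss, counts, vocab):
    """Step 3 (assumed): descend the hierarchy, picking the best child each time."""
    tokens = tokenize(gloss)
    node, path = "ROOT", []
    while node in HIERARCHY:
        node = max(HIERARCHY[node], key=lambda c: score(c, tokens, counts, vocab))
        path.append(node)
    return path


counts, vocab = train(TRAIN)
print(assign_domain("a racket used to hit the ball over the net", counts, vocab))
# ['sport', 'tennis']
```

In this sketch a decision at an upper level pools the evidence of every domain beneath it, which is one simple way a hierarchy can inform the final label; the paper's actual mechanism may differ.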