
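A spam filtering algorithm based on minimum loss
Cite as: Zou Lei, Lu Yansheng, Cui Dexuan, Hu Rong. A spam filtering algorithm based on minimum loss [J]. Journal of Huazhong University of Science and Technology (Natural Science Edition), 2005, 33(Z1): 352-355.
Authors: Zou Lei, Lu Yansheng, Cui Dexuan, Hu Rong
Affiliation: College of Computer Science and Technology, Huazhong University of Science and Technology, Wuhan, Hubei 430074
Abstract: Taking the characteristics of anti-spam filtering into account, existing text classification techniques are applied to spam filtering. Because misclassifying a legitimate message as spam costs the mail user clearly more than the reverse error, a loss function is defined and combined with the naive Bayes algorithm, yielding a minimum-loss spam filtering algorithm. Experiments on a widely recognized spam data set confirm that introducing the loss function is effective.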

Keywords: spam filtering; minimum loss; Bayesian classification; text classification
Article ID: 1671-4512(2005)S1-0352-04
Revision date: September 1, 2005

An anti-spam filtering algorithm based on cost minimization
Zou Lei, Lu Yansheng, Cui Dexuan, Hu Rong. An anti-spam filtering algorithm based on cost minimization [J]. JOURNAL OF HUAZHONG UNIVERSITY OF SCIENCE AND TECHNOLOGY. NATURE SCIENCE, 2005, 33(Z1): 352-355.
Authors: Zou Lei, Lu Yansheng, Cui Dexuan, Hu Rong
Institution: College of Computer Sci. & Tech., Huazhong Univ. of Sci. & Tech., Wuhan 430074, China.
Abstract: Given the characteristics of anti-spam filtering, text categorization techniques are applied to the problem. Since the cost of mistaking a legitimate mail for spam is clearly higher than the cost of the reverse error, a cost function is defined. Combining the cost function with the Naive Bayes algorithm, an anti-spam filtering algorithm based on cost minimization is presented. Experimental results on a well-known spam collection confirm the effectiveness of introducing the cost function.
Keywords: anti-spam filtering; cost minimizing; Bayes categorization; text categorization
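
The abstract does not give the form of the cost function, so the following is only a minimal sketch of the general idea, not the authors' implementation. It assumes a multinomial naive Bayes model with Laplace smoothing and a single hypothetical parameter `fp_cost`, the cost of flagging a legitimate mail as spam relative to a cost of 1 for letting a spam through. A mail is labelled spam only when that choice has the lower expected cost, that is, when P(spam | x) > fp_cost * P(legit | x). The toy training data and all function names are illustrative assumptions.

```python
import math
from collections import Counter

def train(docs):
    """Count words per class. docs: list of (tokens, label), label in {"spam", "legit"}."""
    word_counts = {"spam": Counter(), "legit": Counter()}
    doc_counts = Counter()
    for tokens, label in docs:
        doc_counts[label] += 1
        word_counts[label].update(tokens)
    vocab = set(word_counts["spam"]) | set(word_counts["legit"])
    return word_counts, doc_counts, vocab

def log_joint(tokens, label, model):
    """log P(label) + sum of log P(token | label), with Laplace smoothing."""
    word_counts, doc_counts, vocab = model
    total_words = sum(word_counts[label].values())
    score = math.log(doc_counts[label] / sum(doc_counts.values()))
    for token in tokens:
        if token in vocab:  # tokens never seen in training are ignored
            score += math.log((word_counts[label][token] + 1)
                              / (total_words + len(vocab)))
    return score

def classify(tokens, model, fp_cost=9.0):
    """Pick the label with the lower expected cost.

    fp_cost is a hypothetical parameter (not taken from the paper):
    expected cost of answering "spam"  = fp_cost * P(legit | x)
    expected cost of answering "legit" = 1       * P(spam  | x)
    """
    log_spam = log_joint(tokens, "spam", model)
    log_legit = log_joint(tokens, "legit", model)
    return "spam" if log_spam > math.log(fp_cost) + log_legit else "legit"

# Toy data, invented purely for illustration.
TOY_DOCS = [
    (["cheap", "pills", "offer"], "spam"),
    (["win", "cash", "offer"], "spam"),
    (["meeting", "agenda", "notes"], "legit"),
    (["project", "meeting", "offer"], "legit"),
]
model = train(TOY_DOCS)

borderline = ["cheap", "offer"]
print(classify(borderline, model, fp_cost=1.0))  # plain naive Bayes decision
print(classify(borderline, model, fp_cost=9.0))  # cost-sensitive decision
```

With `fp_cost=1.0` the rule reduces to ordinary naive Bayes. Raising `fp_cost` moves borderline mails back to the legitimate class, trading some missed spam for fewer lost legitimate mails, which is the asymmetry the abstract motivates.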