首页 | 本学科首页   官方微博 | 高级检索  
     


A semi-supervised learning approach for detection of phishing webpages
Authors:Yuancheng Li  Rui Xiao  Jingang Feng  Liujun Zhao
Affiliation:1. School of Control and Computer Engineering, North China Electric Power University, Beijing 102206, PR China;2. State Key Laboratory of Alternate Electrical Power System with Renewable Energy Sources (North China Electric Power University), Beijing 102206, PR China
Abstract:This paper proposes a new phishing webpage detection approach based on a kind of semi-supervised learning method-transductive support vector machine (TSVM). Firstly the features of web image are extracted for complementing the disadvantage of phishing detection only based on document object model (DOM); they include gray histogram, color histogram, and spatial relationship between subgraphs. Then the features of sensitive information are examined by using page analysis based on DOM objects. In contrast to the drawback of support vector machine (SVM) algorithm which simply trains classifier by learning little and poor representative labeled samples, this method introduces the TSVM to train classifier that it takes into account the distribution information implicitly embodied in the large quantity of the unlabeled samples, and have better performance than SVM. The experimental results show that the proposed method not only achieves better classification accuracy, but also has strong applicability as the independent method of phishing detection.
Keywords:Phishing webpage detection   Web image   Features extraction   TSVM   Classifier
本文献已被 ScienceDirect 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号