首页 | 本学科首页   官方微博 | 高级检索  
     


Multiple outlier detection in multivariate data using self-organizing maps title
Authors:Ashok K. Nag  Amit Mitra  Sharmishtha Mitra
Affiliation:(1) Department of Statistical Analysis & Computer Services, Reserve Bank of India, 400 051 Mumbai;(2) Department of Mathematics, Indian Institute of Technology, Bombay, Powai, Mumbai, 400 076
Abstract:Summary  The problem of detection of multidimensional outliers is a fundamental and important problem in applied statistics. The unreliability of multivariate outlier detection techniques such as Mahalanobis distance and hat matrix leverage has led to development of techniques which have been known in the statistical community for well over a decade. The literature on this subject is vast and growing. In this paper, we propose to use the artificial intelligence technique ofself-organizing map (SOM) for detecting multiple outliers in multidimensional datasets. SOM, which produces a topology-preserving mapping of the multidimensional data cloud onto lower dimensional visualizable plane, provides an easy way of detection of multidimensional outliers in the data, at respective levels of leverage. The proposed SOM based method for outlier detection not only identifies the multidimensional outliers, it actually provides information about the entire outlier neighbourhood. Being an artificial intelligence technique, SOM based outlier detection technique is non-parametric and can be used to detect outliers from very large multidimensional datasets. The method is applied to detect outliers from varied types of simulated multivariate datasets, a benchmark dataset and also to real life cheque processing dataset. The results show that SOM can effectively be used as a useful technique for multidimensional outlier detection.
Keywords:Artificial intelligence  minimum covariance determinant  minimum volume ellipsoid  multivariate outliers  robust estimation  self-organizing maps  unified distance matrix
本文献已被 SpringerLink 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号