首页 | 本学科首页   官方微博 | 高级检索  
     检索      


Iterative Denoising
Authors:Kendall E Giles  Michael W Trosset  David J Marchette  Carey E Priebe
Institution:(1) Department of Statistical Sciences and Operations Research, Virginia Commonwealth University, Richmond, VA 23284, USA;(2) Department of Statistics, Indiana University, Bloomington, IN 47405, USA;(3) Dahlgren Division, Naval Surface Warfare Center, Dahlgren, VA 22448, USA;(4) Department of Applied Mathematics and Statistics, Johns Hopkins University, Baltimore, MD 21218, USA
Abstract:One problem in many fields is knowledge discovery in heterogeneous, high-dimensional data. As an example, in text mining an analyst often wishes to identify meaningful, implicit, and previously unknown information in an unstructured corpus. Lack of metadata and the complexities of document space make this task difficult. We describe Iterative Denoising, a methodology for knowledge discovery in large heterogeneous datasets that allows a user to visualize and to discover potentially meaningful relationships and structures. In addition, we demonstrate the features of this methodology in the analysis of a heterogeneous Science News corpus.
Keywords:Knowledge discovery  Text mining  Classification  Clustering
本文献已被 SpringerLink 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号