Iterative Denoising |
| |
Authors: | Kendall E Giles Michael W Trosset David J Marchette Carey E Priebe |
| |
Institution: | (1) Department of Statistical Sciences and Operations Research, Virginia Commonwealth University, Richmond, VA 23284, USA;(2) Department of Statistics, Indiana University, Bloomington, IN 47405, USA;(3) Dahlgren Division, Naval Surface Warfare Center, Dahlgren, VA 22448, USA;(4) Department of Applied Mathematics and Statistics, Johns Hopkins University, Baltimore, MD 21218, USA |
| |
Abstract: | One problem in many fields is knowledge discovery in heterogeneous, high-dimensional data. As an example, in text mining an
analyst often wishes to identify meaningful, implicit, and previously unknown information in an unstructured corpus. Lack
of metadata and the complexities of document space make this task difficult. We describe Iterative Denoising, a methodology
for knowledge discovery in large heterogeneous datasets that allows a user to visualize and to discover potentially meaningful
relationships and structures. In addition, we demonstrate the features of this methodology in the analysis of a heterogeneous
Science News corpus. |
| |
Keywords: | Knowledge discovery Text mining Classification Clustering |
本文献已被 SpringerLink 等数据库收录! |
|