fastWKendall: an efficient algorithm for weighted Kendall correlation |
| |
Authors: | Jie Lin Donald A. Adjeroh Bing-Hua Jiang Yue Jiang |
| |
Affiliation: | 1.Faculty of Software,Fujian Normal University,Fuzhou,China;2.Computer Science and Electrical Engineering,West Virginia University,Morgantown,USA;3.Department of Pathology,Carver College of Medicine, the University of Iowa,Iowa,USA |
| |
Abstract: | The Kendall correlation is a non-parametric method that measures the strength of dependence between two sequences. Like Pearson correlation and Spearman correlation, Kendall correlation is widely applied in sequence similarity measurements and cluster analysis. We propose an efficient algorithm, fastWKendall, to compute the approximate weighted Kendall correlation in (O(nlog n)) time and O(n) space complexity. This is an improvement to the state-of-the-art (O(n^2)) time requirement. The proposed method can be incorporated to perform conventional sequential similarity measurement and cluster analysis much more rapidly. This is important for analysis of huge-volume datasets, such as genome databases, streaming stock market data, and publicly available huge datasets on the Internet. The code which is implemented in R is available for public access. |
| |
Keywords: | |
本文献已被 SpringerLink 等数据库收录! |
|