首页 | 本学科首页   官方微博 | 高级检索  
     检索      


Sequential imputation for missing values
Authors:Verboven Sabine  Branden Karlien Vanden  Goos Peter
Institution:

aUniversity of Antwerp, Department of Mathematics, Statistics & Actuarial Sciences, Prinsstraat 13, 2000 Antwerp, Belgium

bJoint Research Centre, TP 361, 21020 Ispra, VA, Italy

Abstract:As missing values are often encountered in gene expression data, many imputation methods have been developed to substitute these unknown values with estimated values. Despite the presence of many imputation methods, these available techniques have some disadvantages. Some imputation techniques constrain the imputation of missing values to a limited set of genes, whereas other imputation methods optimise a more global criterion whereby the computation time of the method becomes infeasible. Others might be fast but inaccurate. Therefore in this paper a new, fast and accurate estimation procedure, called SEQimpute, is proposed. By introducing the idea of minimisation of a statistical distance rather than a Euclidean distance the method is intrinsically different from the thus far existing imputation methods. Moreover, this newly proposed method can be easily embedded in a multiple imputation technique which is better suited to highlight the uncertainties about the missing value estimates. A comparative study is performed to assess the estimation of the missing values by different imputation approaches. The proposed imputation method is shown to outperform some of the existing imputation methods in terms of accuracy and computation speed.
Keywords:Missing genes  Microarray data  Imputation methods
本文献已被 ScienceDirect PubMed 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号