首页 | 本学科首页   官方微博 | 高级检索  
     


An Algorithm Framework for Drug-Induced Liver Injury Prediction Based on Genetic Algorithm and Ensemble Learning
Authors:Bowei Yan  Xiaona Ye  Jing Wang  Junshan Han  Lianlian Wu  Song He  Kunhong Liu  Xiaochen Bo
Affiliation:1.Department of Bioinformatics, Institute of Health Service and Transfusion Medicine, Beijing 100850, China; (B.Y.); (J.H.);2.Department of Biotechnology, Institute of Radiation Medicine, Beijing 100850, China;3.School of Informatics, Xiamen University, Xiamen 361005, China;4.School of Medicine, Tsinghua University, Beijing 100084, China;5.Institute of Medical Engineering and Translational Medicine, Tianjin University, Tianjin 300072, China;
Abstract:
In the process of drug discovery, drug-induced liver injury (DILI) is still an active research field and is one of the most common and important issues in toxicity evaluation research. It directly leads to the high wear attrition of the drug. At present, there are a variety of computer algorithms based on molecular representations to predict DILI. It is found that a single molecular representation method is insufficient to complete the task of toxicity prediction, and multiple molecular fingerprint fusion methods have been used as model input. In order to solve the problem of high dimensional and unbalanced DILI prediction data, this paper integrates existing datasets and designs a new algorithm framework, Rotation-Ensemble-GA (R-E-GA). The main idea is to find a feature subset with better predictive performance after rotating the fusion vector of high-dimensional molecular representation in the feature space. Then, an Adaboost-type ensemble learning method is integrated into R-E-GA to improve the prediction accuracy. The experimental results show that the performance of R-E-GA is better than other state-of-art algorithms including ensemble learning-based and graph neural network-based methods. Through five-fold cross-validation, the R-E-GA obtains an ACC of 0.77, an F1 score of 0.769, and an AUC of 0.842.
Keywords:DILI   genetic algorithm   ensemble learning   PCA/MCA   QSAR   molecular representation
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号