首页 | 本学科首页   官方微博 | 高级检索  
     检索      


The role of human in the loop: lessons from D3R challenge 4
Authors:Oleg V Stroganov  Fedor N Novikov  Michael G Medvedev  Artem O Dmitrienko  Igor Gerasimov  Igor V Svitanko  Ghermes G Chilov
Institution:1.N. D. Zelinsky Institute of Organic Chemistry RAS, Leninsky Prospect, 47, 119991, Moscow, Russian Federation;2.Skolkovo Innovation Center, Molecular Technologies, LLC, Russian Federation, Bolshoi blv., 42(1), 121205, Moscow, Russia;3.BioMolTech Corp, 226 York Mills Road, M2L 1L1, Toronto, Ontario, Canada;4.A. N. Nesmeyanov Institute of Organoelement Compounds RAS, Russian Federation, Vavilov str., 28, 119991, Moscow, Russia;5.National Research University Higher School of Economics (HSE), Myasnitskaya str., 20, 101000, Moscow, Russian Federation;6.Department of Chemistry, Lomonosov Moscow State University, Leninskie Gory 1/3, 119991, Moscow, Russian Federation;7.National University of Science and Technology ?MISiS?, 119490, Moscow, Russian Federation
Abstract:The rapid development of new machine learning techniques led to significant progress in the area of computer-aided drug design. However, despite the enormous predictive power of new methods, they lack explainability and are often used as black boxes. The most important decisions in drug discovery are still made by human experts who rely on intuitions and simplified representation of the field. We used D3R Grand Challenge 4 to model contributions of human experts during the prediction of the structure of protein–ligand complexes, and prediction of binding affinities for series of ligands in the context of absence or abundance of experimental data. We demonstrated that human decisions have a series of biases: a tendency to focus on easily identifiable protein–ligand interactions such as hydrogen bonds, and neglect for a more distributed and complex electrostatic interactions and solvation effects. While these biases still allow human experts to compete with blind algorithms in some areas, the underutilization of the information leads to significantly worse performance in data-rich tasks such as binding affinity prediction.
Keywords:
本文献已被 SpringerLink 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号