首页 | 本学科首页   官方微博 | 高级检索  
     检索      


Cheminformatics analysis and learning in a data pipelining environment
Authors:Moises Hassan  Robert D Brown  Shikha Varma-O’Brien  David Rogers
Institution:(1) SciTegic, Inc., 10188 Telesis Court, Suite 100, San Diego, CA 92121, USA;(2) Accelrys, Inc., 10188 Telesis Court, Suite 100, San Diego, CA 92121, USA
Abstract:Summary Workflow technology is being increasingly applied in discovery information to organize and analyze data. SciTegic's Pipeline Pilot is a chemically intelligent implementation of a workflow technology known as data pipelining. It allows scientists to construct and execute workflows using components that encapsulate many cheminformatics based algorithms. In this paper we review SciTegic's methodology for molecular fingerprints, molecular similarity, molecular clustering, maximal common subgraph search and Bayesian learning. Case studies are described showing the application of these methods to the analysis of discovery data such as chemical series and high throughput screening results. The paper demonstrates that the methods are well suited to a wide variety of tasks such as building and applying predictive models of screening data, identifying molecules for lead optimization and the organization of molecules into families with structural commonality.
Keywords:Bayesian models  bioactivity prediction  data mining  data pipelining  maximal common substructure search  molecular fingerprints  molecular similarity  virtual screening
本文献已被 PubMed SpringerLink 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号