首页 | 本学科首页   官方微博 | 高级检索  
     检索      


A High-Performance and Flexible Chemical Structure & Data Search Engine Built on CouchDB & ElasticSearch
Authors:Ren-zhi Li  Bo-jie Li  Guo-zhen Zhang  Jun Jiang  Yi Luo
Institution:Hefei National Laboratory for Physical Sciences at the Microscale, School of Chemistry and Materials Science, University of Science and Technology of China, Hefei 230026, China
Abstract:Computer-assisted chemical structure searching plays a critical role for efficient structure screening in cheminformatics. We designed a high-performance chemical structure & data search engine called DCAIKU, built on CouchDB and ElasticSearch engines. DCAIKU converts the chemical structure similarity search problem into a general text search problem to utilize off-the-shelf full-text search engines. DCAIKU also supports flexible document structures and heterogeneous datasets with the help of schema-less document database. Our evaluations show that DCAIKU can handle both keyword search and structural search against millions of records with both high accuracy and low latency. We expect that DCAIKU will lay the foundation towards large-scale and cost-effective structural search in materials science and chemistry research.
Keywords:Search engine  Cheminformatics  Structural search  Schema-less databases
点击此处可从《化学物理学报(中文版)》浏览原始摘要信息
点击此处可从《化学物理学报(中文版)》下载免费的PDF全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号