Integrated Access to Genomic and Other Bioinformation: An Essential Ingredient of the Drug Discovery Process |
| |
Authors: | D. Benton |
| |
Affiliation: | 1. National Human Genome Research Institute, National Institutes of Health, 38 Library Drive, MSC 6050, Building 38A, Room 610, Bethesda, MD, 20892-6050, USA; 2. Advanced Information Technology Department, Mail Stop UW2230, SmithKline Beecham Pharmaceuticals, 709 Swedeland Road, PO Box 1539, King of Prussia, PA, 19406-0939, USA |
| |
Abstract: | Due to the high rate of data production and the need of researchers to have rapid access to new data, public databases have become the major medium through which genome mapping and sequencing data as well as macromolecular structural data are published. There are now more than 250 databases of biomolecular, structural, genetic, or phenotypic data, many of which are doubling in size annually. These databases, many of which were created and are maintained by experimentalists for their own research use, provide valuable collections of organized, validated data. However, the very number and diversity of databases now make efficient data resource discovery as important as effective data resource use. Existing autonomous biological databases contain related data which are more valuable when interconnected than when isolated. Political and scientific realities dictate that these databases will be built by different teams, in different locations, for different purposes, and using different data models and supporting DBMSs. As a consequence, connecting the related data they contain is not straightforward. Experience with existing biological databases indicates that it is possible to form useful queries across these databases, but that doing so usually requires expertise in the semantic structure of each source database. Advancing to the next level of integration among biological information resources poses significant technical and sociological challenges. |
| |
Keywords: | Biological databases; information retrieval; heterogeneous databases; database federation |
|
|