BIOINFORMATICS<-->STRUCTURE
Jerusalem, Israel, November 17-21, 1996

Abstract


Three dimensional database of biomacromolecules structure (3DB): a 'multi-tool' based browser as a solution for complex data and complex queries

Jaime Prilusky (1), Joel Sussman (2,3) and Enrique Abola (3)

(1) Bioinformatics Unit, The Weizmann Institute of Science, Rehovot 76100, Israel
(2) Department of Structural Biology, Weizmann Institute of Science, Rehovot 76100, Israel
(3) Protein Data Bank, Biology Department, Brookhaven National Laboratory, Upton, NY 11973 USA

lsprilus@weizmann.weizmann.ac.il

The traditional Protein Data Bank's flat file archival as evolved into a dynamic database based on a Object-Relational Database Management System: 3DB. The development effort attempts to address the needs of the diverse user community served by the PDB and the increasingly richness of solved structures. The 3DB schema supports queries by those interested in answering both crystallographic questions and molecular biology questions. 3DB was designed as an open system, with the expectation that it will shortly be federated with other biological databases, allowing complex queries to be submitted to our database, returning a composite answer built from partial answers that traverse several databases.

A querying tool for accessing this database, 3DB browser , has been developed using readily available elements, like the perl [1] programming language, Web/Genera [2] software toolset and the Glimpse [3] indexing and query system. Users specify search criteria by filling out an HTML form. Software at BNL processes this form and generates the required SQL. System performance is improved by using stored SYBASE SQL procedures that access each predefined object. The fields available are similar to those in our PDBBrowse program, and answer most of the questions that users have been asking.

For those familiar with (or willing to learn about) the OPM protocol [4], access to the object layer will be provided using a high level OPM-based query language. As part of the 3DB open database policy, direct access to the underlying RDBMS will be allowed and actively supported. These queries are not parsed by the 3DB-QA module, so better response time can be expected. This provides third-party developers with the opportunity to either incorporate SQL clients in their products or to learn more of the OPM protocol and, thereby, gain access to all of the benefits that the Object model affords (e.g., active external links, programs, etc.).

The output generator will return query results using a variety of data interchange formats. PDB will continue to support its current format in the foreseeable future. We also plan to extend this format to allow us to represent objects being stored in 3DB. In addition, a "raw format" is being provided which returns an attribute/value pair. This form is easily parsed and is more compact than the PDB format.

REFERENCES

[1]
Larry Wall, Perl (Practical Extraction and Report Language). Related info available from URL: http://www.perl.com/perl/index.html
[2]
Stan Letovsky, Web/Genera, a software toolset. Available from URL: http://gdbdoc.gdb.org/letovsky/genera/genera.html
[3]
Udi Manber, Sun Wu, and Burra Gopal. Glimpse, A tool to search entire file systems. Available from URL: http://glimpse.cs.arizona.edu/
[4]
I. A. Chen, and V. M. Markowitz, An overview of the Object-Protocol Model (OPM) and OPM data management tools, Information Systems 20 (5), 393-418, (1995). Article and related information available at URL: http://gizmo.lbl.gov/DM_TOOLS/OPM/opm.html

Back to the Abstract Index.