|
|
 |
|
 |
Volume 40, Number 2, 2001
Deep computing for the life sciences |
|
Table of contents: HTML PDF ASCII |
|
This article: HTML PDF ASCII |
Copyright info |
 |
 |
 |
 |
| |
|
An integration platform for heterogeneous bioinformatics software components - References |
 |
by A. C. Siepel A. N. Tolopko, A. D. Farmer, P. A. Steadman, F. D. Schilkey, B. D. Perry, and W. D. Beavis |
 |
 |
 |
Cited references and notes
-
M. Shaw and D. Garlan, Software Architecture: Perspectives on an Emerging Discipline, Prentice Hall, Upper Saddle River, NJ (1996).
-
Comparative genomics is a broad term used here in reference to various techniques for learning about one organism's genome through comparison to a better-studied relative.
-
K. Sirotkin, NCBI: Integrated Data for Molecular Biology Research, Bioinformatics: Databases and Systems, S. Letovsky, Editor, Kluwer Academic Publishers, Norwell, MA (1999), pp. 1119.
-
R. Unwin, J. Fenton, M. Whitsitt, C. Jamison, M. Stupar, E. Jakobsson, and S. Subramaniam, Biology Workbench: A Computing and Analysis Environment for the Biological Sciences, Bioinformatics: Databases and Systems, S. Letovsky, Editor, Kluwer Academic Publishers, Norwell, MA (1999), pp. 233244.
-
T. Etzold, A. Ulyanov, and P. Argos, SRS: Information Retrieval System for Molecular Biology Data Banks, Methods in Enzymology 266, 114128 (1996).
-
R. F. Smith, B. A. Wiese, M. K. Wojzynski, D. B. Davison, and K. C. Worley, BCM Search LauncherAn Integrated Interface to Molecular Biology Data Base Search and Analysis Services, Genome Research 6, No. 5, 454462 (1996).
-
Entigen Corporation (formerly eBioinformatics, Inc., and Empatheon, Inc.), http://www.bionavigator.com.
-
DoubleTwist, Inc., http://www.doubletwist.com.
-
V. M. Markowitz, I. M. A. Chen, A. Kosky, and E. Szeto, OPM: Object-Protocol Model Data Management Tools '97, Bioinformatics: Databases and Systems, S. Letovsky, Editor, Kluwer Academic Publishers, Norwell, MA (1999), pp. 187199.
-
S. B. Davidson, O. P. Buneman, J. Crabtree, V. Tannen, G. C. Overton, and L. Wong, BioKleisli: Integrating Biomedical Data and Analysis Packages, Bioinformatics: Databases and Systems, S. Letovsky, Editor, Kluwer Academic Publishers, Norwell, MA (1999), pp. 201211.
-
P. G. Baker, A. Brass, S. Bechhofer, C. Goble, N. Paton, and R. Stevens, TAMBIS: Transparent Access to Multiple Bioinformatics Information Sources, Proceedings, Sixth International Conference on Intelligent Systems for Molecular Biology, Montreal, Canada (June 28July 1, 1998), pp. 2534.
-
Mapping one database schema to another, even with the aid of human-defined meta-data, is an extremely difficult problem, unless the semantics and syntactic conventions of both schemas are formally defined and rigidly obeyed (which is generally not true of bioinformatics databases). This is one of the main motivations for the looser data model we have used.
-
We elaborate on this claim in: A. Siepel, A. Farmer, A. Tolopko, M. Zhuang, P. Mendes, W. Beavis, and B. Sobral, ISYS: A Decentralized, Component-Based Approach to the Integration of Heterogeneous Bioinformatics Resources, Bioinformatics 17, No. 1, 8394 (2001). Note, however, that some query systems for heterogeneous databases have created advanced graphical user interfaces that do support exploratory querying to a degree. Examples are GeneLogic, Inc.'s Object-Protocol Model (see Reference 9) and IBM's PESTO (M. J. Carey, L. M. Haas, V. Maganty, and J. H. Williams, PESTO: An Integrated Query/Browser for Object Databases, Proceedings 22nd International Conference on Very Large Databases (VLDB), Bombay, India (September 36, 1996), pp. 203214.
-
Life Sciences Research Task Force of the Object Management Group, http://www.omg.org/homepages/lsr.
-
C. Szyperski, Component Software: Beyond Object-Oriented Programming, Addison-Wesley Longman Limited, Essex, England (1998).
-
K. Jungfer, G. Cameron, and T. Flores, EBI: CORBA and the EBI Databases, Bioinformatics: Databases and Systems, S. Letovsky, Editor, Kluwer Academic Publishers, Norwell, MA (1999), pp. 245254.
-
NetGenics, Inc., http://www.netgenics.com.
-
Synomics, Ltd., http://www.synomics.com.
-
J. Hu, C. Mungall, D. Nicholson, and A. L. Archibald, Design and Implementation of a CORBA-Based Genome Mapping System Prototype, Bioinformatics 14, No. 2, 112120 (1998).
-
H. R. Pagels, Dreams of Reason: The Computer and the Rise of the Sciences of Complexity, Simon and Schuster, New York (1988).
-
S. Levy, Artificial Life: A Report from the Frontier Where Computers Meet Biology, Random House, New York (1992).
-
In contrast to the elements of cellular automata and Boolean networks, the components in ISYS are not necessarily simple; but they encapsulate their complexity and interact with one another in simple ways.
-
S. F. Altschul, W. Gish, W. Miller, E. W. Myers, and D. L. Lipman, Basic Local Alignment Search Tool, Journal of Molecular Biology 215, No. 3, 403410 (1990).
-
J. D. Thompson, D. G. Higgins, and T. J. Gibson, CLUSTAL W: Improving the Sensitivity of Progressive Multiple Sequence Alignment through Sequence Weighting, Position-Specific Gap Penalties and Weight Matrix Choice, Nucleic Acids Research 22, 46734680 (1994).
-
JalView, http://www2.ebi.ac.uk/~michele/jalview/.
-
M. P. Skupski, M. Booker, A. Farmer, M. Harpold, W. Huang, J. Inman, D. Kiphart, C. Kodira, S. Root, F. Schilkey, J. Schwertfeger, A. Siepel, D. Stamper, N. Thayer, R. Thompson, J. Wortman, M. Zhuang, and C. Harger, The Genome Sequence DataBase: Toward an Integrated Functional Genomics Resource, Nucleic Acids Research 27, 3538 (1999).
-
This capability is perhaps more powerful than the example here suggests. A slightly better illustration appears in Reference 13.
-
The EventChannel and ServiceBroker components represent a division into two parts of the component called the ClientBus in Reference 13, Siepel et al. (2000).
-
E. Gamma, R. Helm, R. Johnson, and J. Vlissides, Design Patterns: Elements of Reusable Object-Oriented Software, Addison-Wesley Publishing Co., Reading, MA (1995).
-
F. Buschmann, R. Meunier, H. Rohnert, P. Sommerlad, and M. Stal, A System of Patterns: Pattern-Oriented Software Architecture, John Wiley & Sons Ltd., Chichester, England (1996).
-
CORBA: Common Request Broker Architecture and Specification, available from the Object Management Group (1996). See http://www.omg.org/ for contact information.
-
The latter is called dynamic discovery in Reference 13.
-
Clement Szyperski defines components as XXX. We have taken the liberty of allowing that while they may be independently executable, they need not be independently executing. In fact, ISYS components are JAR files that may have, but are not required to have, classes with main methods that allow execution with or without the ISYS platform.
-
P. Karp, A Strategy for Database Interoperation, Journal of Computational Biology 2, No. 4, 573586 (1995).
-
V. M. Markowitz and O. Ritter, Characterizing Heterogeneous Molecular Biology Database Systems, Journal of Computational Biology 2, No. 4, 547556 (1995).
-
S. B. Davidson, C. Overton, and P. Buneman, Challenges in Integrating Biological Data Sources, Journal of Computational Biology 2, No. 4, 557572 (1995).
-
See Reference 13, Siepel et al.
-
Although the IsysObject interface does not explicitly support this capability, it is possible to implement an IsysObject such that new attributes can be added later in its life. In this way, an object can gain attributes as new data are produced by the system.
-
Space does not allow a full discussion of attribute groups. Essentially, they are necessary to maintain a sense of which attributes describe the same entities when networks of objects are flattened (as discussed later in the text). See the ISYS Web site for additional information.
-
A full treatment of the issue of the data model is not possible here. As they create their components, developers will be able to contribute attributes to the data model, if it does not meet their needs. Thus, the model for object attributes and relationships will evolve according to the needs of individual component developers.
|
 |
|
|