10.4. A Genetic Information Model

Let's envision a genetic-based information model. One method of basing our model is to start with a DNA sequence. This base is used by NCBI because GenBank is commissioned as a sequence database as opposed to a protein database such as Swiss-Prot. It is instructive to follow this starting point and see where it leads in terms of the variety of information that can be attached to a sequence that contains a gene. The information model and some of the data within the model will be somewhat contrived to bring out the key points. A detailed and accurate biological model is beyond the scope of this chapter.

Listing 10.1 is an XML document derived from an NCBI DNA sequence entry. For each sequence, a name or definition ...

Get XML Data Management: Native XML and XML-Enabled Database Systems now with the O’Reilly learning platform.

O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.