Vol. 11, Issue 10, 1766-1779, October 2001
METHODS
DIAN: A Novel Algorithm for Genome Ontological Classification
Yannick
Pouliot,
Jing
Gao,
Qiaojuan Jane
Su,
Guozhen Gordon
Liu, and
Xuefeng Bruce
Ling1,2
DoubleTwist, Inc., Oakland, California 94612, USA
Faced with the determination of many completely sequenced genomes,
computational biology is now faced with the challenge of interpreting
the significance of these data sets. A multiplicity of data-related
problems impedes this goal: Biological annotations associated with raw
data are often not normalized, and the data themselves are often poorly
interrelated and their interpretation unclear. All of these problems
make interpretation of genomic databases increasingly difficult. With
the current explosion of sequences now available from the human genome
as well as from model organisms, the importance of sorting this vast
amount of conceptually unstructured source data into a limited universe of genes, proteins, functions, structures, and pathways has become a
bottleneck for the field. To address this problem, we have developed a
method of interrelating data sources by applying a novel method of
associating biological objects to ontologies. We have developed an
intelligent knowledge-based algorithm, DIAN, to support biological knowledge mapping, and, in particular, to facilitate the
interpretation of genomic data. In this respect, the method makes it
possible to inventory genomes by collapsing multiple types of
annotations and normalizing them to various ontologies. By relying on a
conceptual view of the genome, researchers can now easily navigate the
human genome in a biologically intuitive, scientifically accurate manner.
1
Present address: Tularik, Inc., 2 Corporate Drive, South
San Francisco, CA 94080, USA.
2
Corresponding author.
11:1766-1779 ©2001 by Cold Spring Harbor Laboratory Press ISSN 1088-9051/01 $5.00