|
Vol. 9, Issue 4, 373-382, April 1999
METHODS
Panning for Genes A Visual Strategy for Identifying Novel Gene Orthologs and Paralogs
Jacques D.
Retief,1
Kevin R.
Lynch,2 and
William R.
Pearson2,3
1 Information Technology and Communications-Academic
Computing Health Sciences, and 2 Department of Biochemistry,
University of Virginia, Charlottesville, Virginia 22908 USA
We have developed a rapid visual method for identifying novel
members of gene families. Starting with an evolutionary tree, 20-50
protein query sequences for a gene family are selected from different
branches of the tree. These query sequences are used to search the
GenBank and expressed sequence tag (EST) DNA databases and their
nightly updates using the tfastx3 or tfasty3 programs. The results of
all 20-50 searches are collated and resorted to highlight EST or
genomic sequences that share significant similarity with the query
sequences. The statistical significance of each DNA/protein alignment
is plotted, highlighting the portion of the query sequence that is
present in the database sequence and the percent identity in the
aligned region. The collated results for database sequences are linked
using the WWW to the underlying scores and alignments; these links can
also be used to perform additional searches to characterize the novel
sequence further. With traditional "deep" scoring matrices
(BLOSUM50) one can search for previously unrecognized families of large
protein superfamilies. Alternatively, by using query sequences and EST
libraries from the same species (e.g., human or mouse) together with
"shallow" scoring matrices and filters that remove high-identity
sequences, one can highlight new paralogs of previously described
subfamilies. Using query sequences from the glutathione transferase
superfamily, we identified two novel mammalian glutathione transferase
families that were recognized previously only in plants. Using query
sequences from known mammalian glutathione transferase subfamilies, we
identified new candidate paralogs from the mouse class-mu, class-pi,
and class-theta families.
3
Corresponding author.
9:373-382 ©1999 by Cold Spring Harbor Laboratory Press ISSN 1088-9051/99 $5.00

CiteULike Connotea Del.icio.us Digg Reddit Technorati What's this?
This article has been cited by other articles:

|
 |

|
 |
 
D. W. Mount
Using a FASTA Sequence Database Similarity Search
CSH Protocols,
July 1, 2007;
2007(14):
pdb.top16 - pdb.top16.
[Abstract]
[Full Text]
|
 |
|

|
 |

|
 |
 
T. Nguyen, D. A. Shapiro, S. R. George, V. Setola, D. K. Lee, R. Cheng, L. Rauser, S. P. Lee, K. R. Lynch, B. L. Roth, et al.
Discovery of a Novel Member of the Histamine Receptor Family
Mol. Pharmacol.,
March 1, 2001;
59(3):
427 - 433.
[Abstract]
[Full Text]
|
 |
|

|
 |

|
 |
 
D.-S. Im, C. E. Heise, N. Ancellin, B. F. O'Dowd, G.-j. Shei, R. P. Heavens, M. R. Rigby, T. Hla, S. Mandala, G. McAllister, et al.
Characterization of a Novel Sphingosine 1-Phosphate Receptor, Edg-8
J. Biol. Chem.,
May 5, 2000;
275(19):
14281 - 14286.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
P. G. Board, M. Coggan, G. Chelvanayagam, S. Easteal, L. S. Jermiin, G. K. Schulte, D. E. Danley, L. R. Hoth, M. C. Griffor, A. V. Kamath, et al.
Identification, Characterization, and Crystal Structure of the Omega Class Glutathione Transferases
J. Biol. Chem.,
August 4, 2000;
275(32):
24798 - 24806.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
C. E. Heise, B. F. O'Dowd, D. J. Figueroa, N. Sawyer, T. Nguyen, D.-S. Im, R. Stocco, J. N. Bellefeuille, M. Abramovitz, R. Cheng, et al.
Characterization of the Human Cysteinyl Leukotriene 2 Receptor
J. Biol. Chem.,
September 22, 2000;
275(39):
30531 - 30536.
[Abstract]
[Full Text]
[PDF]
|
 |
|
|
|