Genome Res. 13:2129-2141, 2003
©2003 by Cold Spring Harbor Laboratory Press; ISSN 1088-9051/03 $5.00
Methods
PANTHER: A Library of Protein Families and Subfamilies Indexed by Function
Paul D. Thomas1,3,
Michael J. Campbell1,
Anish Kejariwal,
Huaiyu Mi,
Brian Karlak2,
Robin Daverman,
Karen Diemer,
Anushya Muruganujan and
Apurva Narechania
Protein Informatics, Celera Genomics, Foster City, California 94404, USA
In the genomic era, one of the fundamental goals is to characterize the function of proteins on a large scale. We describe a method, PANTHER, for relating protein sequence relationships to function relationships in a robust and accurate way. PANTHER is composed of two main components: the PANTHER library (PANTHER/LIB) and the PANTHER index (PANTHER/X). PANTHER/LIB is a collection of "books," each representing a protein family as a multiple sequence alignment, a Hidden Markov Model (HMM), and a family tree. Functional divergence within the family is represented by dividing the tree into subtrees based on shared function, and by subtree HMMs. PANTHER/X is an abbreviated ontology for summarizing and navigating molecular functions and biological processes associated with the families and subfamilies. We apply PANTHER to three areas of active research. First, we report the size and sequence diversity of the families and subfamilies, characterizing the relationship between sequence divergence and functional divergence across a wide range of protein families. Second, we use the PANTHER/X ontology to give a high-level representation of gene function across the human and mouse genomes. Third, we use the family HMMs to rank missense single nucleotide polymorphisms (SNPs), on a database-wide scale, according to their likelihood of affecting protein function.
[Supplemental material is available online at http://panther.celera.com/publications/gr7724_03=suppl.]
Article and publication are at http://www.genome.org/cgi/doi/10.1101/gr.772403.
1 These authors contributed equally to this work.
3 Corresponding author. E-MAIL paul.thomas{at}fc.celera.com; FAX (650) 554-2344.
2 Present address: Syrrx, Inc., San Diego, CA 92121, USA.

CiteULike Connotea Del.icio.us Digg Reddit Technorati What's this?
This article has been cited by other articles:

|
 |

|
 |
 
A. L. Fisher, K. E. Page, G. J. Lithgow, and L. Nash
The Caenorhabditis elegans K10C2.4 Gene Encodes a Member of the Fumarylacetoacetate Hydrolase Family: A CAENORHABDITIS ELEGANS MODEL OF TYPE I TYROSINEMIA
J. Biol. Chem.,
April 4, 2008;
283(14):
9127 - 9135.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
E. W. Deutsch, H. Lam, and R. Aebersold
Data analysis and bioinformatics tools for tandem mass spectrometry in proteomics
Physiol Genomics,
March 10, 2008;
33(1):
18 - 25.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
A. P.A. van Montfoort, J. P.M. Geraedts, J. C.M. Dumoulin, A. P.M. Stassen, J. L.H. Evers, and T. A.Y. Ayoubi
Differential gene expression in cumulus cells as a prognostic indicator of embryo viability: a microarray analysis
Mol. Hum. Reprod.,
March 1, 2008;
14(3):
157 - 168.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
M. J. Liguori, E. A.G. Blomme, and J. F. Waring
Trovafloxacin-Induced Gene Expression Changes in Liver-Derived in Vitro Systems: Comparison of Primary Human Hepatocytes to HepG2 Cells
Drug Metab. Dispos.,
February 1, 2008;
36(2):
223 - 233.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
L. Yao and A. Rzhetsky
Quantitative systems-level determinants of human genes targeted by successful drugs
Genome Res.,
February 1, 2008;
18(2):
206 - 213.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
Z. Du, Y. Zhao, and N. Li
Genome-wide analysis reveals regulatory role of G4 DNA in gene transcription
Genome Res.,
February 1, 2008;
18(2):
233 - 241.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
P. Uawithya, T. Pisitkun, B. E. Ruttenberg, and M. A. Knepper
Transcriptional profiling of native inner medullary collecting duct cells from rat kidney
Physiol Genomics,
January 17, 2008;
32(2):
229 - 253.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
K. C. Teeter, B. A. Payseur, L. W. Harris, M. A. Bakewell, L. M. Thibodeau, J. E. O'Brien, J. G. Krenz, M. A. Sans-Fuentes, M. W. Nachman, and P. K. Tucker
Genome-wide patterns of gene flow across a house mouse hybrid zone
Genome Res.,
January 1, 2008;
18(1):
67 - 76.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
A. L. Amelio, L. J. Miraglia, J. J. Conkright, B. A. Mercer, S. Batalov, V. Cavett, A. P. Orth, J. Busby, J. B. Hogenesch, and M. D. Conkright
A coactivator trap identifies NONO (p54nrb) as a component of the cAMP-signaling pathway
PNAS,
December 18, 2007;
104(51):
20314 - 20319.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
L. Whyte, Y.-Y. Huang, K. Torres, and R. G. Mehta
Molecular Mechanisms of Resveratrol Action in Lung Cancer Cells Using Dual Protein and Microarray Analyses
Cancer Res.,
December 15, 2007;
67(24):
12007 - 12017.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
M. J. Potthoff, M. A. Arnold, J. McAnally, J. A. Richardson, R. Bassel-Duby, and E. N. Olson
Regulation of Skeletal Muscle Sarcomere Integrity and Postnatal Muscle Function by Mef2c
Mol. Cell. Biol.,
December 1, 2007;
27(23):
8143 - 8151.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
J. Wang, H. Cao, M. R. Ban, B. A. Kennedy, S. Zhu, S. Anand, S. Yusuf, R. L. Pollex, and R. A. Hegele
Resequencing Genomic DNA of Patients With Severe Hypertriglyceridemia (MIM 144650)
Arterioscler. Thromb. Vasc. Biol.,
November 1, 2007;
27(11):
2450 - 2455.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
A. Torkamani and N. J. Schork
Accurate prediction of deleterious protein kinase polymorphisms
Bioinformatics,
November 1, 2007;
23(21):
2918 - 2925.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
B. Jacquelin, V. Mayau, G. Brysbaert, B. Regnault, O. M. Diop, F. Arenzana-Seisdedos, L. Rogge, J.-Y. Coppee, F. Barre-Sinoussi, A. Benecke, et al.
Long oligonucleotide microarrays for African green monkey gene expression profile analysis
FASEB J,
October 1, 2007;
21(12):
3262 - 3271.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
A. M. Mlynarek, R. L. Balys, J. Su, M. P. Hier, M. J. Black, and M. A. Alaoui-Jamali
A Cell Proteomic Approach for the Detection of Secretable Biomarkers of Invasiveness in Oral Squamous Cell Carcinoma
Arch Otolaryngol Head Neck Surg,
September 1, 2007;
133(9):
910 - 918.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
B. M. Hallstrom, M. Kullberg, M. A. Nilsson, and A. Janke
Phylogenomic Data Analyses Provide Evidence that Xenarthra and Afrotheria Are Sister Groups
Mol. Biol. Evol.,
September 1, 2007;
24(9):
2059 - 2068.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
R. L. Montgomery, C. A. Davis, M. J. Potthoff, M. Haberland, J. Fielitz, X. Qi, J. A. Hill, J. A. Richardson, and E. N. Olson
Histone deacetylases 1 and 2 redundantly regulate cardiac morphogenesis, growth, and contractility
Genes & Dev.,
July 15, 2007;
21(14):
1790 - 1802.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
M. Eickhoff, J. Thalmann, S. Hess, M. Martin, T. Laue, J. Kruppa, G. Brandes, and A. Klos
Host Cell Responses to Chlamydia pneumoniae in Gamma Interferon-Induced Persistence Overlap Those of Productive Infection and Are Linked to Genes Involved in Apoptosis, Cell Cycle, and Metabolism
Infect. Immun.,
June 1, 2007;
75(6):
2853 - 2863.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
P. Kallio, M. Kolehmainen, D. E Laaksonen, J. Kekalainen, T. Salopuro, K. Sivenius, L. Pulkkinen, H. M Mykkanen, L. Niskanen, M. Uusitupa, et al.
Dietary carbohydrate modification induces alterations in gene expression in abdominal subcutaneous adipose tissue in persons with the metabolic syndrome: the FUNGENUT Study
Am. J. Clinical Nutrition,
May 1, 2007;
85(5):
1417 - 1427.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
H. Mi, N. Guo, A. Kejariwal, and P. D. Thomas
PANTHER version 6: protein sequence and function evolution data with expanded representation of biological pathways
Nucleic Acids Res.,
January 12, 2007;
35(suppl_1):
D247 - D252.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
R. R. Singaraja, H. Visscher, E. R. James, A. Chroni, J. M. Coutinho, L. R. Brunham, M. H. Kang, V. I. Zannis, G. Chimini, and M. R. Hayden
Specific Mutations in ABCA1 Have Discrete Effects on ABCA1 Function and Lipid Phenotypes Both In Vivo and In Vitro
Circ. Res.,
August 18, 2006;
99(4):
389 - 397.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
J. Gough
Genomic scale sub-family assignment of protein domains
Nucleic Acids Res.,
July 28, 2006;
34(13):
3625 - 3633.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
P. D. Thomas, A. Kejariwal, N. Guo, H. Mi, M. J. Campbell, A. Muruganujan, and B. Lazareva-Ulitsky
Applications for protein sequence-function evolution data: mRNA/protein expression analysis and coding SNP scoring tools.
Nucleic Acids Res.,
July 1, 2006;
34(Web Server issue):
W645 - W650.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
G. A. Reeves, J. M. Thornton, and the BioSapiens Network of Excellence
Integrating biological data through the genome
Hum. Mol. Genet.,
April 15, 2006;
15(suppl_1):
R81 - R87.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
V. Albertini, A. Jain, S. Vignati, S. Napoli, A. Rinaldi, I. Kwee, M. Nur-e-Alam, J. Bergant, F. Bertoni, G. M. Carbone, et al.
Novel GC-rich DNA-binding compound produced by a genetically engineered mutant of the mithramycin producer Streptomyces argillaceus exhibits improved transcriptional repressor activity: implications for cancer therapy.
Nucleic Acids Res.,
January 1, 2006;
34(6):
1721 - 1734.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
D. G. Phinney, K. Hill, C. Michelson, M. DuTreil, C. Hughes, S. Humphries, R. Wilkinson, M. Baddoo, and E. Bayly
Biological Activities Encoded by the Murine Mesenchymal Stem Cell Transcriptome Provide a Basis for Their Developmental Potential and Broad Therapeutic Efficacy
Stem Cells,
January 1, 2006;
24(1):
186 - 198.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
B. Lazareva-Ulitsky, K. Diemer, and P. D. Thomas
On the quality of tree-based protein classification
Bioinformatics,
May 1, 2005;
21(9):
1876 - 1890.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
X. H. Zheng, F. Lu, Z.-Y. Wang, F. Zhong, J. Hoover, and R. Mural
Using shared genomic synteny and shared protein functions to enhance the identification of orthologous gene pairs
Bioinformatics,
March 15, 2005;
21(6):
703 - 710.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
S. Abhiman and E. L. L. Sonnhammer
FunShift: a database of function shift analysis on protein subfamilies
Nucleic Acids Res.,
January 1, 2005;
33(suppl_1):
D197 - D200.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
H. Mi, B. Lazareva-Ulitsky, R. Loo, A. Kejariwal, J. Vandergriff, S. Rabkin, N. Guo, A. Muruganujan, O. Doremieux, M. J. Campbell, et al.
The PANTHER database of protein families, subfamilies, functions and pathways
Nucleic Acids Res.,
January 1, 2005;
33(suppl_1):
D284 - D288.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
R. A. Drysdale, M. A. Crosby, and The FlyBase Consortium
FlyBase: genes and gene models
Nucleic Acids Res.,
January 1, 2005;
33(suppl_1):
D390 - D395.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
P. A. Gray, H. Fu, P. Luo, Q. Zhao, J. Yu, A. Ferrari, T. Tenzen, D.-i. Yuk, E. F. Tsung, Z. Cai, et al.
Mouse Brain Organization Revealed Through Direct Genome-Scale TF Expression Analysis
Science,
December 24, 2004;
306(5705):
2255 - 2257.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
P. D. Thomas and A. Kejariwal
Coding single-nucleotide polymorphisms associated with complex vs. Mendelian disease: Evolutionary evidence for differences in molecular effects
PNAS,
October 26, 2004;
101(43):
15398 - 15403.
[Abstract]
[Full Text]
[PDF]
|
 |
|
|
|