Published online before print
February 15, 2002, 10.1101/gr.193702
Vol. 12, Issue 3, 367-378, March 2002
LETTER
Exploration of Novel Motifs Derived from Mouse cDNA Sequences
Hideya
Kawaji,1,2
Christian
Schönbach,3,6
Yo
Matsuo,4
Jun
Kawai,5
Yasushi
Okazaki,5
Yoshihide
Hayashizaki,5 and
Hideo
Matsuda1,6
1 Department of Informatics and Mathematical Science,
Graduate School of Engineering Science, Osaka University, Toyonaka
560-8531, Japan; 2 Nippon Telegraph and Telephone Software
Corporation, Yokohama 231-8554, Japan;
3 Computational Genomics Team, Bioinformatics Group, RIKEN
Genomic Sciences Center (GSC), Yokohama 230-0045, Japan;
4 Computational Proteomics Team, Bioinformatics Group, RIKEN
Genomic Sciences Center (GSC), Yokohama 230-0045, Japan;
5 Laboratory for Genome Exploration Research Group, RIKEN
Genomic Sciences Center (GSC), Yokohama 230-0045, Japan
We performed a systematic maximum density subgraph (MDS) detection
of conserved sequence regions to discover new, biologically relevant
motifs from a set of 21,050 conceptually translated mouse cDNA
(FANTOM1) sequences. A total of 3202 candidate sequences, which shared
similar regions over >20 amino acid residues, were screened against
known conserved regions listed in Pfam, ProDom, and InterPro. The
filtering procedure resulted in 139 FANTOM1 sequences belonging to 49 new motif candidates. Using annotations and multiple sequence alignment
information, we removed by visual inspection 42 candidates whose
members were found to be false positives because of sequence
redundancy, alternative splicing, low complexity, transcribed
retroviral repeat elements contained in the region of the predicted
open reading frame, and reports in the literature. The remaining seven
motifs have been expanded by hidden Markov model (HMM)
profile searches of SWISS-PROT/TrEMBL from 28 FANTOM1 sequences to 164 members and analyzed in detail on sequence and structure level to
elucidate the possible functions of motifs and members. The novel and
conserved motif MDS00105 is specific for the mammalian inhibitor of
growth (ING) family. Three submotifs MDS00105.1-3 are specific for
ING1/ING1L, ING1-homolog, and ING3 subfamilies. The motif MDS00105
together with a PHD finger domain constitutes a module for ING
proteins. Structural motif MDS00113 represents a leucine zipper-like
motif. Conserved motif MDS00145 is a novel
1-acyl-SN-glycerol-3-phosphate acyltransferase (AGPAT) submotif
containing a transmembrane domain that distinguishes AGPAT3 and AGPAT4
from all other acyltransferase domain-containing proteins. Functional
motif MDS00148 overlaps with the kazal-type serine protease inhibitor
domain but has been detected only in an extracellular loop region of
solute carrier 21 (SLC21) (organic anion transporters) family members,
which may regulate the specificity of anion uptake. Our motif discovery
not only aided in the functional characterization of new mouse
orthologs for potential drug targets but also allowed us to predict
that at least 16 other new motifs are waiting to be discovered from the
current SWISS-PROT/TrEMBL database.
6
Corresponding authors.
12:367-378 ©2002 by Cold Spring Harbor Laboratory Press ISSN 1088-9051/02 $5.00

CiteULike Connotea Del.icio.us Digg Reddit Technorati What's this?
This article has been cited by other articles:

|
 |

|
 |
 
N. J. Mulder, R. Apweiler, T. K. Attwood, A. Bairoch, A. Bateman, D. Binns, P. Bork, V. Buillard, L. Cerutti, R. Copley, et al.
New developments in the InterPro database
Nucleic Acids Res.,
January 12, 2007;
35(suppl_1):
D224 - D228.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
L. Vergnes, A. P. Beigneux, R. Davis, S. M. Watkins, S. G. Young, and K. Reue
Agpat6 deficiency causes subdermal lipodystrophy and resistance to obesity
J. Lipid Res.,
April 1, 2006;
47(4):
745 - 754.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
G. H.Y. He, C. C. Helbing, M. J. Wagner, C. W. Sensen, and K. Riabowol
Phylogenetic Analysis of the ING Family of PHD Finger Proteins
Mol. Biol. Evol.,
January 1, 2005;
22(1):
104 - 116.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
G S Nouman, J J Anderson, J Lunec, and B Angus
The role of the tumour suppressor p33ING1b in human neoplasia
J. Clin. Pathol.,
July 1, 2003;
56(7):
491 - 496.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
V. Brusic, R. S. Pillai, D. G. Silva, N. Petrovsky, RIKEN GER Group, GSL Members, and C. Schonbach
Cytokine-Related Genes Identified From the RIKEN Full-Length Mouse cDNA Data Set
Genome Res.,
June 1, 2003;
13(6):
1307 - 1317.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
A. Kanapin, S. Batalov, M. J. Davis, J. Gough, S. Grimmond, H. Kawaji, M. Magrane, H. Matsuda, C. Schonbach, R. D. Teasdale, et al.
Mouse Proteome Analysis
Genome Res.,
June 1, 2003;
13(6):
1335 - 1344.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
T. Nagashima, D. G. Silva, N. Petrovsky, L. A. Socha, H. Suzuki, R. Saito, T. Kasukawa, I. V. Kurochkin, A. Konagaya, and C. Schonbach
Inferring Higher Functional Information for RIKEN Mouse Full-Length cDNA Clones With FACTS
Genome Res.,
June 1, 2003;
13(6):
1520 - 1533.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
T. Kasukawa, M. Furuno, I. Nikaido, H. Bono, D. A. Hume, C. Bult, D. P. Hill, R. Baldarelli, J. Gough, A. Kanapin, et al.
Development and Evaluation of an Automated Annotation Pipeline and cDNA Annotation System
Genome Res.,
June 1, 2003;
13(6):
1542 - 1551.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
J. Magre, M. Delepine, L. Van Maldergem, J.-J. Robert, J. A. Maassen, M. Meier, V. R. Panz, C. A. Kim, N. Tubiana-Rufi, P. Czernichow, et al.
Prevalence of Mutations in AGPAT2 Among Human Lipodystrophies
Diabetes,
June 1, 2003;
52(6):
1573 - 1578.
[Abstract]
[Full Text]
[PDF]
|
 |
|
|
|