|
|
|
|
Genome Res. 15:780-789, 2005 ©2005 by Cold Spring Harbor Laboratory Press; ISSN 1088-9051/05 $5.00 OPEN ACCESS ARTICLE Letter Analysis of 5' junctions of human LINE-1 and Alu retrotransposons suggests an alternative model for 5'-end attachment requiring microhomology-mediated end-joining1 Fachgebiet Pr2/Retroelemente, Paul-Ehrlich-Institut, D-63225 Langen, Germany 2 Zentrum für Bioinformatik, Universität Hamburg, D-20146 Hamburg, Germany 3 Institut für Medizinische Biometrie und Epidemiologie, Universitätsklinikum Hamburg-Eppendorf, D-20246 Hamburg, Germany 4 Department of Human Genetics and Department of Internal Medicine, University of Michigan Medical School, Ann Arbor, Michigan, 48105-0618, USA
Insertion of the human non-LTR retrotransposon LINE-1 (L1) into chromosomal DNA is thought to be initiated by a mechanism called target-primed reverse transcription (TPRT). This mechanism readily accounts for the attachment of the 3'-end of an L1 copy to the genomic target, but the subsequent integration steps leading to the attachment of the 5'-end to the chromosomal DNA are still cause for speculation. By applying bioinformatics to analyze the 5' junctions of recent L1 insertions in the human genome, we provide evidence that L1 uses at least two distinct mechanisms to link the 5'-end of the nascent L1 copy to its genomic target. While 5'-truncated L1 elements show a statistically significant preference for short patches of overlapping nucleotides between their target site and the point of truncation, full-length insertions display no distinct bias for such microhomologies at their 5'-ends. In a second genome-wide approach, we analyzed Alu elements to examine whether these nonautonomous retrotransposons, which are thought to be mobilized through L1 proteins, show similar characteristics. We found that Alu elements appear to be predominantly integrated via a pathway not involving overlapping nucleotides. The results indicate that a cellular nonhomologous DNA end-joining pathway may resolve intermediates from incomplete L1 retrotransposition events and thus lead to 5' truncations.
The human LINE-1 (Long interspersed nuclear element 1, L1) is one of the best characterized members of the extensive group of non-LTR retrotransposons (Malik et al. 1999 17% of the human genome (Lander et al. 2001 13% of the human genome by mobilizing Alu elements (Dewannieux et al. 2003
A functional full-length L1 element is
Retrotransposition of a new L1 copy into the degenerate genomic consensus sequence 5'-TTTT/A-3' (Cost and Boeke 1998
Several mechanisms have been suggested to explain the attachment of the 5'-end of non-LTR retrotransposons to the chromosome, which are based on regions of microcomplementarity found at the junctions between the 5'-end of the retrotransposon and the 3'-end of the adjacent TSD (Fig. 1). These overlapping nucleotides have been described initially for L1 insertions in the mouse genome and for Cin4 elements in maize, and led to replication models that require bridging of chromosomal double-strand breaks (DSBs) by L1 RNA (Voliva et al. 1984
In the course of our efforts to investigate the means by which the L1 5'-end is attached to the chromosomal DNA, we evaluated whether there is a preference for overlapping nucleotides (nt) between the 5'-end of pre-existing L1 insertions and the 3'-end of the adjacent genomic TSDs, as previously reported for a small number of de novo L1 integrants obtained from tissue culture experiments (Symer et al. 2002
Microhomology patches in the range of 1-12 nt are significantly overrepresented at the 5' junctions of endogenous 5'-truncated L1s In order to gain insight into the enigmatic mechanism of 5'-end attachment of human L1 to the genomic target DNA, we initiated a genome-wide analysis of TSDs flanking endogenous L1 insertions. For this purpose, we applied a recently developed computer program called TSDfinder (Szak et al. 2002 99% of the euchromatic genome. To identify predominantly young L1s, we used the DNA sequence of the highly active L1.3 element (Dombroski et al. 1993
The program identified 10,034 L1 insertions with an intact 3'-end, a poly(A) tail, and TSDs, with the majority (90%) belonging to the younger, primate-specific subfamilies L1Hs, L1P1, or L1P2 (Smit et al. 1995
In our TSDfinder output file we reproduced the previously described distribution of TSD lengths (Szak et al. 2002
Analysis of TSDs flanking 5'-truncated L1 insertions uncovered microhomologies between the 3'-end of the TSD and the 5'-end of their adjacent L1 insertion in 62% (1503/2427) of all cases (Fig. 2A). The regions of microcomplementarity covered 1-12 consecutive nucleotides. Using three different methods, we evaluated statistically whether the observed frequencies of microhomologies at the 5' junctions were significantly different from what was expected by chance. For that purpose, we initially applied methods similar to those described previously to investigate viral/host junction sequences (Roth et al. 1985
Microhomologies associated with full-length insertions had to be investigated in a separate study, as the mathematical basis for the calculation of the expected distribution of overlapping nucleotides differs between 5'-truncated and full-length L1s. Since in the case of full-length insertions the sequence of one of the joined ends is specified, the distribution function shifts toward fewer randomly occurring microhomologies. Also, considering the A+T-rich target sequences, patches of microcomplementarity are more rarely expected to form with the guanine-rich 5'-termini of full-length L1 insertions.
An unambiguous definition of the transcriptional initiation sites of full-length L1 elements is not possible as it has been found that initiation sites can vary in both downstream and upstream directions of nucleotide +1 (Athanikar et al. 2004
Statistical analysis yielded the surprising result that the distribution of microhomologies in endogenous full-length elements differs quite dramatically from the situation observed in 5'-truncated L1 insertions (Fig. 2A,B). While the observed distribution is clearly shifted toward longer patches of microhomology (1-12 nt) in the case of 5'-truncated L1s, this is not the case for full-length insertions, which display overlapping nucleotides roughly at the expected rates (Fig. 2B). Differences between the expected distributions derived from "exact pattern matching" and observed frequencies are not statistically significant as the p-value, p = 0.0268, is above the significance level of
To ensure that false-positive, short TSDs do not bias our results, we repeated these analyses with only those L1s from the initial data set that are flanked by TSDs at least 14 bp long. Although this led to minor reductions of microhomology frequencies observed for truncated L1s, the preference for 1-12-nt overlaps remained highly significant. The results for full-length elements did not change either (Supplemental Table 1). Similarly, we ascertained that the elimination of TSDs containing mismatches (see Szak et al. 2002 The presented data lead to the conclusion that there are at least two mechanisms involved in the attachment of the 5'-end of an L1 copy to the chromosomal DNA: (1) 5'-truncated L1s preferentially use a mechanism involving complementary base-pairing; (2) full-length L1 insertions are attached to the chromosomal DNA predominantly by means of a second mechanism that does not require base-pairing. By analyzing our data set of 3000 endogenous standard L1 insertions for a correlation between insertion length and incidence of microhomologies (Supplemental Fig. 3), this hypothesis of two distinct mechanisms being responsible for the formation of full-length and 5'-truncated elements was supported: While microhomologies were only found in 25% of L1 insertions starting at positions +1 to +6, 50%-65% of all insertions with 5'-truncations exhibit overlapping nucleotides, regardless of the extent of the truncation.
L1 Ta elements display the same distribution of microhomologies at their 5' junctions as evolutionary older L1s
Statistical evaluation showed that the observed frequency of overlapping nucleotides at the 5'-end of L1 Ta insertions (Fig. 2C,D) reflects our findings observed for the initial, larger data set including the older classes of L1 elements (Fig. 2A,B). While 59% (33/56) of the 5'-truncated L1 Ta insertions are characterized by 1-12 overlapping nucleotides at their 5'-end (extensive set of L1s: 62%, 1503/2427), only 25% (13/52) of the full-length insertions have short stretches of overlapping nucleotides (extensive set of L1s: 24%, 137/573). The similarities in observed frequencies of overlapping nucleotides between our initial set of L1 insertions and the subset of young L1 Ta elements are also consistent with the similarly high Dmax-values reflecting the biological relevance of the data (Fig. 2A-D). However, owing to the comparatively low number of elements included in our L1 Ta study, the p-values differ from those calculated for the extensive set of L1s. The results suggest that the difference between observed and expected distributions of microhomologies is only significant for 5'-truncated elements (p1,2,3
Alu elements are mainly inserted through a mechanism not requiring complementary base-pairing
As described before for the L1 TSD data sets (Supplemental Fig. 2A), the length distributions of Alu TSDs from the TSDfinder output file and from the refined Alu TSD data set were compared with each other (Supplemental Fig. 2B). As expected, the distributions of Alu TSDs and L1 TSDs are almost identical, supporting the finding that L1s and Alus are mobilized by the same L1-encoded protein machinery (Dewannieux et al. 2003
Although it was shown that Alu transcription predominantly starts at the first nucleotide of the Alu repeat (Elder et al. 1981 Clearly, differences in the sequence composition between L1 and Alu elements (A+T content: 57% vs. 37%) increase the probability of finding a sequence match in the case of L1 relative to Alu in the same A+T-rich genomic insertion site. However, since we considered the differences in the base composition of L1s and Alus in our calculation of biased/expected and pattern-matched frequencies, our data indicate that different preferences of L1 and Alu for microhomologies are a phenomenon that is not caused by the base composition of these elements. If a microhomology-dependent pathway is involved in Alu integration at all, it is apparently used much less frequently than in L1 integration.
Expanding a recent report by Martin and coworkers who analyzed 73 human and 215 mouse L1 elements (Martin et al. 2005
Based on the microhomologies at the 5' junctions of 5'-truncated standard L1 insertions, we propose a model for the generation of 5'-truncated L1 integrants that offers an alternative to the double-TPRT model: In contrast to double-TPRT, which relies exclusively on L1 proteins for synthesis and attachment of both L1 DNA strands (Supplemental Fig. 1), we suggest that joining of the cDNA to the genomic target DNA and subsequent second-strand synthesis are carried out by host-encoded factors that are involved in "error-prone" NHEJ (Fig. 3C). This proposed mechanism is closely related to twin priming since both models involve alternative NHEJ as a key part. While in the former model alternative NHEJ was proposed to be responsible for the attachment of the noninverted cDNA to the inverted cDNA (Fig. 3B), in the latter model the same DSB repair pathway facilitates the attachment of a noninverted cDNA to the genomic target (Fig. 3C). Our model is strongly supported by a recent study investigating the joining of unknown, complex DSBs (Odersky et al. 2002
Our genome-wide analysis also uncovered that, in contrast to 5'-truncated L1s, 5' junctions of full-length L1 insertions as well as full-length and 5'-truncated Alu elements exhibit no or only a very weak preference for overlapping nucleotides. Neither the double-TPRT model nor our "microhomology-mediated end-joining" model can explain the lack of microhomologies at the junctions of 76% of endogenous full-length L1s, 38% of endogenous truncated L1 insertions, and the majority of Alu insertions analyzed (Fig. 2B,D,E,F). Some of these integrants might derive from alternative NHEJ events as it was reported that a minority of joining events mediated by this pathway lacks microhomologies (Kabotyanski et al. 1998
Adding our proposed microhomology-mediated end-joining mechanism to the previously described twin-priming model (Ostertag and Kazazian Jr. 2001b
The presented combination of models (Fig. 3) also provides an explanation for the characteristics observed in Alu insertions: The frequencies of microhomologies at the junctions of full-length Alus are almost identical to those of full-length L1s, suggesting that the 5'-ends of full-length Alus are preferentially attached by the same mechanism as discussed for full-length L1s. However, the minor bias for microhomologies found in 5'-truncated Alus is in striking contrast to that of 5'-truncated L1s, and could be a consequence of the short size of full-length Alu RNAs: As Alus are only
So far, there is no direct evidence as to how, or even whether, the second DNA strand is synthesized by the LINE-1 machinery. However, the presented data imply that considerable assistance from the DNA repair machinery is involved in the attachment of the 5'-end of L1 to the chromosome. Interestingly, it was shown by Morrish and coworkers that the retrotransposition rate of EN-deficient L1s was significantly elevated in the absence of functional "accurate NHEJ" repair factors (Morrish et al. 2002
Identification of endogenous L1 and Alu sequences flanked by TSDs Endogenous human L1 elements were identified by applying the program TSDfinder (Szak et al. 2002 99% of the euchromatic genome (build 33 as of April 10, 2003) served as the data set, with the exception that the file "unplaced_contigs" was excluded from the study. For identification of L1 elements and their respective 5'- and 3'-flanking sequences, we used the method described by Szak et al. (2002 90% identity to L1.3 sequences returned 30,265 hits in the RepeatMasker Output, and 10,031 of them were annotated to contain a TSD by the TSDfinder program. This data set was used for further studies (Supplemental material: Li_coord_ seq.txt). A similar search strategy was performed with the consensus sequence for AluY elements (Batzer and Deininger 2002 90% identity. In the RepeatMasker search, 147,620 hits were identified, and 34,913 of these were reported to contain a TSD by the TSDfinder program. Further studies were carried out with this data set (Supplemental material: Alu_coord_seq.txt). A Perl program was written to parse information from the TSDfinder output for each hit and join it with sequence information from the DNA reference sequence of the human genome. The program produced an output file in FASTA format reporting the TSDfinder identification numbers, GenBank accession numbers, and the position of the elements within the human sequence contigs. Additionally, it provided the L1 or Alu sequences as well as 150 nt of genomic sequence flanking the respective insertion. Another Perl program parsed the sequence information of the TSD as assigned by the program TSDfinder and the respective identification number for each element in a tabular fashion. Both programs are available on request. The data sets are listed as Supplemental material (L1_TSD.txt, Alu_TSD.txt).
Identification of microhomologies localized at the 5' junctions of L1 and Alu insertions In order to include as many full-length L1s in our analysis as possible, the entire procedure was repeated with two additional L1 consensus sequences, which differ from the original L1.3 sequence in a major polymorphism at the transcriptional start site, where a perfect match was required by our selection criteria. These two alternative sequences start with GAGGG and GGAGG instead of GGGGG. The data sets are available as Supplemental material (L1_selection.xls, Alu_selection.xls).
Statistical analyses In the case of endogenous full-length L1 insertions, the 5'-end is defined by a G-rich purine stretch. Assuming the first nucleotides to be solely Gs, the probability to observe exactly j consecutive ties by chance is P(X = j)= pj · (1 - p), where j =0, 1, 2,... and p denotes the proportion of G in the target sequence. This means that the random variable X being defined as the number of ties until the first non-tie follows a geometric distribution with probability 1 - p. However, the assumption of the geometric distribution holds true only if the nucleotides at the 5'-end remain Gs. In order to consider the polymorphisms that we included in our analysis of full-length L1 elements (see above), we adjusted the formula to allow for the sporadic occurrence of As instead of Gs. The formula was broken down into a decision tree that takes into account all possible L1 start sequences and weights them by the observed occurrence of As at position 2 or 3 of the L1 consensus sequence. In full-length Alu insertions, the first 12 nt consist of Gs and Cs, so that the probabilities of only these two nucleotides were used in the formula.
To get an even more realistic assessment of the influence of the actual base composition directly at the TSD/L1 (or TSD/Alu) junction, we also determined the probability of microhomologies between each individual TSD sequence and any portion of its adjacent L1 (Alu) sequence. For full-length elements, the TSDs were sorted according to the sequence at their 3'-end, and overlaps between TSD and the first 5 nt of the respective consensus sequence (GRRGG for L1; GGCCG for Alu) were determined manually. For 5'-truncated insertions, a Perl program (available on request) was implemented that slides each TSD along its adjacent L1 sequence. The program determines and counts the number of longest, consecutive matches (suffices) between the 3'-end of the TSD and each position within the L1 sequence by exact pattern matching. The result is reported in a tabular fashion giving the length of the suffixes and number of hits obtained for each element. As microhomologies of the length j can be generated in j + 1 ways (see above and Roth et al. 1985 For full-length as well as for 5'-truncated insertions, we tested whether the observed data could originate from the distribution specified under the assumption of random events. Therefore, we performed a Kolmogoroff-Smirnow Test computing p-values and 95% Clopper-Pearson confidence limits in a Monte Carlo simulation. This simulation consisted of 100,000 independent draws from the hypothesized distribution, and for each draw the maximum absolute distance (Dmax) of the observed and the theoretical cumulative distribution function was calculated. The proportion of draws that exceeded the analogous distance observed in our data is reported as an unbiased estimator of the true p-value. Simulations were performed with the software program S-Plus 4.5 (MathSoft Inc.). S-Plus programs are available as Supplemental material. Values for statistical significance (p), biological relevance (Dmax), and confidence intervals that were calculated for biased, unbiased, and "pattern-matched" distributions of microhomologies associated with extant human L1 and Alu insertions are listed in Supplemental Table 1.
We thank S.L. Martin for sharing unpublished information and S. Szak for helpful advice concerning the TSDfinder program. Special thanks to C. Stocking for crucial support and critical reading of the manuscript. This research was supported by grants Schu1014/2-1 and Schu1014/2-2 of the Deutsche Forschungsgemeinschaft and grant AZ.10.01.1.104 of the "Fritz-Thyssen Stiftung" to G.G.S. T.A.M. was supported, in part, by a grant from the NIH (GM60518).
5 Corresponding author. E-mail schgr{at}pei.de; fax 49-6103-771265. [Supplemental material is available online at www.genome.org and at http://www.zbh.uni-hamburg.de/research/GI/projects. php. The following individual kindly provided reagents, samples, or unpublished information as indicated in the paper: S.L. Martin.] Article and publication are at http://www.genome.org/cgi/doi/10.1101/gr.3421505. Freely available online through the Genome Research Immediate Open Access option.
Athanikar, J.N., Badge, R.M., and Moran, J.V. 2004. A YY1-binding site is required for accurate human LINE-1 transcription initiation. Nucleic Acids Res. 32: 3846-3855. Batzer, M.A. and Deininger, P.L. 2002. Alu repeats and human genomic diversity. Nat. Rev. Genet. 3: 370-379.[CrossRef][Medline]
Bentley, J., Diggle, C.P., Harnden, P., Knowles, M.A., and Kiltie, A.E. 2004. DNA double strand break repair in human bladder cancer is error prone and involves microhomology-associated end-joining. Nucleic Acids Res. 32: 5249-5259.
Bibillo, A. and Eickbush, T.H. 2002a. High processivity of the reverse transcriptase from a non-long terminal repeat retrotransposon. J. Biol. Chem. 277: 34836-34845. ____. 2002b. The reverse transcriptase of the non-LTR retrotransposon: Continuous synthesis of cDNA on non-continuous RNA templates. J. Mol. Biol. 316: 459-473.[CrossRef][Medline]
____. 2004. End-to-end template jumping by the reverse transcriptase encoded by the R2 retrotransposon. J. Biol. Chem. 279: 14945-14953.
Boissinot, S., Chevret, P., and Furano, A.V. 2000. L1(LINE-1) retrotransposon evolution and amplification in recent human history. Mol. Biol. Evol. 17: 915-928. Burke, W.D., Malik, H.S., Jones, J.P., and Eickbush, T.H. 1999. The domain structure and retrotransposition mechanism of R2 elements are conserved throughout arthropods. Mol. Biol. Evol. 16: 502-511.[Abstract] Cost, G.J. and Boeke, J.D. 1998. Targeting of human retrotransposon integration is directed by the specificity of the L1 endonuclease for regions of unusual DNA structure. Biochemistry 37: 18081-18093.[CrossRef][Medline] Cost, G.J., Feng, Q., Jacquier, A., and Boeke, J.D. 2002. Human L1 element target-primed reverse transcription in vitro. EMBO J. 21: 5899-5910.[CrossRef][Medline] Dewannieux, M., Esnault, C., and Heidmann, T. 2003. LINE-mediated retrotransposition of marked Alu sequences. Nat. Genet. 35: 41-48.[CrossRef][Medline]
Dombroski, B.A., Mathias, S.L., Nanthakumar, E., Scott, A.F., and Kazazian Jr., H.H. 1991. Isolation of an active human transposable element. Science 254: 1805-1808.
Dombroski, B.A., Scott, A.F., and Kazazian Jr., H.H. 1993. Two additional potential retrotransposons isolated from a human L1 subfamily that contains an active retrotransposable element. Proc. Natl. Acad. Sci. 90: 6513-6517.
Dombroski, B., Feng, Q., Mathias, S.L., Sassaman, D.M., Scott, A.F., Kazazian, H.H., and Boeke, J.D. 1994. An in vivo assay for the reverse transcriptase of human retrotransposon L1 in Saccharomyces cerevisiae. Mol. Cell. Biol. 14: 4485-4492. Eickbush, T.H. 2002. R2 and related site-specific non-long terminal repeat retrotransposons. In Mobile DNA II (eds. N.L. Craig et al.), pp. 813-835. American Society for Microbiology, Washington, DC.
Elder, J.T., Pan, J., Duncan, C.H., and Weissman, S.M. 1981. Transcriptional analysis of interspersed repetitive polymerase III transcription units in human DNA. Nucleic Acids Res. 9: 1171-1189. Esnault, C., Maestre, J., and Heidmann, T. 2000. Human LINE retrotransposons generate processed pseudogenes. Nat. Genet. 24: 363-367.[CrossRef][Medline]
Fanning, T. and Singer, M. 1987. The LINE-1 DNA sequences in four mammalian orders predict proteins that conserve homologies to retrovirus proteins. Nucleic Acids Res. 15: 2251-2260. Feng, Q., Moran, J., Kazazian Jr., H.H., and Boeke, J.D. 1996. Human L1 retrotransposon encodes a conserved endonuclease required for retrotransposition. Cell 87: 905-916.[CrossRef][Medline]
Feng, Q., Schumann, G., and Boeke, J.D. 1998. Retrotransposon R1Bm endonuclease cleaves the target sequence. Proc. Natl. Acad. Sci. 95: 2083-2088.
Fuhrman, S., Deininger, P.L., LaPorte, P., Friedmann, T., and Geiduschek, E.P. 1981. Analysis of transcription of the human Alu family ubiquitous repeating element by eukaryotic RNA polymerase III. Nucleic Acids Res. 9: 6439-6456.
Fujimoto, H., Hirukawa, Y., Tani, H., Matsuura, Y., Hashido, K., Tsuchida, K., Takada, N., Kobayashi, M., and Maekawa, H. 2004. Integration of the 5' end of the retrotransposon, R2Bm, can be complemented by homologous recombination. Nucleic Acids Res. 32: 1555-1565. Gilbert, N., Lutz-Prigge, S., and Moran, J.V. 2002. Genomic deletions created upon LINE-1 retrotransposition. Cell 110: 315-325.[CrossRef][Medline] Göttlich, B., Reichenberger, S., Feldmann, E., and Pfeiffer, P. 1998. Rejoining of DNA double-strand breaks in vitro by single-strand annealing. Eur. J. Biochem. 258: 387-395.[Medline]
Hattori, M., Hidaka, S., and Sakaki, Y. 1985. Sequence analysis of a KpnI family member near the 3' end of human Izsvák, Z., Stüve, E.E., Fiedler, D., Katzer, A., Jeggo, P., and Ivics, Z. 2004. Healing the wounds inflicted by Sleeping Beauty transposition by double-strand break repair in mammalian somatic cells. Mol. Cell 13: 279-290.[CrossRef][Medline]
Jurka, J. 1997. Sequence patterns indicate an enzymatic involvement in integration of mammalian retroposons. Proc. Natl. Acad. Sci. 94: 1872-1877.
Kabotyanski, E.B., Gomelsky, L., Han, J.-O., Stamato, T.D., and Roth, D.B. 1998. Double-strand break repair in Ku86- and XRCC4-deficient cells. Nucleic Acids Res. 26: 5333-5342.
Kolosha, V.O. and Martin, S.L. 1997. In vitro properties of the first ORF protein from mouse LINE-1 support its role in ribonucleoprotein particle formation during retrotransposition. Proc. Natl. Acad. Sci. 94: 10155-10160.
____. 2003. High-affinity, non-sequence-specific RNA binding by the open reading frame 1 (ORF1) protein from Long Interspersed Nuclear Element 1 (LINE-1). J. Biol. Chem. 278: 8112-8117. Lander, E.S., Linton, L.M., Birren, B., Nusbaum, C., Zody, M.C., Baldwin, J., Devon, K., Dewar, K., Doyle, M., FitzHugh, W., et al. 2001. Initial sequencing and analysis of the human genome. Nature 409: 860-921.[CrossRef][Medline]
Lavie, L., Maldener, E., Brouha, B., Meese, E.U., and Mayer, J. 2004. The human L1 promoter: Variable transcription initiation sites and a major impact of upstream flanking sequence on promoter activity. Genome Res. 14: 2253-2260. Li, L., Olvera, J.M., Yoder, K.E., Mitchell, R.S., Butler, S.L., Lieber, M., Martin, S.L., and Bushman, F.D. 2001. Role of the non-homologous DNA end joining pathway in the early steps of retroviral infection. EMBO J. 20: 3272-3281.[CrossRef][Medline]
Liu, W.-M., Maraia, R.J., Rubin, C.M., and Schmid, C.W. 1994. Alu transcripts: Cytoplasmic localisation and regulation by DNA methylation. Nucleic Acids Res. 22: 1087-1095. Luan, D.D., Korman, M.H., Jakubczak, J.L., and Eickbush, T.H. 1993. Reverse transcription of R2Bm RNA is primed by a nick at the chromosomal target site: A mechanism for non-LTR retrotransposition. Cell 72: 595-605.[CrossRef][Medline] Malik, H.S., Burke, W.D., and Eickbush, T.H. 1999. The age and evolution of non-LTR retrotransposable elements. Mol. Biol. Evol. 16: 793-805.[Abstract]
Martin, S.L. and Bushman, F.D. 2001. Nucleic acid chaperone activity of the ORF1 protein from the mouse LINE-1 retrotransposon. Mol. Cell. Biol. 21: 467-475. Martin, S.L., Li, W.P., Furano, A.V., and Boissinot, S. 2005. The structures of mouse and human L1 elements reflect their insertion mechanism. Cytogenet. Genome Res. 110: 223-228.
Mathias, S.L., Scott, A.F., Kazazian Jr., H.H., Boeke, J.D., and Gabriel, A. 1991. Reverse transcriptase encoded by a human retrotransposon. Science 254: 1808-1810. Moore, J.K. and Haber, J.E. 1996. Capture of retrotransposon DNA at the sites of chromosomal double-strand breaks. Nature 383: 644-646.[CrossRef][Medline] Morrish, T.A., Gilbert, N., Myers, J.S., Vincent, B.J., Stamato, T.D., Taccioli, G.E., Batzer, M.A., and Moran, J.V. 2002. DNA repair mediated by endonuclease-independent LINE-1 retrotransposition. Nat. Genet. 31: 159-165.[CrossRef][Medline] Myers, J.S., Vincent, B.J., Udall, H., Watkins, W.S., Morrish, T.A., Kilroy, G.E., Swergold, G.D., Henke, J., Henke, L., Moran, J.V., et al. 2002. A comprehensive analysis of recently integrated human Ta L1 elements. Am. J. Hum. Genet. 71: 312-326.[CrossRef][Medline]
Odersky, A., Panyutin, I.V., Panyutin, I.G., Schunck, C., Feldmann, E., Goedecke, W., Neumann, R.D., Obe, G., and Pfeiffer, P. 2002. Repair of sequence-specific 125I-induced double-strand breaks by nonhomologous DNA end joining in mammalian cell-free extract. J. Biol. Chem. 277: 11756-11764. Ostertag, E.M. and Kazazian Jr., H.H. 2001a. Biology of mammalian L1 retrotransposons. Annu. Rev. Genet. 35: 501-538.[CrossRef][Medline]
____. 2001b. Twin priming: A proposed mechanism for the creation of inversions in L1 retrotransposition. Genome Res. 11: 2059-2065.
Pfeiffer, P., Goedecke, W., and Obe, G. 2000. Mechanisms of DNA double-strand break repair and their potential to induce chromosomal aberrations. Mutagenesis 15: 289-302. Roth, D.B. 2003. Restraining the V(D)J recombinase. Nat. Rev. Immunol. 3: 656-666.[CrossRef][Medline]
Roth, D.B., Porter, T.N., and Wilson, J.H. 1985. Mechanisms of nonhomologous recombination in mammalian cells. Mol. Cell. Biol. 5: 2599-2607. Sassaman, D.M., Dombroski, B.A., Moran, J.V., Kimberland, M.L., Naas, T.P., DeBerardinis, R.J., Gabriel, A., Swergold, G.D., and Kazazian Jr., H.H. 1997. Many human L1 elements are capable of retrotransposition. Nat. Genet. 16: 37-43.[CrossRef][Medline] Schwarz-Sommer, Z., Leclercq, L., Göbel, E., and Saedler, H. 1987. Cin 4, an insert altering the structure of the A1 gene in Zea mays, exhibits properties of nonviral retrotransposons. EMBO J. 6: 3873-3880.[Medline]
Skowronski, J., Fanning, T.G., and Singer, M.F. 1988. Unit-length LINE-1 transcripts in human teratocarcinoma cells. Mol. Cell. Biol. 8: 1385-1397. Smit, A.F.A., Toth, G., Riggs, A.D., and Jurka, J. 1995. Ancestral, mammalian-wide subfamilies of LINE-1 repetitive sequences. J. Mol. Biol. 246: 401-417.[CrossRef][Medline] Symer, D.E., Connelly, C., Szak, S.T., Caputo, E.M., Cost, G.J., Parmigani, G., and Boeke, J.D. 2002. Human L1 retrotransposition is associated with genetic instability in vivo. Cell 110: 327-338.[CrossRef][Medline] Szak, S.T., Pickeral, O.K., Makalowski, W., Boguski, M.S., Landsman, D., and Boeke, J.D. 2002. Molecular archeology of L1 insertions in the human genome. Genome Biol. 3: 0052.0051-0052.0018. Teng, S.-C., Kim, B., and Gabriel, A. 1996. Retrotransposon reverse transcriptase-mediated repair of chromosomal breaks in Saccharomyces cerevisiae. Nature 383: 641-644.[CrossRef][Medline] Thacker, J., Chalk, J., Ganesh, A., and North, P. 1992. A mechanism for deletion formation in DNA by human cell extracts: The involvement of short sequence repeats. Nucleic Acids Res. |