The Role of Retrotransposons in Gene Family Expansions in the Human and Mouse Genomes
Jazyk angličtina Země Anglie, Velká Británie Médium electronic
Typ dokumentu časopisecké články, Research Support, N.I.H., Extramural, práce podpořená grantem
Grantová podpora
P30 CA023074
NCI NIH HHS - United States
U54 CA143924
NCI NIH HHS - United States
PubMed
27503295
PubMed Central
PMC5631067
DOI
10.1093/gbe/evw192
PII: evw192
Knihovny.cz E-zdroje
- Klíčová slova
- LINE, LTR, SINE, gene families, retrotransposons, transposable elements,
- MeSH
- expanze repetic DNA MeSH
- genom lidský * MeSH
- lidé MeSH
- molekulární evoluce * MeSH
- multigenová rodina * MeSH
- myši MeSH
- polymorfismus genetický MeSH
- retroelementy * MeSH
- zvířata MeSH
- Check Tag
- lidé MeSH
- myši MeSH
- zvířata MeSH
- Publikační typ
- časopisecké články MeSH
- práce podpořená grantem MeSH
- Research Support, N.I.H., Extramural MeSH
- Názvy látek
- retroelementy * MeSH
Retrotransposons comprise a large portion of mammalian genomes. They contribute to structural changes and more importantly to gene regulation. The expansion and diversification of gene families have been implicated as sources of evolutionary novelties. Given the roles retrotransposons play in genomes, their contribution to the evolution of gene families warrants further exploration. In this study, we found a significant association between two major retrotransposon classes, LINEs and LTRs, and lineage-specific gene family expansions in both the human and mouse genomes. The distribution and diversity differ between LINEs and LTRs, suggesting that each has a distinct involvement in gene family expansion. LTRs are associated with open chromatin sites surrounding the gene families, supporting their involvement in gene regulation, whereas LINEs may play a structural role promoting gene duplication. Our findings also suggest that gene family expansions, especially in the mouse genome, undergo two phases. The first phase is characterized by elevated deposition of LTRs and their utilization in reshaping gene regulatory networks. The second phase is characterized by rapid gene family expansion due to continuous accumulation of LINEs and it appears that, in some instances at least, this could become a runaway process. We provide an example in which this has happened and we present a simulation supporting the possibility of the runaway process. Altogether we provide evidence of the contribution of retrotransposons to the expansion and evolution of gene families. Our findings emphasize the putative importance of these elements in diversification and adaptation in the human and mouse lineages.
Zobrazit více v PubMed
Akaike H. 1974. A new look at the statistical model identification. IEEE Trans Automatic Control 19:716–723.
Argueso JL, et al. 2008. Double-strand breaks associated with repetitive DNA can reshape the genome. Proc Natl Acad Sci U S A. 105:11845–11850. PubMed PMC
Ashburner M, et al. 2000. Gene ontology: tool for the unification of biology. The Gene Ontology Consortium. Nat Genet. 25:25–29. PubMed PMC
Bailey JA, Liu G, Eichler EE. 2003. An Alu transposition model for the origin and expansion of human segmental duplications. Am J Hum Genet. 73:823–834. PubMed PMC
Batzer MA, Deininger PL. 2002. Alu repeats and human genomic diversity. Nat Rev Genet. 3:370–379. PubMed
Blake JA, et al. 2014. The Mouse Genome Database: integration of and access to knowledge about the laboratory mouse. Nucleic Acids Res. 42:D810–D817. PubMed PMC
Bourque G. 2009. Transposable elements in gene regulation and in the evolution of vertebrate genomes. Curr Opin Genet Dev. 19:607–612. PubMed
Bourque G, et al. 2008. Evolution of the mammalian transcription factor binding repertoire via transposable elements. Genome Res. 18:1752–1762. PubMed PMC
Brouha B, Jr., et al. 2003. Hot L1s account for the bulk of retrotransposition in the human population. Proc Natl Acad Sci U S A. 100:5280–5285. PubMed PMC
Campbell IM, et al. 2014. Human endogenous retroviral elements promote genome instability via non-allelic homologous recombination. BMC Biol. 12:74.. PubMed PMC
Capy P, Gasperi G, Biemont C, Bazin C. 2000. Stress and transposable elements: co-evolution or useful parasites? Heredity (Edinb) 85 (Pt 2):101–106. PubMed
Consortium EP. 2012. An integrated encyclopedia of DNA elements in the human genome. Nature 489:57–74. PubMed PMC
Emes RD, Goodstadt L, Winter EE, Ponting CP. 2003. Comparison of the genomes of human and mouse lays the foundation of genome zoology. Hum Mol Genet. 12:701–709. PubMed
Fablet M, Vieira C. 2011. Evolvability, epigenetics and transposable elements. Biomol Concepts 2:333–341. PubMed
Fedoroff NV. 2012. Presidential address. Transposable elements, epigenetics, and genome evolution. Science 338:758–767. PubMed
Feschotte C. 2008. Transposable elements and the evolution of regulatory networks. Nat Rev Genet. 9:397–405. PubMed PMC
Fitch DH, et al. 1991. Duplication of the gamma-globin gene mediated by L1 long interspersed repetitive elements in an early ancestor of simian primates. Proc Natl Acad Sci U S A. 88:7396–7400. PubMed PMC
Gene Ontology C. 2015. Gene Ontology Consortium: going forward. Nucleic Acids Res. 43:D1049–D1056. PubMed PMC
Gibbs RA, et al. 2004. Genome sequence of the Brown Norway rat yields insights into mammalian evolution. Nature 428:493–521. PubMed
Goodier JL, Ostertag EM, Du K, Kazazian HH., Jr. 2001. A novel active L1 retrotransposon subfamily in the mouse. Genome Res. 11:1677–1685. PubMed PMC
Hallstrom BM, Janke A. 2010. Mammalian evolution may not be strictly bifurcating. Mol Biol Evol. 27:2804–2816. PubMed PMC
Hastings PJ, Lupski JR, Rosenberg SM, Ira G. 2009. Mechanisms of change in gene copy number. Nat Rev Genet. 10:551–564. PubMed PMC
He X, Zhang J. 2005. Rapid subfunctionalization accompanied by prolonged and substantial neofunctionalization in duplicate gene evolution. Genetics 169:1157–1164. PubMed PMC
Hedges DJ, Deininger PL. 2007. Inviting instability: transposable elements, double-strand breaks, and the maintenance of genome integrity. Mutat Res. 616:46–59. PubMed PMC
Jacques PE, Jeyakani J, Bourque G. 2013. The majority of primate-specific regulatory sequences are derived from transposable elements. PLoS Genet. 9:e1003504.. PubMed PMC
Janoušek V, Karn RC, Laukaitis CM. 2013. The role of retrotransposons in gene family expansions: insights from the mouse Abp gene family. BMC Evol Biol. 13:107.. PubMed PMC
Jordan IK, Rogozin IB, Glazko GV, Koonin EV. 2003. Origin of a substantial fraction of human regulatory sequences from transposable elements. Trends Genet. 19:68–72. PubMed
Jurka J, Bao W, Kojima KK. 2011. Families of transposable elements, population structure and the origin of species. Biol Direct. 6:44.. PubMed PMC
Kapitonov VV, Jurka J. 2008. A universal classification of eukaryotic transposable elements implemented in Repbase. Nat Rev Genet. 9:411–412. author reply 414. PubMed
Karn RC, Laukaitis CM. 2009. The mechanism of expansion and the volatility it created in three pheromone gene clusters in the mouse (Mus musculus) genome. Genome Biol Evol. 1:494–503. PubMed PMC
Karn RC, Chung AG, Laukaitis CM. 2014. Did androgen-binding protein paralogs undergo neo- and/or Subfunctionalization as the Abp gene region expanded in the mouse genome? PLoS One 9:e115454.. PubMed PMC
Katju V, Lynch M. 2003. The structure and early evolution of recently arisen gene duplicates in the Caenorhabditis elegans genome. Genetics 165:1793–1803. PubMed PMC
Kim PM, et al. 2008. Analysis of copy number variants and segmental duplications in the human genome: evidence for a change in the process of formation in recent evolutionary history. Genome Res. 18:1865–1874. PubMed PMC
Kondrashov FA. 2012. Gene duplication as a mechanism of genomic adaptation to a changing environment. Proc Biol Sci. 279:5048–5057. PubMed PMC
Kondrashov FA, Kondrashov AS. 2006. Role of selection in fixation of gene duplications. J Theor Biol. 239:141–151. PubMed
Korbel JO, et al. 2008. The current excitement about copy-number variation: how it relates to gene duplications and protein families. Curr Opin Struct Biol. 18:366–374. PubMed PMC
Lander ES, et al. 2001. Initial sequencing and analysis of the human genome. Nature 409:860–921. PubMed
Lau M. 2012. DiversitySasmpler: functions for re-sampling a community matrix to compute diversity indices at different sampling levels. R package version 2.1.
Lynch M, Force A. 2000. The probability of duplicate gene preservation by subfunctionalization. Genetics 154:459–473. PubMed PMC
Mazerolle M. 2016. AICcmodavg: model selection and multimodel inference based on (Q)AIC(c). R package version 2.0-4.
McClintock B. 1950. The origin and behavior of mutable loci in maize. Proc Natl Acad Sci U S A. 36:344–355. PubMed PMC
McClintock B. 1956. Controlling elements and the gene. Cold Spring Harb Symp Quant Biol. 21:197–216. PubMed
Moretti S, et al. 2007. The M-Coffee web server: a meta-method for computing multiple sequence alignments by combining alternative alignment methods. Nucleic Acids Res. 35:W645–W648. PubMed PMC
Nellaker C, et al. 2012. The genomic landscape shaped by selection on transposable elements across 18 mouse strains. Genome Biol. 13:R45.. PubMed PMC
Neph S, et al. 2012. BEDOPS: high-performance genomic feature operations. Bioinformatics 28:1919–1920. PubMed PMC
Ohno S. 1970. Evolution by gene duplication. New York: Springer Verlag.
Pinheiro J, Bates D, DebRoy S, Sarkar D, Team RC. 2016. nlme: linear and nonlinear mixed effects models. R package version 3.1-124.
Polavarapu N, Marino-Ramirez L, Landsman D, McDonald JF, Jordan IK. 2008. Evolutionary rates and patterns for human transcription factor binding sites derived from repetitive DNA. BMC Genomics 9:226.. PubMed PMC
Ponting CP, Goodstadt L. 2009. Separating derived from ancestral features of mouse and human genomes. Biochem Soc Trans. 37:734–739. PubMed
Povey S, et al. 2001. The HUGO Gene Nomenclature Committee (HGNC). Hum Genet. 109:678–680. PubMed
Qian W, Zhang J. 2008. Gene dosage and gene duplicability. Genetics 179:2319–2324. PubMed PMC
Quinlan AR, Hall IM. 2010. BEDTools: a flexible suite of utilities for comparing genomic features. Bioinformatics 26:841–842. PubMed PMC
Robinson JT, et al. 2011. Integrative genomics viewer. Nat Biotechnol. 29:24–26. PubMed PMC
Romanish MT, Lock WM, van de Lagemaat LN, Dunn CA, Mager DL. 2007. Repeated recruitment of LTR retrotransposons as promoters by the anti-apoptotic locus NAIP during mammalian evolution. PLoS Genet. 3:e10.. PubMed PMC
Schmidt D, et al. 2012. Waves of retrotransposon expansion remodel genome organization and CTCF binding in multiple mammalian lineages. Cell 148:335–348. PubMed PMC
Schrader L, et al. 2014. Transposable element islands facilitate adaptation to novel environments in an invasive species. Nat Commun. 5:5495.. PubMed PMC
She X, Cheng Z, Zollner S, Church DM, Eichler EE. 2008. Mouse segmental duplication and copy number variation. Nat Genet. 40:909–914. PubMed PMC
Sonnhammer EL, Koonin EV. 2002. Orthology, paralogy and proposed classification for paralog subtypes. Trends Genet. 18:619–620. PubMed
Stamatoyannopoulos JA, et al. 2012. An encyclopedia of mouse DNA elements (Mouse ENCODE). Genome Biol. 13:418.. PubMed PMC
Stapley J, Santure AW, Dennis SR. 2015. Transposable elements as agents of rapid adaptation may explain the genetic paradox of invasive species. Mol Ecol. 24:2241–2252. PubMed
Startek M, et al. 2015. Genome-wide analyses of LINE-LINE-mediated nonallelic homologous recombination. Nucleic Acids Res. 43:2188–2198. PubMed PMC
Sundaram V, et al. 2014. Widespread contribution of transposable elements to the innovation of gene regulatory networks. Genome Res. 24:1963–1976. PubMed PMC
Vilella AJ, et al. 2009. EnsemblCompara GeneTrees: complete, duplication-aware phylogenetic trees in vertebrates. Genome Res. 19:327–335. PubMed PMC
Wallace IM, O’Sullivan O, Higgins DG, Notredame C. 2006. M-Coffee: combining multiple sequence alignment methods with T-Coffee. Nucleic Acids Res. 34:1692–1699. PubMed PMC
Waterston RH, et al. 2002. Initial sequencing and comparative analysis of the mouse genome. Nature 420:520–562. PubMed
Weissman DB, Barton NH. 2012. Limits to the rate of adaptive substitution in sexual populations. PLoS Genet. 8:e1002740.. PubMed PMC
Wicker T, et al. 2007. A unified classification system for eukaryotic transposable elements. Nat Rev Genet. 8:973–982. PubMed
Wickham H. 2009. ggplot2: elegant graphics for data analysis. New York: Springer.
Wood S. 2011. Fast stable restricted maximum likelihood and marginal likelihood estimation of semiparametric generalized linear models. J R Stat Soc B Methodological 73:3–36.
Xie M, et al. 2013. DNA hypomethylation within specific transposable element families associates with tissue-specific enhancer landscape. Nat Genet. 45:836–841. PubMed PMC
Yang S, et al. 2004. Patterns of insertions and their covariation with substitutions in the rat, mouse, and human genomes. Genome Res. 14:517–527. PubMed PMC
Expansion of the fatty acyl reductase gene family shaped pheromone communication in Hymenoptera