- Research
- Open access
- Published:
Further varieties of ancient endogenous retrovirus in human DNA
Mobile DNA volume 16, Article number: 11 (2025)
Abstract
A retrovirus inserts its genome into the DNA of a cell, occasionally a germ-line cell that gives rise to descendants of the host organism: it is then called an endogenous retrovirus (ERV). The human genome contains relics from many kinds of ancient ERV. Some relics contributed new genes and regulatory elements. This study finds further kinds of ancient ERV, in the thoroughly-studied human genome version hg38: ERV-Hako, ERV-Saru, ERV-Hou, ERV-Han, and ERV-Goku. It also finds many relics of ERV-V, previously known from just two copies on chromosome 19 with placental genes. It finds a type of ERV flanked by MER41E long terminal repeats (LTRs), with surprisingly little similarity to the known MER41 ERV. ERV-Hako has subtypes that contain sequence from host genes SUSD6 and SPHKAP: the SUSD6 variant was transferred between catarrhine and platyrrhine primates. A retrovirus uses tRNA to prime reverse transcription: Hako is the only human ERV relic that used tRNA-Trp (tryptophan, symbol W), and HERV-W is misnamed because it used tRNA-Arg, based on the Genomic tRNA Database. One ERV-Saru LTR is the previously-described enhancer of AIM2 in innate immunity. This study contributes to understanding primate ERV history, but also shows that related ERVs can have drastic differences, challenging the goal of clearly annotating all ERV relics in genomes.
Introduction
After a retrovirus enters a cell, it reverse-transcribes its RNA genome into DNA and inserts it into the cell’s genome, where it is called a provirus. Occasionally this happens in a germ-line cell, and the provirus is inherited by the host animal’s descendants: it is then called an endogenous retrovirus (ERV). Like any mutation, the ERV may spread by parent-to-child inheritance until it becomes fixed in the host species’ genome, or it may be lost from the population.
Vertebrate genomes have many relics of ancient ERVs: they comprise about 8% of the human genome. They are “fossils” that tell us about ancient virus evolution. Some relics evolved into vertebrate genes and regulatory elements. Typically, one kind of ERV has many relics scattered through the genome. This is due to germ-line re-infection, alternatively, some ERVs evolved to replicate within a cell as retrotransposons [1]. Annotation of ERV relics in genomes (along with other types of repeated element) is often done with the RepeatMasker software [2] and the Dfam database [3]. Dfam has consensus sequences and other information for many kinds of repeated element.
A retroviral provirus has a duplicated segment at its two ends, termed long terminal repeats (LTRs). Between them is an interior region that encodes gag, pol, and env proteins. Most ERV relics, however, are solo LTRs. These presumably arise from homologous recombination between the two LTRs of an ERV, which removes the interior and leaves a single LTR.
A virus may undergo mutation that destroys viral functioning: such a “defective” virus can nevertheless propagate non-autonomously, using molecules from a “helper” virus. A retroviral genome can get joined to a host gene: this can produce new kinds of retrovirus (usually defective) that contain parts of host genes [4]. Such host gene capture was found in crocodilian ERV relics [5]. Koala retrovirus (KoRV) is currently endogenizing in koalas: it has defective forms known as RecKoRV. Interestingly, some koala populations have RecKoRV only, though presumably KoRV must have been there in the past as a helper virus [6].
The retrovirus family Retroviridae (in class Revtraviricetes) is classified into genera including Alpharetrovirus, Betaretrovirus, Gammaretrovirus, Deltaretrovirus, Epsilonretrovirus, and others. Retrovirus classification should be integrated with ERV relics. ERVs have been classified, however, into groups I (related to Gamma- and Epsilonretrovirus), II (related to Alpha- and Betaretrovirus), III, and more.
Finer classification of ERVs is complex. Vargiu et al. classified human ERV relics into 39 clades plus 31 less well-defined groups with many mosaic forms [7]. They focused on relatively complete relics of ERVs that were not drastically defective.
Dfam version 3.8 has 566 types of human ERV relic (41% of all human repeat types), split into 436 LTRs and 130 interior regions. Many of these ERV types seem to lack detailed publications explaining them, and have cryptic names. This hinders learning, and understanding in adjacent fields like genomics. Names like MER41E are hard to remember, and could be any kind of repeat (MEdium Reiteration). Non-specialists often confuse “a mobile element that has LTRs” with “an LTR”. Some RepeatMasker names (e.g. HERV17) differ from the usual name among ERV researchers (HERV-W). A reasonable suggestion is to name ERV types as ERV-name (e.g. ERV-W), where the “ERV-” prefix can be dropped when clear from context [8].
In any case, the types of ancient ERV (and other mobile element) in human DNA have been thoroughly collected. Very ancient and degraded (e.g. Paleozoic) relics are arbitrarily hard to find [9], but maybe all younger types have been found? To investigate this, the present study sought unknown types of mobile element relic, in the thoroughly-studied human genome version hg38. It found new types of not-so-ancient ERV (Table 1). It found other elements not in Dfam that turn out to be previously known: Alu-related ASR and CAS [10], an SVA retrotransposon precursor [11], and an ERV-T1 LTR [12]. All these results have been submitted to Dfam.
Methods
This study sprang from finding human genome regions homologous to transposable element proteins [9]. Some regions homologous to ERV proteins lack clear RepeatMasker annotation, indicating an unknown type of ERV (Fig. 1).
An ERV-Hako relic in human chromosome 1. The black bars near the top show two hako LTRs flanking the interior region (hakoERV-int). Below that are tracks from the UCSC Genome Browser: alignments to other genomes (green), RepeatMasker/Dfam annotations, and self alignments (matches to other parts of the human genome). Screenshot from http://genome.ucsc.edu [13]
Next, new types of mobile element were sought from gaps in genome alignments. For example, if DNA sequence is present in apes but absent in other primates, it indicates an insertion in a common ancestor of apes. This may be a new type of element if a match to known elements is absent, partial, or of unexpected age, for example, an old type of element for a young insertion.
Also, biased coverage of Dfam elements was examined. For example, if an LTR’s left half occurs several times in the genome, it may indicate a new type of LTR whose left half only is similar to the known type.
The edges of a relic were inferred from genome alignment gaps, and target site duplications (TSDs) (Fig. S1). Often, mobile element insertion duplicates some DNA at the insertion site (4–6 bp for a retrovirus) to both sides of the insert. Sometimes, an element inserts inside an older element: then its edges can be inferred from the gap in the older element. Finally, ERVs start with |tg| and end with |ca|.
For each new type of element, several instances were fed to Refiner, which aligns them and makes a consensus sequence [14]. Refiner iteratively improves the consensus by re-aligning the sequences to it. In some tests, the final consensus was impressively accurate [15].
These consensus sequences were searched in mammal genomes (Table S1), by adding them to Dfam’s consensus sequences for that genome, then using “split-alignment” [16]. This optimizes alignments where each genome base-pair matches at most one base-pair from all consensus sequences, thus adjudicating between similar consensus sequences [17].
LAST was used to find genome-to-genome, genome-to-repeat, and DNA-to-protein alignments. LAST finds alignments accurately based on rates of substitutions, insertion, and deletion for each kind of sequence comparison, including pseudogenized DNA versus homologous proteins [9]. Method details are in the Supplement.
Results
ERV-Hako
ERV-Hako is a group II ERV, with a tendency to contain parts of host genes. (Hako is Japanese for box/container). An example is shown in Fig. 1. The genome alignments (green) show this ERV was inserted in a common ancestor of catarrhines (apes and Old World monkeys), after their last common ancestor with platyrrhines (New World monkeys, e.g. marmoset and squirrel monkey). It was reduced to a solo LTR in Old World monkeys (rhesus and green monkey), but not in apes (gibbon, orangutan, human).
The RepeatMasker annotation (Fig. 1) shows patchy similarity to known ERV-K sequences, but doesn’t recognize the LTRs. The human self alignments show that the LTRs are repeated many times in the genome, and the interior is occasionally repeated. The Hako consensus sequence has regions homologous to known ERV proteins (Fig. S2), and Hako insertions produced target site duplications of 5 or 6 bp (Fig. S1).
This ERV relic contains a segment that doesn’t match the Hako consensus sequence (Fig. 1 arrowheads). This segment matches parts of the \(3^{\prime }\)-UTR (untranslated region) of a gene: SUSD6 (sushi domain containing 6) (Fig. 2). Other Hakos on chromosomes 3 and X also have this SUSD6 insert. Another Hako on chromosome 1 has part of the \(3^{\prime }\)-UTR of SPHKAP (SPHK1 interactor, AKAP domain containing) (Fig. S3). This is similar to a retrovirus carrying a cellular gene [4].
The Hako relic from Fig. 1 (vertical, top) mostly matches the Hako conensus sequence (horizontal, left), but partly matches the \(3^{\prime }\)-UTR of SUSD6 (horizontal, right). The HakoP consensus sequence (vertical, bottom) is similar. The diagonal lines show which parts of the vertical sequences are similar to which parts of the horizontal sequences
There is a similar type of ERV, AloPal-5.1366, in the New World howler monkey Alouatta palliata [3]. Here it is named ERV-HakoP (for platyrrhine). Remarkably, the HakoP consensus sequence has the same SUSD6 insertion (Fig. 2). The ERV in Fig. 1 is intermediate between Hako and HakoP: its LTRs are much more similar to Hako (Fig. S4). No HakoP LTRs were found in non-platyrrhines, but platyrrhines have both HakoP and Hako-like LTRs. Alignments between platyrrhine genomes (Table S2) show that most of these insertions are common to diverse platyrrhines (howler monkey, marmoset, squirrel monkey, titi). Thus, they were inserted before the last common ancestor of platyrrhines \(\sim\)21 million years ago (mya) [18]. Full-length Hako ERVs were not found in platyrrhines, but an intermediate form was found (Fig. S5).
Most Hako insertions are common to catarrhines, or common to platyrrhines, but not common to both. However, two Hako relics are shared by catarrhines and platyrrhines (Fig. S6). This suggests ERV-Hako entered the genomes shortly before the end of gene flow between diverging catarrhines and platyrrhines.
Retrovirus reverse transcription is primed from the \(3^{\prime }\)-end of tRNA, base-paired with a PBS (primer binding site) at the \(5^{\prime }\)-edge of the interior region. The type of tRNA was inferred by comparing ERV sequences to human mature tRNAs in GtRNAdb release 21 [19]. This indicated that Hako and HakoP used tRNA-Trp (Fig. 3), whereas most group II human ERV relics used tRNA-Lys. Remarkably, Hako is the only type of human ERV relic that used tRNA-Trp, according to GtRNAdb. Another ERV was named HERV-W because it was thought to use tRNA-Trp. Vargiu et al. mentioned ambiguity between the tRNA-Trp sequence “tggcgaccacgaagggac” and tRNA-Arg in the Leipzig tRNA database [7]. According to GtRNAdb, however, this sequence is tRNA-Arg. (At the time of writing this, the Leipzig database is not available.)
One Hako relic contains the transcription start site of MS4A15 (membrane spanning 4-domains A15), a gene that increases proliferation and ferroptosis resistance of cancer cells [20, 21]. This gene is conserved across mammals, but its promoter was replaced during catarrhine evolution.
ERV-V
ERV-V is a group I ERV, which infected a common ancestor of simians, and has two copies in the human genome, which are near neighbors in chromosome 19 [22, 23]. A third copy lies between them in non-hominines. Remarkably, they encode proteins derived from env, gag, and an unusual pre-gag gene, which are expressed in placenta and exhibit evolutionary patterns suggesting they contribute to the animal’s fitness [22,23,24]. Previous studies found no other copies or solo LTRs [22, 24], though Boso et al. found similar gag and pre-gag sequences in five other human ERV relics [23].
The present study reconstructed a consensus sequence that matches the ERV-V LTRs and occurs about 90 times in the human genome, mostly as solo LTRs. Parts of this LTR are similar to other LTRs in Dfam (Fig. S7). One of the five ERVs found by Boso et al. in chromosome X has this LTR at it’s \(5^{\prime }\)-end. (The \(3^{\prime }\)-end is missing.) Another ERV relic in chromosome 12 has this LTR at both ends (Fig. S8). These two ERVs have partial similarity to the chromosome 19 ERV-Vs (Fig. 4), and to each other (Fig. S9). They have fragmentary matches to gag, pol, and env genes (Fig. S10). Surprisingly, they were primed by tRNA-His, whereas the chromosome 19 ERV-Vs were primed by tRNA-Val (Fig. 3).
Two ERV relics, in human chromosomes 12 and X, are partially similar to ERV-V in chromosome 19. The two chromosome 19 ERVs are similar to each other apart from one gap. This figure just shows the interior ERV regions, excluding the LTRs. The ERVs are shown after removing retrotransposons (e.g. Alu) inserted within them
One ERV-V LTR contains the transcription start site of LY6K (lymphocyte antigen 6 family member K). LY6K plays a role in cancer [25], and contributes to sperm function in mice [26], which however do not have this ERV-V promoter.
ERV-Saru
ERV-Saru entered the genome in a common ancestor of simians \(\gtrsim\)43 mya, after their last common ancestor with other primates \(\sim\)69 mya. (Saru is Japanese for monkey.) It survives only as solo LTRs. It is similar to other types of LTR: most of it is similar to MER41G (59% identity), but its last 100 bp are more similar to LTR43B and its first 100 bp are not similar to any known LTR (Fig. S11). Surprisingly, MER41G is classified as Gammaretrovirus and LTR43B as Epsilonretrovirus [27]. Remarkably, one Saru LTR regulates innate immunity: it is an interferon-induced enhancer of AIM2 (Absent in Melanoma 2) [28]. It was thought to be a MER41 element, but is unambiguously a full-length Saru LTR (which could be regarded as a MER41 variant).
ERV-Hou
ERV-Hou also entered the genome in a common ancestor of simians, after their last common ancestor with other primates. (Hou is Chinese for monkey.) It survives mostly as solo LTRs, but there are two interior ERV regions, in chromsomes 1 and 22. These interiors are homologous only in their \(3^{\prime }\) parts (Fig. 5). Some parts of these interiors are homologous to other types of ERV (Fig. S12), which are classified as Gammaretrovirus [27].
Only a small fragment of the Hou LTR is similar to other known LTR types (Fig. S13). Other fragments of the Hou LTR, however, have repeated homologs in diverse mammals (Fig. S13). This suggests that those mammals harbor unknown types of LTR partially homologous to Hou LTR. Near-full-length homologs of Hou LTR appear in Cetruminantia (ruminants, hippos and whales) (Fig. S14).
One Hou LTR contains a transcription start site that produces a shorter variant of APOL3 (apolipoprotein L3). APOL3 is a protein that kills intracellular bacteria [29].
ERV-Han
ERV-Han survives only as solo LTRs (Fig. 6). It entered the genome in a common ancestor of simians, after their last common ancestor with other primates. This LTR’s \(3^{\prime }\) half is similar to LTR43B (Epsilonretrovirus), it has a middle region similar to the ERV-Goku LTR, and its \(5^{\prime }\) region is not similar to any other repeat element (Fig. S15).
An ERV-Han solo LTR in human chromosome 4. The green genome alignments indicate that it was inserted in a common ancestor of simians (human, baboon, and squirrel monkey), after their last common ancestor with bushbaby. The self alignments show that there are similar sequences elsewhere in the human genome, which are also ERV-Han LTR relics
One ERV-Han LTR contains a transcription start site of SIGLECL1 (SIGLEC family like 1). This gene encodes a cell-surface protein that recognizes N-glycolylneuraminic acid, except in humans, who have lost the ability to synthesize N-glycolylneuraminic acid [30].
ERV-Goku
ERV-Goku also entered the genome before the last common ancestor of simians, after their last common ancestor with other primates. (Goku is a simian story character.) It survives mostly as solo LTRs, but there are four interior regions in hg38, on chromosomes 3, 13, 19, X, and a fifth fragment on chromosome 10. Part of the LTR is similar to LTR35B (Gammaretrovirus), and part is similar to the ERV-Han LTR (Fig. S16). The interior regions have patchy similarity to other ERVs classified as Gammaretrovirus (Fig. S17). They also have full-length homology to gag, pol, and env proteins (Fig. S17), suggesting that ERV-Goku may have been autonomous.
Surprisingly, ERV-Goku also appears in non-simian primate genomes (Fig. S18). Goku LTR relics are shared by mouse lemur, slow loris, and bushbaby: they entered the genome before the last common ancestor of strepsirrhines \(\sim\)59 mya. Goku LTR relics are also in tarsiers. Thus, all primates have Goku relics, which may have entered their genomes around the same time. Goku was not found in colugo (the closest relative to primates) or other mammals (Table S1).
ERV-MER41E
A MER41 LTR was described in [31]. Today, Dfam has LTR subtypes MER41A, B, C, D, E, and G, and one interior sequence MER41-int. They are all classified as Gammaretrovirus, though MER41C is surprisingly in a separate ERV sub-group [27].
Some MER41 LTR relics bind to STAT1 transcription factors and regulate innate immunity [28, 32]. Others overlap placental enhancers and bind to SRF and other transcription factors [33, 34]. One MER41 evolved to produce the noncoding RNA BANCR, which plays a role in heart development [35].
This study reconstructed an interior sequence that appears between MER41E LTRs. Remarkably, just its final 108 bp are similar to MER41-int. Instead, it has patchy similarity to other ERV sequences in Dfam (Fig. S19). It matches gag, pol, and env proteins, but has a large deletion overlapping pol and env, implying it was non-autonomous (Fig. S19).
Genome alignments suggest that all MER41 subtypes entered the genome in a common ancestor of simians, after its last common ancestor with other primates. However, MER41-like ERVs colonized other mammals independently [28]. These MER41-like ERVs are similar to parts of the MER41 ERVs (Fig. S20). Interestingly, tarsier has LTR relics with full-length similarity to MER41E and MER41G (Fig. S21).
Discussion
This study found new types of not-so-ancient ERV in the thoroughly-studied human genome version hg38: thus not all types were found already. This contributes to understanding our genome’s history. For example, the AIM2 enhancer comes from a specific low-copy ERV type ERV-Saru, and some half LTR43B elements are actually half of ERV-Han LTRs. The ERVs triggered evolution of gene regulation, by contributing transcription start sites to genes: ERV-Hako to MS4A15, ERV-V to LY6K, ERV-Hou to APOL3, and ERV-Han to SIGLECL1.
It was hoped this study might get close to finding all types of not-too-ancient mobile element. In parallel, however, Takeda et al. found two new types of LTR (which are in hg38), with no overlap to those found here [36]. This suggests we are still not finished.
An appealing goal is to reconstruct all types of ancient mobile element, so that we can match the genome against them and clearly annotate all relics. This may be unachievable due to drastic variation among related elements, especially ERVs. For example, there are four ERV-V interiors in hg38: two are tandem duplicates, and the other two are so different that it seems hard or inappropriate to reconstruct one original sequence.
Since different types of ERV are partially similar to each other, it is not obvious when they should be regarded as different. A pragmatic answer is that they should have separate entries in a database such as Dfam when doing so clearly improves genome annotation. For example, after defining ERV-Han, we can annotate the full LTR relic in Fig. 6, which is not a one-off but occurs \(\sim\)60 times in the genome.
It would be convenient to include ERV relics in a standard biological classification with genera, subgenera, species, etc. This seems difficult since ERVs have partial similarities forming a phylogenetic network rather than a tree.
It is interesting that similar ERV insertions are in different primate lineages though they seem to be younger than the last common ancestor of those lineages. This could be explained by inter-species viral transmission, or interbreeding and gene flow. A low level of gene flow from lineage A to B might not get fixed in the B genome, leaving no trace, except that active mobile elements could be transmitted. By one of these mechanisms, an ERV-Hako containing part of a host gene was transmitted between catarrhines and platyrrhines, indicating that host DNA can be transferred between lineages. Also, ERVs in one lineage can be better understood by studying other lineages. For example, platyrrhines have only defective Hakos, like koalas with only RecKoRV: catarrhines reveal the ancestral ERV-Hako.
Data availability
For the discovered kinds of mobile element that were not in Dfam, Refiner alignments have been submitted to Dfam.
References
Magiorkinis G, Gifford RJ, Katzourakis A, De Ranter J, Belshaw R. Env-less endogenous retroviruses are genomic superspreaders. Proc Natl Acad Sci. 2012;109(19):7385–90.
Smit A, Hubley R, Green P. RepeatMasker Open-4.0. 2013–2015. http://www.repeatmasker.org. Accessed 1 Feb 2025.
Storer J, Hubley R, Rosen J, Wheeler TJ, Smit AF. The Dfam community resource of transposable element families, sequence models, and genome annotations. Mob DNA. 2021;12:1–14.
Vogt PK. Retroviral oncogenes: a historical primer. Nat Rev Cancer. 2012;12(9):639–48.
Chong AY, Kojima KK, Jurka J, Ray DA, Smit AF, Isberg SR, et al. Evolution and gene capture in ancient endogenous retroviruses - insights from the crocodilian genomes. Retrovirology. 2014;11:1–15.
Tarlinton R, Legione A, Sarker N, Fabijan J, Meers J, McMichael L, et al. Differential and defective transcription of koala retrovirus indicates the complexity of host and virus evolution. J Gen Virol. 2022;103(6):001749.
Vargiu L, Rodriguez-Tomé P, Sperber GO, Cadeddu M, Grandi N, Blikstad V, et al. Classification and characterization of human endogenous retroviruses; mosaic forms are common. Retrovirology. 2016;13:1–29.
Gifford RJ, Blomberg J, Coffin JM, Fan H, Heidmann T, Mayer J, et al. Nomenclature for endogenous retrovirus (ERV) loci. Retrovirology. 2018;15:1–11.
Frith MC. Paleozoic protein fossils illuminate the evolution of vertebrate genomes and transposable elements. Mol Biol Evol. 2022;39(4):msac068.
Parrott AM, Tsai M, Batchu P, Ryan K, Ozer HL, Tian B, et al. The evolution and expression of the snaR family of small non-coding RNAs. Nucleic Acids Res. 2011;39(4):1485–500.
Han K, Konkel MK, Xing J, Wang H, Lee J, Meyer TJ, et al. Mobile DNA in Old World monkeys: a glimpse through the rhesus macaque genome. Science. 2007;316(5822):238–40.
Blanco-Melo D, Gifford RJ, Bieniasz PD. Co-option of an endogenous retrovirus envelope for host defense in hominid ancestors. Elife. 2017;6:e22519.
Nassar LR, Barber GP, Benet-Pagès A, Casper J, Clawson H, Diekhans M, et al. The UCSC genome browser database: 2023 update. Nucleic Acids Res. 2023;51(D1):D1188–95.
Flynn JM, Hubley R, Goubert C, Rosen J, Clark AG, Feschotte C, et al. RepeatModeler2 for automated genomic discovery of transposable element families. Proc Natl Acad Sci. 2020;117(17):9451–7.
Hubley R, Wheeler TJ, Smit AF. Accuracy of multiple sequence alignment methods in the reconstruction of transposable element families. NAR Genomics Bioinforma. 2022;4(2):lqac040.
Frith MC, Kawaguchi R. Split-alignment of genomes finds orthologies more accurately. Genome Biol. 2015;16:1–17.
Carey KM, Hubley R, Lesica GT, Olson D, Roddy JW, Rosen J, et al. PolyA: a tool for adjudicating competing annotations of biological sequences. bioRxiv. 2021;2021–02. https://doiorg.publicaciones.saludcastillayleon.es/10.1101/2021.02.13.430877.
Kumar S, Suleski M, Craig JM, Kasprowicz AE, Sanderford M, Li M, et al. TimeTree 5: an expanded resource for species divergence times. Mol Biol Evol. 2022;39(8):msac174.
Chan PP, Lowe TM. GtRNAdb 2.0: an expanded database of transfer RNA genes identified in complete and draft genomes. Nucleic Acids Res. 2016;44(D1):D184–D189.
Xin S, Mueller C, Pfeiffer S, Kraft VA, Merl-Pham J, Bao X, et al. MS4A15 drives ferroptosis resistance through calcium-restricted lipid remodeling. Cell Death Differ. 2022;29(3):670–86.
Fang Y, Yu H, Zhou H. MS4A15 acts as an oncogene in ovarian cancer through reprogramming energy metabolism. Biochem Biophys Res Commun. 2022;598:47–54.
Kjeldbjerg AL, Villesen P, Aagaard L, Pedersen FS. Gene conversion and purifying selection of a placenta-specific ERV-V envelope gene during simian evolution. BMC Evol Biol. 2008;8:1–11.
Boso G, Fleck K, Carley S, Liu Q, Buckler-White A, Kozak CA. The oldest co-opted gag gene of a human endogenous retrovirus shows placenta-specific expression and is upregulated in diffuse large B-cell lymphomas. Mol Biol Evol. 2021;38(12):5453–71.
Blaise S, De Parseval N, Heidmann T. Functional characterization of two newly identified Human Endogenous Retrovirus coding envelope genes. Retrovirology. 2005;2:1–4.
Park S, Park D, Han S, Chung GE, Soh S, Ka HI, et al. LY6K depletion modulates TGF-\(\upbeta\) and EGF signaling. Cancer Med. 2023;12(11):12593–607.
Fujihara Y, Okabe M, Ikawa M. GPI-anchored protein complex, LY6K/TEX101, is required for sperm migration into the oviduct and male fertility in mice. Biol Reprod. 2014;90(3):60–1.
Kojima KK. Human transposable elements in Repbase: genomic footprints from fish to humans. Mob DNA. 2018;9(1):2.
Chuong EB, Elde NC, Feschotte C. Regulatory evolution of innate immunity through co-option of endogenous retroviruses. Science. 2016;351(6277):1083–7.
Gaudet RG, Zhu S, Halder A, Kim BH, Bradfield CJ, Huang S, et al. A human apolipoprotein L with detergent-like activity kills intracellular pathogens. Science. 2021;373(6552):eabf8113.
Angata T, Varki NM, Varki A. A second uniquely human mutation affecting sialic acid biology. J Biol Chem. 2001;276(43):40282–7.
Jurka J, Kapitonov VV, Klonowski P, Walichiewicz J, Smit AF. Identification of new medium reiteration frequency repeats in the genomes of Primates, Rodentia and Lagomorpha. Genetica. 1996;98:235–47.
Schmid CD, Bucher P. MER41 repeat sequences contain inducible STAT1 binding sites. PLoS ONE. 2010;5(7):e11425.
Sun MA, Wolf G, Wang Y, Senft AD, Ralls S, Jin J, et al. Endogenous retroviruses drive lineage-specific regulatory evolution across primate and rodent placentae. Mol Biol Evol. 2021;38(11):4992–5004.
Du C, Jiang J, Li Y, Yu M, Jin J, Chen S, et al. Regulation of endogenous retrovirus-derived regulatory elements by GATA2/3 and MSX2 in human trophoblast stem cells. Genome Res. 2023;33(2):197–207.
Wilson KD, Ameen M, Guo H, Abilez OJ, Tian L, Mumbach MR, et al. Endogenous retrovirus-derived lncRNA BANCR promotes cardiomyocyte migration in humans and non-human primates. Dev Cell. 2020;54(6):694–709.
Takeda A, Nonaka D, Imazu Y, Fukunaga T, Hamada M. REPrise: de novo interspersed repeat detection using inexact seeding. bioRxiv. 2024;2024–01. https://doiorg.publicaciones.saludcastillayleon.es/10.1101/2024.01.21.576581.
Acknowledgements
Arian Smit co-authored the MER41E interior region reconstruction.
Funding
This work was supported by the Japan Science and Technology Agency [JPMJCR21N6].
Author information
Authors and Affiliations
Contributions
There is just one author.
Corresponding author
Ethics declarations
Ethics approval and consent to participate
Not applicable.
Consent for publication
Not applicable.
Competing interests
The authors declare no competing interests.
Additional information
Publisher’s Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary Information
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Frith, M.C. Further varieties of ancient endogenous retrovirus in human DNA. Mobile DNA 16, 11 (2025). https://doiorg.publicaciones.saludcastillayleon.es/10.1186/s13100-025-00348-x
Received:
Accepted:
Published:
DOI: https://doiorg.publicaciones.saludcastillayleon.es/10.1186/s13100-025-00348-x