Supplementary table. RefSeq transcripts beginning within a TEᵃ

Gene / Probable TE involvement in humanᵇ / TE in mouseᶜ / Human expressionᵈ / Mouse expressionᵉ
CYP19 / LTR is one of at least six promoters (primer extension [1]; promoter constructs [2]; review [3]) / No / LTR drives very high placental expression (transgenic mice [4]) / No placental expression [3]
TMPRSS3 / LTR and Aluᶠ as an alternative promoter (5’ RACE [5]) / No / TE form expressed primarily in PBLs; other forms are widespread (RT-PCR [5]) / Inner ear, kidney, stomach, testis (RT-PCR [6])
HYAL-4 / Antisense L1 and Aluᶠ as only known promoter (5’ RACE [7]) / No / Primarily placenta (northern [7]) / Primarily skin (one cDNA, seven ESTs [8])
ENTPD1 (CD39) / LTR as one of two promoters, resulting in a HERV-derived N-terminus (5’ RACE [9]) / No / LTR drives expression in placenta (two cDNAs and one EST [8]) and melanoma (two ESTs [8]); overall expression widespread (northern [10,11]) / Widespread (northern [12]; three cDNAs [8])
CASPR4 / LTR as one of three promoters, resulting in a HERV-derived N-terminus / No / LTR form expressed in brain, testis, tumors (one cDNA for each [8]); overall expression high in brain and spinal cord (northern [13]; three cDNAs [8]) / Brain (northern [13])
MKKS / LTR and L2ᶠ as an alternative promoter (5’ RACE [14]) / No / TE form in testis (one EST) and fetal tissues (5’ RACE [14]); overall expression widespread (northern [14]) / Widespread (northern [14])
CA1 / LTR as one of two major promoters (S1 mapping, primer extension, promoter constructs [15]) / Yes / LTR drives erythroid expression (primer extension [15,16]) / LTR drives erythroid expression [17]
SPAM1 (PH20) / Antisense ERV as only known promoter / Yes / Primarily testis (northern [7]; two cDNAs [8]) / Primarily testis (northern [18])
KLK11 / MIR as one of three promoters, leading to an alternative N-terminus (5’ RACE [19]) / Yes / Widespread (northern [19]; RT-PCR [20]) / Brain and prostate (northern [21])
MSLN (MPF) / LTR as one of two promoters and part of 5’ UTR of other transcript form; alternative promoter is an MIR (promoter constructs [22]) / Yes for both / Widespread (northern [23,24]) / Widespread but not from LTR or MIR (northern [25]; ~10 cDNAs and ESTs [8])
BAAT / LTR as the only known promoter / No / Liver (three cDNAs [8]) / Liver (northern [26])
MAD1L1 / LTR as one of two promoters and part of 5’ UTR of other form (5’ RACE [27]) / No / LTR form in tumors (~10 cDNAs and ESTs [8]); other form widespread (northern [27]) / Widespread [8]
CLDN14 / LTR as one of two promoters / No / LTR form in melanoma and skin (one cDNA [8]) and kidney (northern [28]); other form in liver (northern [28]) / Widespread [29]
SIAT1 / ERV as one of at least three promoters / No / ERV form in mature B cells (northern [30]); other forms in various tissues (northern [30,31]) / B-cell and liver expression from multiple promoters (northern [32])
FUT5 / Antisense Alu and L1ᶠ as only known promoter (5’ RACE [33]) / NA / Low expression in colon, liver; much lower expression compared with related FUTs that use other promoters (RT-PCR [33]) / FUTs expanded after human–mouse split so no mouse ortholog
ILT2 (LIR1, LILRB1) / ERV as alternative promoter / NA / Only ILT known to be expressed in natural killer cells [34] / ILTs expanded after human–mouse split
ᵃAbbreviations: Alu, Alu repeat sequences; ERV, endogenous retrovirus-like element; LTR, long terminal repeat; L1 and L2, long interspersed nuclear element family members; MIR, mammalian interspersed element; NA, not applicable; TE, transposable element.
ᵇAs predicted from Genome Browser annotation, databases or literature. TE promoters are depicted in bold if specifically analyzed in literature by promoter assays or 5’ RACE (rapid amplification of cDNA ends). Evidence for the TE as promoter is indicated in parentheses with references.
ᶜPresence of TE in mouse was determined by Genome Browser annotation, BLAST and dot plot alignments.
ᵈInformation from literature, where available, or expression databases. Expression pattern of the TE-initiated form is given if known. The type of evidence and references are given in parentheses.
ᵉInformation from literature or databases with attention to patterns that differ from human.
ᶠThe apparent promoter region is composed of sections of two different TEs.

References

1 Means, G.D. et al. (1989) Structural analysis of the gene encoding human aromatase cytochrome P-450, the enzyme responsible for estrogen biosynthesis. J. Biol. Chem. 264, 19385–19391

2 Kamat, A. et al. (1998) Characterization of the regulatory regions of the human aromatase (P450arom) gene involved in placenta-specific expression. Mol. Endocrinol. 12, 1764–1777

3 Kamat, A. et al. (2002) Mechanisms in tissue-specific regulation of estrogen biosynthesis in humans. Trends Endocrinol. Metab. 13, 122–128

4 Kamat, A. et al. (1999) A 500-bp region, ~40 kb upstream of the human CYP19 (aromatase) gene, mediates placenta-specific expression in transgenic mice. Proc. Natl. Acad. Sci. U. S. A. 96, 4575–4580

5 Scott, H.S. et al. (2001) Insertion of beta-satellite repeats identifies a transmembrane protease causing both congenital and childhood onset autosomal recessive deafness. Nat. Genet. 27, 59–63

6 Guipponi, M. et al. (2002) The transmembrane serine protease (TMPRSS3) mutated in deafness DFNB8/10 activates the epithelial sodium channel (ENaC) in vitro. Hum. Mol. Genet. 11, 2829–2836

7 Csoka, A.B. et al. (1999) Expression analysis of six paralogous human hyaluronidase genes clustered on chromosomes 3p21 and 7q31. Genomics 60, 356–361

8 Genome Browser, U.C.S.C. November 2002 and April 2003 Freeze

9 Matsumoto, M. et al. (1999) The cDNA cloning of human placental ecto-ATP diphosphohydrolases I and II. FEBS Lett. 453, 335–340

10 Chadwick, B.P. and Frischauf, A.M. (1998) The CD39-like gene family: identification of three new human members (CD39L2, CD39L3, and CD39L4), their murine homologues, and a member of the gene family from Drosophila melanogaster. Genomics 50, 357–367

11 Kaczmarek, E. et al. (1996) Identification and characterization of CD39/vascular ATP diphosphohydrolase. J. Biol. Chem. 271, 33116–33122

12 Maliszewski, C.R. et al. (1994) The CD39 lymphoid cell activation antigen. Molecular cloning and structural characterization. J. Immunol. 153, 3574–3583

13 Spiegel, I. et al. (2002) Caspr3 and caspr4, two novel members of the caspr family are expressed in the nervous system and interact with PDZ domains. Mol. Cell. Neurosci. 20, 283–297

14 Stone, D.L. et al. (2000) Mutation of a gene encoding a putative chaperonin causes McKusick-Kaufman syndrome. Nat. Genet. 25, 79–82

15 Brady, H.J. et al. (1989) Multiple GF-1 binding sites flank the erythroid specific transcription unit of the human carbonic anhydrase I gene. FEBS Lett. 257, 451–456

16 Brady, H.J. et al. (1991) The human carbonic anhydrase I gene has two promoters with different tissue specificities. Biochem. J. 277, 903–905

17 Fraser, P. et al. (1989) The mouse carbonic anhydrase I gene contains two tissue-specific promoters. Mol. Cell. Biol. 9, 3308–3313

18 Zheng, Y. and Martin-Deleon, P.A. (1999) Characterization of the genomic structure of the murine Spam1 gene and its promoter: evidence for transcriptional regulation by a cAMP-responsive element. Mol. Reprod. Dev. 54, 8–16

19 Mitsui, S. et al. (2000) A novel isoform of a kallikrein-like protease, TLSP/hippostasin, (PRSS20), is expressed in the human brain and prostate. Biochem. Biophys. Res. Commun. 272, 205–211

20 Yousef, G.M. et al. (2000) Genomic organization, mapping, tissue expression, and hormonal regulation of trypsin-like serine protease (TLSP PRSS20), a new member of the human kallikrein gene family. Genomics 63, 88–96

21 Mitsui, S. et al. (2000) cDNA cloning and tissue-specific splicing variants of mouse hippostasin/TLSP (PRSS20). Biochim. Biophys. Acta 1494, 206–210

22 Urwin, D. and Lake, R.A. (2000) Structure of the Mesothelin/MPF gene and characterization of its promoter. Mol. Cell Biol. Res. Commun. 3, 26–32

23 Kojima, T. et al. (1995) Molecular cloning and expression of megakaryocyte potentiating factor cDNA. J. Biol. Chem. 270, 21984–21990

24 Chang, K. and Pastan, I. (1996) Molecular cloning of mesothelin, a differentiation antigen present on mesothelium, mesotheliomas, and ovarian cancers. Proc. Natl. Acad. Sci. U. S. A. 93, 136–140

25 Bera, T.K. and Pastan, I. (2000) Mesothelin is not required for normal mouse development or reproduction. Mol. Cell. Biol. 20, 2902–2906

26 Falany, C.N. et al. (1997) Cloning, expression, and chromosomal localization of mouse liver bile acid CoA:amino acid N-acyltransferase. J. Lipid Res. 38, 1139–1148

27 Jin, D.Y. et al. (1998) Human T cell leukemia virus type 1 oncoprotein Tax targets the human mitotic checkpoint protein MAD1. Cell 93, 81–91

28 Wilcox, E.R. et al. (2001) Mutations in the gene encoding tight junction claudin-14 cause autosomal recessive deafness DFNB29. Cell 104, 165–172

29 Reymond, A. et al. (2002) Human chromosome 21 gene expression atlas in the mouse. Nature 420, 582–586

30 Wang, X. et al. (1993) Chromosome mapping and organization of the human beta-galactoside alpha 2,6-sialyltransferase gene. Differential and cell-type specific usage of upstream exon sequences in B-lymphoblastoid cells. J. Biol. Chem. 268, 4355–4361

31 Kitagawa, H. and Paulson, J.C. (1994) Differential expression of five sialyltransferase genes in human tissues. J. Biol. Chem. 269, 17872–17878

32 Wuensch, S.A. et al. (2000) Murine B cell differentiation is accompanied by programmed expression of multiple novel beta-galactoside alpha2, 6-sialyltransferase mRNA forms. Glycobiology 10, 67–75

33 Cameron, H.S. et al. (1995) Expression of human chromosome 19p alpha(1,3)-fucosyltransferase genes in normal tissues. Alternative splicing, polyadenylation, and isoforms. J. Biol. Chem. 270, 20112–20122

34 Martin, A.M. et al. (2002) Leukocyte Ig-like receptor complex (LRC) in mice and men. Trends Immunol. 23, 81–88