Gene transcriptions/Boxes

(Redirected from Boxes)

"[A] repeating sequence of nucleotides that forms a transcription or a regulatory signal"[1] is a box.

Genetics

edit
 
This is an image of Bob, the guinea pig. Credit: selbst.

Genetics involves the expression, transmission, and variation of inherited characteristics.

Def. a "branch of biology that deals with the transmission and variation of inherited characteristics, in particular chromosomes and DNA"[2] is called genetics.

Theoretical box genetics

edit

Def. one "of two specific regions in a promoter"[3] is called a box.

AGC boxes

edit
 
This is a digital photograph of Arabidopsis thaliana. Credit: Alberto Salguero Quiles en Getafe (Madrid), España.

"[T]he AGC box (10), GCC element (11), or AGCCGCC sequence (13), is an ethylene-responsive element found in the promoters of a large number of [pathogenesis related] PR genes".[4]

ATA boxes

edit

The ATA box is a variant of the TATA box that appears in the globin and other genes. Instead of a sequence TATA as in the TATA box, the ATA box lacks the first thymine (T) and may be tissue specific.

An ATA box may have the sequence AAATAT.[5]

CAAT boxes

edit
 
As representative of the Metazoa here is an image of a twaid shad. Credit: Hans Hillewaert.

A CCAAT box (also sometimes abbreviated a CAAT box or CAT box) is a distinct pattern of nucleotides along the template strand of DNA in eukaryotes.

CArG boxes

edit
File:Smooth muscle cell CArG.jpeg
The diagram shows a model for epigenetic regulation of SRF binding to CArG box chromatin. Credit: Oliver G. McDonald, Brian R. Wamhoff, Mark H. Hoofnagle, and Gary K. Owens.

CArG boxes are present in the promoters of smooth muscle cell genes.

"CArG box [CC(A/T)6GG] DNA [consensus] sequences present within the promoters of SMC genes play a pivotal role in controlling their transcription".[6]

C/D boxes

edit
 
This example of a C/D box is a small nucleolar RNA 73 (snoRNA U73). Credit: Rfam database (RF00071).

"Located within the introns of very long transcripts extending downstream of SNRPN, there are clusters of paternally expressed C/D box–containing snoRNAs that are highly expressed in the brain5,6."[7]

For "box C/D snoRNAs, boxes C and D and an adjoining stem form a vital structure, known as the box C/D motif."[8]

"The [C and D] box elements are essential for snoRNA production [transcription] and for snoRNA-directed modification of rRNA nucleotides."[8]

CENP-B boxes

edit
File:CENP-B box.png
The 17-bp motif of the CENP-B box repeats in DNA monomers. Credit: Jun-ichirou Ohzeki, Megumi Nakano, Teruaki Okada, Hiroshi Masumoto.

"The human α-satellite consensus sequence contains only three CpG sequences within its 171 base-pair sequence [23]. Interestingly, two of the three CpG sequences in the α-satellite consensus sequence are located within site 1 (5′-pTpTpCpG-3′) and site 3 (5′-pCpGpGpG-3′) of the CENP-B box (Fig. 1A;[9])."[9]

CGCG boxes

edit

"The minimum DNA-binding elements are 6-bp CGCG box, (A/C/G)CGCG(C/G/T)."[10]

"The promoter regions are assumed to be within ∼1 kb upstream of the starting transcription site (for the known genes) or the first ATG (for the predicted genes). These genes are related to ethylene signaling (EIN3) and ABA signaling (a putative ABA responsive protein), light perception (phytochrome A, phyA), stress responsive such as the DNA repairing protein, heat shock protein, touch protein (TCH 4), and CaM-regulated ion channel. CaM genes (CaM2 andCaM3) and AtSR6 also contains CGCGcis-elements in their promoter regions."[10]

Chromoboxes

edit

Notation: let chromo stand for chromatin organization modifier.[11]

A "[c]hromatin organization modifier (chromo) domain is a conserved region of around 50 amino acids found in a variety of chromosomal proteins, which appear to play a role in the functional organization of the eukaryotic nucleus."[12]

DREB boxes

edit

It "is likely that the present OsDREBL may preferably bind to other elements such as the ethylene responsive element GCC box (AGCCGCC), rather than the CRT/DREB box (TACCGACAT), since the two boxes were very similar."[13]

"Based on DNA-binding data, this group [one AP2/ERF domain only] has been subdivided into dehydration-responsive element-binding (DREB)-like proteins, which interact with the dehydration-responsive, or cold-repeat, element (consensus sequence TACCGACAT) (Liu et al. 1998; Stockinger et al. 1997) and the EREBP-like (ethylene responsive element binding protein) proteins binding the GCC box (consensus sequence AGCCGCC) (Ohme-Takagi and Shinshi 1995; Sakuma et al. 2002)."[14]

Enhancer boxes

edit
 
This is an image of Dendromus mysticalis, the chestnut climbing mouse. Credit: Kenneth Worm.

An E-box (Enhancer Box) is a DNA sequence which usually lies upstream of a gene in a promoter region.

E2 boxes

edit

"The most dramatic impact on immunoglobulin gene enhancer activity was observed upon mutation of sites that contain an E2-box motif (G/ACAGNTGN)."[15]

"The E box sites that are most important are those of the E2 box class (GCAGXTGG/T). Two E2 box sites are present in the immunoglobulin heavy chain gene enhancer, p.E2 and /~E5, and one is present in the kappa enhancer, designated KE2 [29-31]."[16]

F boxes

edit

"The F-box is a protein motif of approximately 50 amino acids that functions as a site of protein-protein interaction."[17]

"The F-box of Elongin A binds Elongin C (El C). The association of Elongins B and C with A increases Elongin A transcriptional activity."[17]

"SCF complexes generally recognize substrates after they are phosphorylated on specific epitopes [10]. Phosphorylation is one of the major mechanisms used by cells to rapidly transduce signals. SCF complexes are therefore ideal for dynamic processes that require an abrupt change to be made irreversible (at least in the short term) via the degradation of key proteins. Examples of such processes are cell-cycle phase transitions - during which the cell-cycle regulators that were required for the previous phase are degraded as the cell enters the new phase - and shifts in transcription that last for a longer time period than otherwise because a transcriptional inhibitor is degraded. There is a wide variety of SCF targets that include cell-cycle regulators, for example, G1-phase cyclins, cyclin-dependent kinase inhibitors, DNA replication factors, and transcription factors that promote cell-cycle progression, as well as non-cell-cycle functions, such as a cytoskeletal regulator, cell-surface receptors, transcription-factor inhibitors, and non-cell-cycle transcription factors (Table 2)."[17]

"Second, Elongin A, the transcriptionally active subunit of the Elongin (SIII) complex - which facilitates transcription elongation by RNA polymerase II [16] - is an F-box protein (Figure 2c). Elongin A was isolated by virtue of its ability to increase the catalytic rate of transcript elongation by RNA polymerase II in vitro [16]. Binding of the other components of the complex, Elongin B and C, increases the specific activity of Elongin A. The F-box motif of Elongin A is in the smallest region shown to be sufficient for Elongin A to bind Elongin C in both yeast and humans [17,18]. Elongin C has homology to Skp1; the F-box-Elongin C interaction may therefore be evolutionarily conserved."[17]

Forkhead boxes

edit

Forkhead "is named for the Drosophila fork head protein, a transcription factor which promotes terminal rather than segmental development."[18]

Fur boxes

edit

Notation: Let Fur stand for ferric uptake regulation.

"The Fur protein [...] acts as a transcriptional repressor of iron-regulated promoters by virtue of its Fe2+-dependent DNA binding activity (5, 25, 32, 33)."[19]

"Under iron-rich conditions Fur binds the divalent ion, acquires a configuration able to bind target DNA sequences (generally known as Fur boxes or iron boxes, [...]), and inhibits transcription from virtually all the genes and operons repressed by the metal."[19]

When "iron is scarce, the equilibrium is displaced to release Fe2+, the RNA polymerase accesses cognate promoters, and the genes for the biosynthesis of siderophores and other iron-related functions are expressed (41, 55)."[19]

The "sequence 5′-GATAATGATAATCATTATC-3′ [is] the functional target of the Fur protein."[19]

Many "iron-regulated promoters appear to have not just one Fur box but multiple, sometimes overlapping, boxes (42, 53, 87)"[19]

The "same 19-bp sequence [5′-GATAATGATAATCATTATC-3′] can be viewed as a combination of three adjacent repeats, 5′-NATA/TAT-3′."[19]

G boxes

edit

The "perfect palindrome 5'-GCCACGTGGC-3' which is also known as the G-box motif."[20]

"TAF-1 can bind to the G-box and related motifs and that it functions as a transcription activator."[20]

"A G-box-related motif, containing the core sequence CACGTG is also present in the 5' regions of two other classes of light-responsive genes".[20]

GC boxes

edit

A GC box is a distinct pattern of nucleotides found in the promoter region of some eukaryotic genes upstream of the TATA box and approximately 110 bases upstream from the transcription initiation site.

GCC boxes

edit

The "EREBP-like (ethylene responsive element binding protein) proteins [bind] the GCC box (consensus sequence AGCCGCC) (Ohme-Takagi and Shinshi 1995; Sakuma et al. 2002)."[14]

H boxes

edit

An H box has a consensus sequence of 3'-ACACCA-5'.[21]

HMG boxes

edit

"Upstream Binding Factor (UBF) is important for activation of ribosomal RNA transcription and belongs to a family of proteins containing nucleic acid binding domains, termed HMG-boxes, with similarity to High Mobility Group (HMG) chromosomal proteins."[22]

"Most HMG box proteins contain two or more HMG boxes and appear to bind DNA in a relatively sequence-aspecific manner (5, 13, 15, 16 and references therein). [...] they all appear to bind to the minor groove of the A/T A/T C A A A G-motif (10, 14, 18-20)."[23]

"In mammals, the Tcf/Lef family consists of four genes: Tcf‐1, Lef‐1, Tcf‐3 and Tcf‐4. All TCF/LEF proteins display several common structural features (48,49). They contain a nearly identical DNA‐binding domain, the HMG box, recognizing the consensus sequence A/T A/T CAAA."[24]

HY boxes

edit

A core responsive element is the hypertrophy region HY box between -89 and -60 nucleotides (nts) upstream from the transcription start site.[25]

MADS boxes

edit

"The MADS-box encodes a novel type of DNA-binding domain found so far in a diverse group of transcription factors from yeast, animals, and seed plants."[26]

"The MADS-box comprises 180 nucleotides, encoding 60 amino acids [...] MADS is an acronym for the four DNA-binding proteins MCM1 [minichromosome maintenance gene 1], AGAMOUS [...], DEFICIENS [...], and SRF [serum response factor]."[26]

The "[Antirrhinum majus mutant squamosa(squa)] SQUA is a member of a family of transcription factors which contain the MADS-box, a conserved DNA binding domain."[27]

The "MADS-box is [...] AGAGGGAAAGTACAACTGAAGAGGATAGAGAACAAGATCAATAGACAGGTGACTTT CTCAAAGAGGAGAGGTGGATTGTTGAAAAAAGCTCATGAGCTCTCTGTGCTTTGTG ATGCTGAAGTGGCTCTTATTGTCTTCTCTAATAAGGGGAAGCTATTTGAGTATTCT ACTGAT",[27] which has 174 nucleotides (nts) and begins with the nucleotides for the amino acids RGK.[27] The six nucleotides following the MADS box are "TCTTGC"[27] which may be the additional six needed to get to 180 nts.

P boxes

edit

"As VRI [target gene: vrille (VRI)] accumulates in the nucleus during the mid to late day, it binds VRI/PDP1ϵ binding sites (V/P-boxes) [consensus A(/G)TTA(/T)T(/C):GTAAT(/C)], to repress Clk and cry transcription (Hardin, 2004)."[28]

"REV-ERBα and RORa are nuclear receptors rather than bZIP transcription factors like VRI and PDP1ϵ, and they regulate transcription by binding RORE elements rather than V/P-boxes (Bell-Pedersen et al., 2005)."[28]

Pribnow boxes

edit

"Although the first five bases of the conserved sequence are identical to the first five bases of the Pribnow box (TATAA), the sixth base of the Pribnow box is a 100 per cent conserved T (refs 15-17) while the 100 per cent conserved A found here is actually more similar to eukaryotic promoter sequences20."[29]

"Two domains upstream of the start site of transcription have been identified for which a consensus sequence has been formulated(1-5). These domains are the -35 sequence (5'-T-T-G-A-C-A) and the Pribnow box (5'-T-A-T-A-A-T) in the -10 region. Both domains are in close contact with the RNA polymerase during initiation of RNAsynthesis (2,6)."[30]

Prolamin boxes

edit

"The BPBF [barley prolamin-box (P-box) binding factor] expressed in bacteria as a GST-fusion binds a P-box 5′-TGTAAAG-3′ containing oligonucleotide derived from the promoter region of anHor2gene."[31]

"The primary structure of hordein [barley prolamins] polypeptides is closely related to that of prolamins from other grass species from the Pooideae subfamily, such as wheat and rye (Shewry & Tatham 1990;Shewry et al. 1995). The close evolutionary relationship is also manifested by the conservation of a putative regulatory element in their gene promoters, the endosperm box (Forde et al. 1985;Kreis et al. 1985). This conserved region consists of two motifs, a 7 bp element (5′TGTAAAG3′) termed the Prolamin Box (P-box) or endosperm motif (EM) followed at a distance of up to 8 nucleotides by the GCN4-like motif (GLM) which has the 5′(G/A)TGA(G/C)TCA(T/C)3′ consensus sequence (reviewed by Müller et al. 1995)."[31]

Pyrimidine boxes

edit

"Functional analyses of a number of hydrolase gene promoters, induced by gibberellin (GA) in aleurone cells following germination, have identified a GA-responsive complex as a tripartite element containing a pyrimidine box motif 5′-CCTTTT-3′."[32]

"Although this GARC [GA responsive complex] may not always be tripartite, most often it includes three sequence motifs, the TAACAAA box or GA responsive element (GARE), the pyrimidine box CCTTTT, and the TATCCAC box (Skriver et al., 1991;Gubler and Jacobsen, 1992; Rogers et al., 1994)."[32]

The "complementary strand of the pyrimidine box element (5′-CTTTT-3′) in GA-induced hydrolase gene promoters was identical to the core sequence (5′-AAAAG-3′) recognized by PBF in prolamin gene promoters (P-box: 5′-T/AAAAG-3′; Vicente-Carbajosa et al., 1997; Mena et al., 1998; Yanagisawa and Schmidt, 1999)".[32]

TACTAAC boxes

edit

"The comparison of the two rp51 genes [...] suggests that this homology might be more extensive if one allows up to 3 bases to differ from a larger consensus sequence, ATTTACTAAC."[33]

"A consensus sequence TACTAA(C/T) was derived for the branch site of Dictyostelium introns."[34]

T boxes

edit

The T box is a DNA-binding domain.[35]

"T-box genes encode transcription factors involved in the regulation of developmental processes."[35]

TATA boxes

edit
 
This image is a drawing of Haloquadratum walsbyi. Credit: Rotational.

The TATA box (also called Goldberg-Hogness box)[36] is a DNA sequence (cis-regulatory element) found in the promoter region of genes in archaea and eukaryotes;[37] approximately 24% of human genes contain a TATA box within the core promoter.[38]

The TATA box is a "binding site of either general transcription factors or histones.

U boxes

edit

"The U box is a domain of ∼70 amino acids that is present in proteins from yeast to humans."[39]

"The prototype U box protein, yeast Ufd2, was identified as a ubiquitin chain assembly factor that cooperates with a ubiquitin-activating enzyme (E1), a ubiquitin-conjugating enzyme (E2), and a ubiquitin-protein ligase (E3) to catalyze ubiquitin chain formation on artificial substrates."[39]

"The UFD2 protein and its homologs in other eukaryotes share a conserved domain designated the ‘U box’."[40]

"The U box mediates the interaction of UFD2 with ubiquitin conjugated proteins [...] the U box is a derived version of the RING-finger domain that lacks the hallmark metal-chelating residues of the latter [5,6] but is likely to function similarly to the RING-finger in mediating ubiquitin-conjugation of protein substrates."[40]

X boxes

edit

"The so-called X (or X1) box in the promoter of the human MHC class II DRA gene is the binding site for a ubiquitous mammalian sequence-specific DNA-binding protein called RFX, NF-X, NF-Xc, or RFX1 (4,19,23,24,27)."[41]

"RFX is MDBP [...] the MDBP (RFX) recognition site region in the DRA promoter can be considered to extend from positions -100 to -112 [...] a possible binding site for MDBP which begins 88 bp after the first residue of the presumably full-length RFX1 (MDBP) cDNA (26). This site (RFX+88) is as follows: 5'-GTTGGCATGGCAAC-3'."[41]

Y boxes

edit

The "Y-box [is] an inverted CCAAT box, in the promoter region of many genes; this domain is highly conserved in evolution [1, 3, 4]."[42]

Hypotheses

edit
  1. A1BG is not transcribed by any of the boxes.

See also

edit

References

edit
  1. "Box (disambiguation)". San Francisco, California: Wikimedia Foundation, Inc. 23 May 2013. Retrieved 2013-06-15.
  2. genetics. San Francisco, California: Wikimedia Foundation, Inc. 16 April 2014. https://en.wiktionary.org/wiki/genetics. Retrieved 2014-05-07. 
  3. Fabgil (22 October 2015). box. San Francisco, California: Wikimedia Foundation, Inc. https://en.wiktionary.org/wiki/box. Retrieved 2017-04-04. 
  4. Michael Büttner and Karam B. Singh (27 May 1997). "Arabidopsis thaliana ethylene-responsive element binding protein (AtEBP), an ethylene-inducible, GCC box DNA-binding protein interacts with an ocs element binding protein". Proceedings of the National Academy of Sciences of the United States of America 94 (11): 5961-6. http://www.pnas.org/content/94/11/5961.long. Retrieved 2014-05-02. 
  5. Shigemi Kimura, Kuniya Abe, Misao Suzuki, Masakatsu Ogawa, Kowashi Yoshioka, Tadasi Kaname, Teruhisa Miike and Ken-ichi Yamamura (June 1997). "A 900 bp genomic region from the mouse dystrophin promoter directs lacZ reporter expression only to the right heart of transgenic mice". Development, Growth & Differentiation 39 (3): 257-65. doi:10.1046/j.1440-169X.1997.t01-2-00001.x. http://onlinelibrary.wiley.com/doi/10.1046/j.1440-169X.1997.t01-2-00001.x/full. Retrieved 2013-06-28. 
  6. Oliver G. McDonald, Brian R. Wamhoff, Mark H. Hoofnagle, and Gary K. Owens (4 January 2006). "Control of SRF binding to CArG box chromatin regulates smooth muscle gene expression in vivo". The Journal of Clinical Investigation 116 (1): 36-48. http://www.ncbi.nlm.nih.gov/pmc/articles/PMC1323266/. Retrieved 2014-06-05. 
  7. Trilochan Sahoo, Daniela del Gaudio, Jennifer R German, Marwan Shinawi, Sarika U Peters, Richard E Person, Adolfo Garnica, Sau Wai Cheung, and Arthur L Beaudet (June 2008). "Prader-Willi phenotype caused by paternal deficiency for the HBII-85 C/D box small nucleolar RNA cluster". Nat. Genet. 40 (6): 719-21. doi:10.1038/ng.158. http://www.ncbi.nlm.nih.gov/pmc/articles/PMC2705197/. Retrieved 2014-06-08. 
  8. 8.0 8.1 Dmitry A.Samarsky, Maurille J.Fournier, Robert H.Singer and Edouard Bertrand (1 July 1998). "The snoRNA box C/D motif directs nucleolar targeting and also couples snoRNA synthesis and localization". The European Molecular Biology Organization (EMBO) Journal 17 (13): 3747–3757. doi:10.1093/emboj/17.13.3747. PMID 9649444. https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1170710/pdf/003747.pdf. Retrieved 2017-02-04. 
  9. Yoshinori Tanaka, Hitoshi Kurumizaka, and Shigeyuki Yokoyama (January 2005). "CpG methylation of the CENP-B box reduces human CENP-B binding". The FEBS Journal 272 (1): 282–289. doi:10.1111/j.1432-1033.2004.04406.x. PMID 15634350. http://onlinelibrary.wiley.com/doi/10.1111/j.1432-1033.2004.04406.x/full. Retrieved 2017-02-05. 
  10. 10.0 10.1 Tianbao Yang and B. W. Poovaiah (22 November 2002). "A calmodulin-binding/CGCG box DNA-binding protein family involved in multiple signaling pathways in plants". Journal of Biological Chemistry 277 (47): 45049-45058. doi:10.1074/jbc.M207941200. http://www.jbc.org/content/277/47/45049.full. Retrieved 2017-02-05. 
  11. Jonathan J. H. Pearce, Prim B. Singh and Stephen J. Gaunt (1 April 1992). "The mouse has a Polycomb-like chromobox gene". Development 114 (4): 921-9. PMID 1352241. https://journals.biologists.com/dev/article-pdf/114/4/921/1123373/921.pdf. Retrieved 2013-06-15. 
  12. NCBI (9 June 2013). CBX5 chromobox homolog 5 ( Homo sapiens (human) ). Bethesda, MD, USA: National Center for Biotechnology Information. http://www.ncbi.nlm.nih.gov/gene/23468. Retrieved 2013-06-15. 
  13. Jian-Quan Chen, Yi Dong, Yu-Jun Wang, Qiang Liu, Jin-Song Zhang, Shou-Yi Chen (October 2003). "An AP2/EREBP-type transcription-factor gene from rice is cold-inducible and encodes a nuclear-localized protein". Theoretical and Applied Genetics 107 (6): 972–979. doi:10.1007/s00122-003-1346-5. https://link.springer.com/article/10.1007/s00122-003-1346-5. Retrieved 2017-05-30. 
  14. 14.0 14.1 Ute Fischer and Wolfgang Dröge-Laser (2004). "Overexpression of NtERF5, a New Member of the Tobacco Ethylene Response Transcription Factor Family Enhances Resistance to Tobacco mosaic virus". Molecular Plant-Microbe Interactions 17 (10): 1162–1171. doi:10.1094/MPMI.2004.17.10.1162. http://apsjournals.apsnet.org/doi/pdf/10.1094/MPMI.2004.17.10.1162. Retrieved 2017-05-30. 
  15. Cornelis Murre and David Baltimore (1992). The Helix-Loop-Helix Motif: Structure and Function, In: Transcriptional Regulation. 22B. Cold Spring Harbor Laboratory Press. pp. 861-79. doi:10.1101/087969425.22B.861. https://cshmonographs.org/csh/index.php/monographs/article/viewPDFInterstitial/3449/2723. Retrieved 2017-02-08. 
  16. Cornelis Murre, Gretchen Bain, Marc A. van Dijk, Isaac Engel, Beth A. Furnari, Mark E. Massari, James R. Matthews, Melanie W. Quong, Richard R. Rivera, Maarten H. Stuiver (June 1994). "Structure and function of helix-loop-helix proteins". Biochimica et Biophysica Acta (BBA) - Gene Structure and Expression 1218 (2): 129-35. http://www.sciencedirect.com/science/article/pii/0167478194900019. Retrieved 2017-02-08. 
  17. 17.0 17.1 17.2 17.3 Edward T Kipreos and Michele Pagano (10 November 2000). "The F-box protein family". Genome Biology 1 (5): 3002. doi:10.1186/gb-2000-1-5-reviews3002. https://genomebiology.biomedcentral.com/articles/10.1186/gb-2000-1-5-reviews3002. Retrieved 2017-02-13. 
  18. NCBI (9 June 2013). FOXP3 forkhead box P3 ( Homo sapiens (human) ). Bethesda, MD, USA: National Center for Biotechnology Information, U.S. National Library of Medicine. http://www.ncbi.nlm.nih.gov/gene/50943. Retrieved 2013-06-15. 
  19. 19.0 19.1 19.2 19.3 19.4 19.5 Lucía Escolar, Jose Pérez-Martín, and Víctor de Lorenzo (October 1999). "Opening the Iron Box: Transcriptional Metalloregulation by the Fur Protein". Journal of Bacteriology 181 (20): 6223-9. http://jb.asm.org/content/181/20/6223.short. Retrieved 2017-04-04. 
  20. 20.0 20.1 20.2 K Oeda, J Salinas, and N H Chua (July 1991). "A tobacco bZip transcription activator (TAF-1) binds to a G-box-like motif conserved in plant genes". The EMBO Journal 10 (7): 1793–1802. https://www.ncbi.nlm.nih.gov/pmc/articles/PMC452853/. Retrieved 2017-02-13. 
  21. Timofey S. Rozhdestvensky, Thean Hock Tang, Inna V. Tchirkova, Jürgen Brosius, Jean‐Pierre Bachellerie and Alexander Hüttenhofer (2003). "Binding of L7Ae protein to the K‐turn of archaeal snoRNAs: a shared RNA binding motif for C/D and H/ACA box snoRNAs in Archaea". Nucleic Acids Research 31 (3): 869-77. doi:10.1093/nar/gkg175. http://nar.oxfordjournals.org/content/31/3/869.long. Retrieved 2014-06-08. 
  22. Gregory P. Copenhaver, Christopher D. Putnam, Michael L. Denton and Craig S. Pikaard (1994). "The RNA polymerase I transcription factor UBF is a sequence-tolerant HMG-box protein that can recognize structured nucleic acids". Nucleic Acids Research 22 (13): 2651-7. https://academic.oup.com/nar/article-pdf/22/13/2651/7122053/22-13-2651.pdf. Retrieved 2017-04-05. 
  23. Vincent Laudet, Dominique Stehelin and Hans Clevers (1993). "Ancestry and diversity of the HMG box superfamily". Nucleic Acids Research 21 (10): 2493-501. https://academic.oup.com/nar/article-pdf/21/10/2493/4086740/21-10-2493.pdf. Retrieved 2017-04-05. 
  24. Tomas Valenta, Jan Lukas, Vladimir Korinek (2003). "HMG box transcription factor TCF‐4’s interaction with CtBP1 controls the expression of the Wnt target Axin2/Conductin in human embryonic kidney cells". Nucleic Acids Research 31 (9): 2369-80. doi:10.1093/nar/gkg346. https://academic.oup.com/nar/article/31/9/2369/1080420/HMG-box-transcription-factor-TCF-4-s-interaction. Retrieved 2017-04-05. 
  25. Akiro Higashikawa, Taku Saito, Toshiyuki Ikeda, Satoru Kamekura, Naohiro Kawamura, Akinori Kan, Yasushi Oshima, Shinsuke Ohba, Naoshi Ogata, Katsushi Takeshita, Kozo Nakamura, Ung-Il Chung, Hiroshi Kawaguchi (January 2009). "Identification of the core element responsive to runt-related transcription factor 2 in the promoter of human type x collagen gene". Arthritis & Rheumatism 60 (1): 166-78. doi:10.1002/art.24243. PMID 19116917. http://onlinelibrary.wiley.com/doi/10.1002/art.24243/full. Retrieved 2013-06-18. 
  26. 26.0 26.1 Günter Theißen, Jan T. Kim, Heinz Saedler (1 November 1996). "Classification and phylogeny of the MADS-box multigene family suggest defined roles of MADS-box gene subfamilies in the morphological evolution of eukaryotes". Journal of Molecular Evolution 43 (5): 484-516. doi:10.1007/BF02337521. http://link.springer.com/article/10.1007/BF02337521. Retrieved 2015-03-31. 
  27. 27.0 27.1 27.2 27.3 Peter Huijser, Joachim Klein, Wolf-Ekkehard Lonnig, Hans Meijer, Heinz Saedler and Hans Sommer (April 1992). "Bracteomania,an inflorescence anomaly,is caused by the loss of function of the MADS-box gene squamosa in Antirrhinum majus". The EMBO Journal 11 (4): 1239-49. PMID 556572. http://www.ncbi.nlm.nih.gov/pmc/articles/PMC556572/pdf/emboj00089-0025.pdf. Retrieved 2015-04-01. 
  28. 28.0 28.1 Wangjie Yu and Paul E. Hardin (2006). "Circadian oscillators of Drosophila and mammals". Journal of Cell Science 119: 4793-5. doi:10.1242/jcs.03174. https://journals.biologists.com/jcs/article/119/23/4793/29176/Circadian-oscillators-of-Drosophila-and-mammals. Retrieved 2017-02-19. 
  29. Alan C. Christensen & Elton T. Young (23 September 1982). "T4 late transcripts are initiated near a conserved DNA sequence". Nature 299 (5881): 369-71. doi:10.1038/299369a0. http://www.nature.com/nature/journal/v299/n5881/abs/299369a0.html. Retrieved 2017-02-19. 
  30. Herman A. de Boer, Lisa J. Comstock, and Mark Vasser (January 1983). "The tac promoter: A functional hybrid derived from the trpand lac promoters". Proceedings of the National Academy of Sciences USA 80 (1): 21-5. http://www.pnas.org/content/80/1/21.full.pdf. Retrieved 2017-02-19. 
  31. 31.0 31.1 Montaña Mena, Jesus Vicente-Carbajosa, Robert J. Schmidt and Pilar Carbonero (October 1998). "An endosperm-specific DOF protein from barley, highly conserved in wheat, binds to and activates transcription from the prolamin-box of a native B-hordein promoter in barley endosperm". The Plant Journal 16 (1): 53-62. doi:10.1046/j.1365-313x.1998.00275.x. http://onlinelibrary.wiley.com/doi/10.1046/j.1365-313x.1998.00275.x/full. Retrieved 2017-02-19. 
  32. 32.0 32.1 32.2 Montaña Mena, Francisco Javier Cejudo, Ines Isabel-Lamoneda and Pilar Carbonero (1 September 2002). "A Role for the DOF Transcription Factor BPBF in the Regulation of Gibberellin-Responsive Genes in Barley Aleurone". Plant Physiology 130 (1): 111-9. doi:10.​1104/​pp.​005561. http://www.plantphysiol.org/content/130/1/111.full. Retrieved 2017-02-19. 
  33. Claudio W. Pikielny, John L. Teem, Michael Rosbash (September 1983). "Evidence for the biochemical role of an internal sequence in yeast nuclear mRNA introns: implications for U1 RNA and metazoan mRNA splicing". Cell 34 (2): 395-403. doi:10.1016/0092-8674(83)90373-2. https://pdfs.semanticscholar.org/82a1/d2fb5feabbfc7a64e0b3d84be536c7a4110d.pdf. Retrieved 2017-04-05. 
  34. Francisco Rivero (June 2002). "mRNA processing in Dictyostelium: sequence requirements for termination and splicing". Protist 153 (2): 169-76. doi:10.1078/1434-4610-00095. http://www.academia.edu/download/45750040/1434-4610-0009520160518-7945-1y17rxg.pdf. Retrieved 2017-04-05. 
  35. 35.0 35.1 NCBI (9 June 2013). TBX2 T-box 2 ( Homo sapiens (human) ). Bethesda, MD, USA: National Center for Biotechnology Information, U.S. National Library of Medicine. http://www.ncbi.nlm.nih.gov/gene/6909. Retrieved 2013-06-15. 
  36. Lifton RP, Goldberg ML, Karp RW, Hogness DS (1978). "The organization of the histone genes in Drosophila melanogaster: functional and evolutionary implications". Cold Spring Harb Symp Quant Biol 42: 1047–51. PMID 98262. 
  37. Stephen T. Smale and James T. Kadonaga (July 2003). "The RNA Polymerase II Core Promoter". Annual Review of Biochemistry 72 (1): 449-79. doi:10.1146/annurev.biochem.72.121801.161520. PMID 12651739. http://www.annualreviews.org/doi/abs/10.1146/annurev.biochem.72.121801.161520. Retrieved 2012-05-07. 
  38. C Yang, E Bolotin, T Jiang, FM Sladek, E Martinez (March 2007). "Prevalence of the initiator over the TATA box in human and yeast genes and identification of DNA motifs enriched in human TATA-less core promoters". Gene 389 (1): 52–65. doi:10.1016/j.gene.2006.09.029. PMID 17123746. PMC 1955227. http://www.ncbi.nlm.nih.gov/pmc/articles/PMC1955227/?tool=pubmed. 
  39. 39.0 39.1 Shigetsugu Hatakeyama, Masayoshi Yada, Masaki Matsumoto, Noriko Ishida and Kei-Ichi Nakayama (31 August 2001). "U Box Proteins as a New Family of Ubiquitin-Protein Ligases". The Journal of Biological Chemistry 276: 33111-20. doi:10.1074/jbc.M102755200. http://www.jbc.org/content/276/35/33111.full. Retrieved 2014-06-16. 
  40. 40.0 40.1 L. Aravind and Eugene V. Koonin (2000). "The U box is a modified RING finger — a common domain in ubiquitination". Current Biology 10 (4): R132-4. 
  41. 41.0 41.1 XIAN-YANG ZHANG, NABILA JABRANE-FERRAT, CLEMENT K. ASIEDU, SANJA SAMAC, B. MATIJA PETERLIN, AND MELANIE EHRLICH (November 1993). "The Major Histocompatibility Complex Class II Promoter-Binding Protein RFX (NF-X) Is a Methylated DNA-Binding Protein". MOLECULAR AND CELLULAR BIOLOGY 13 (11): 6810-8. https://www.ncbi.nlm.nih.gov/pmc/articles/PMC364743/pdf/molcellb00023-0210.pdf. Retrieved 2017-04-05. 
  42. Koji Koike, Takeshi Uchiumi, Takefumi Ohga, Satoshi Toh, Morimasa Wada, Kimitoshi Kohno, and Michihiko Kuwano (17 November 1997). "Nuclear translocation of the Y-box binding protein by ultraviolet irradiation". FEBS Letters 417 (3): 390-4. doi:10.1016/S0014-5793(97)01296-9. http://onlinelibrary.wiley.com/doi/10.1016/S0014-5793(97)01296-9/full. Retrieved 2017-04-05. 
edit