Human Gene Set: GTGACGY_E4F1_Q6


Standard name GTGACGY_E4F1_Q6
Systematic name M3403
Brief description Genes having at least one occurrence of the highly conserved motif M20 GTGACGY in the regions spanning 4 kb centered on their transcription starting sites [-2kb, +2kb]. This matches the E4F1 [GeneSymbol=E4F1] transcription factor binding site V$E4F1_Q6 (v7.4 TRANSFAC).
Full description or abstract Comprehensive identification of all functional elements encoded in the human genome is a fundamental need in biomedical research. Here, we present a comparative analysis of the human, mouse, rat and dog genomes to create a systematic catalogue of common regulatory motifs in promoters and 3' untranslated regions (3' UTRs). The promoter analysis yields 174 candidate motifs, including most previously known transcription-factor binding sites and 105 new motifs. The 3'-UTR analysis yields 106 motifs likely to be involved in post-transcriptional regulation. Nearly one-half are associated with microRNAs (miRNAs), leading to the discovery of many new miRNA genes and their likely target genes. Our results suggest that previous estimates of the number of human miRNA genes were low, and that miRNAs regulate at least 20% of human genes. The overall results provide a systematic view of gene regulation in the human, which will be refined as additional mammalian genomes become available.
Collection C3: Regulatory Target
      TFT: Transcription Factor Targets
            TFT:TFT_LEGACY: TFT_Legacy
Source publication Pubmed 15735639   Authors: Xie X,Lu J,Kulbokas EJ,Golub TR,Mootha V,Lindblad-Toh K,Lander ES,Kellis M
Exact source  
Related gene sets (show 168 additional gene sets from the source publication)

(show 174 gene sets from the same authors)
External links
Filtered by similarity ?
Source species Homo sapiens
Contributed by Xiaohui Xie (Broad Institute)
Source platform or
identifier namespace
HUMAN_GENE_SYMBOL
Dataset references  
Download gene set format: grp | gmt | xml | json | TSV metadata
Compute overlaps ? (show collections to investigate for overlap with this gene set)
Compendia expression profiles ? NG-CHM interactive heatmaps
(Please note that clustering takes a few seconds)
GTEx compendium
Human tissue compendium (Novartis)
Global Cancer Map (Broad Institute)
NCI-60 cell lines (National Cancer Institute)

Legacy heatmaps (PNG)
GTEx compendium
Human tissue compendium (Novartis)
Global Cancer Map (Broad Institute)
NCI-60 cell lines (National Cancer Institute)
Advanced query Further investigate these 672 genes
Gene families ? Categorize these 672 genes by gene family
Show members (show 682 source identifiers mapped to 672 genes)
Version history 7.1: Moved to TFT_Legacy sub-collection.

See MSigDB license terms here. Please note that certain gene sets have special access terms.