Human Gene ELMSAN1 (uc010tud.1) Description and Page Index
  Description: Homo sapiens ELM2 and Myb/SANT-like domain containing 1 (ELMSAN1), transcript variant 2, mRNA.
Transcript (Including UTRs)
   Position: hg19 chr14:74,185,549-74,206,958 Size: 21,410 Total Exon Count: 12 Strand: -
Coding Region
   Position: hg19 chr14:74,185,549-74,206,711 Size: 21,163 Coding Exon Count: 12 

Page IndexSequence and LinksUniProtKB CommentsCTDGene AllelesRNA-Seq Expression
Microarray ExpressionRNA StructureProtein StructureOther SpeciesGO AnnotationsmRNA Descriptions
Other NamesModel InformationMethods
Data last updated: 2013-06-15

-  Sequence and Links to Tools and Databases
Genomic Sequence (chr14:74,185,549-74,206,958)mRNA (may differ from genome)Protein (1099 aa)
Gene SorterGenome BrowserOther Species FASTAVisiGeneGene interactionsTable Schema
BioGPSCGAPEnsemblEntrez GeneExonPrimerGeneCards
neXtProtPubMedStanford SOURCETreefamUniProtKB

-  Comments and Description Text from UniProtKB
DESCRIPTION: RecName: Full=ELM2 and SANT domain-containing protein 1;
SUBCELLULAR LOCATION: Nucleus (By similarity).
SIMILARITY: Contains 1 ELM2 domain.
SIMILARITY: Contains 1 SANT domain.
SEQUENCE CAUTION: Sequence=AAH06511.1; Type=Erroneous initiation; Note=Translation N-terminally shortened;

-  Comparative Toxicogenomics Database (CTD)
  The following chemicals interact with this gene

+  Common Gene Haplotype Alleles
  Press "+" in the title bar above to open this section.

-  RNA-Seq Expression Data from GTEx (53 Tissues, 570 Donors)
  Highest median expression: 18.65 RPKM in Skin - Sun Exposed (Lower leg)
Total median expression: 401.68 RPKM

View in GTEx track of Genome Browser    View at GTEx portal     View GTEx Body Map

+  Microarray Expression Data
  Press "+" in the title bar above to open this section.

-  mRNA Secondary Structure of 3' and 5' UTRs
RegionFold EnergyBasesEnergy/Base
Display As
5' UTR -92.50247-0.374 Picture PostScript Text

The RNAfold program from the Vienna RNA Package is used to perform the secondary structure predictions and folding calculations. The estimated folding energy is in kcal/mol. The more negative the energy, the more secondary structure the RNA is likely to have.

-  Protein Domain and Structure Information
  InterPro Domains: Graphical view of domain structure
IPR000949 - ELM2_dom
IPR009057 - Homeodomain-like
IPR001005 - SANT/Myb
IPR017884 - SANT_dom

Pfam Domains:
PF01448 - ELM2 domain

SCOP Domains:
46689 - Homeodomain-like
57667 - C2H2 and C2HC zinc fingers

ModBase Predicted Comparative 3D Structure on Q6PJG2
The pictures above may be empty if there is no ModBase structure for the protein. The ModBase structure frequently covers just a fragment of the protein. You may be asked to log onto ModBase the first time you click on the pictures. It is simplest after logging in to just click on the picture again to get to the specific info on that model.

-  Orthologous Genes in Other Species
  Orthologies between human, mouse, and rat are computed by taking the best BLASTP hit, and filtering out non-syntenic hits. For more distant species reciprocal-best BLASTP hits are used. Note that the absence of an ortholog in the table below may reflect incomplete annotations in the other species rather than a true absence of the orthologous gene.
MouseRatZebrafishD. melanogasterC. elegansS. cerevisiae
No orthologNo orthologGenome BrowserNo orthologNo orthologNo ortholog
Gene Details     
Gene Sorter     
  Protein Sequence   

-  Gene Ontology (GO) Annotations with Structured Vocabulary
  Molecular Function:
GO:0000981 RNA polymerase II transcription factor activity, sequence-specific DNA binding
GO:0003677 DNA binding
GO:0003700 transcription factor activity, sequence-specific DNA binding
GO:0008134 transcription factor binding
GO:0044212 transcription regulatory region DNA binding

Biological Process:
GO:0006351 transcription, DNA-templated
GO:0006355 regulation of transcription, DNA-templated
GO:0006357 regulation of transcription from RNA polymerase II promoter

Cellular Component:
GO:0000118 histone deacetylase complex
GO:0005634 nucleus
GO:0005654 nucleoplasm
GO:0005667 transcription factor complex

-  Descriptions from all associated GenBank mRNAs
  BC052976 - Homo sapiens chromosome 14 open reading frame 43, mRNA (cDNA clone IMAGE:6193955), with apparent retained intron.
AB385229 - Synthetic construct DNA, clone: pF1KB9471, Homo sapiens C14orf43 gene for C14orf43 protein, complete cds, without stop codon, in Flexi system.
BC006511 - Homo sapiens chromosome 14 open reading frame 43, mRNA (cDNA clone IMAGE:3010441), partial cds.
BC025330 - Homo sapiens chromosome 14 open reading frame 43, mRNA (cDNA clone IMAGE:4111658), partial cds.
BC006451 - Homo sapiens, clone IMAGE:3960432, mRNA.
BC003121 - Homo sapiens, clone IMAGE:3357079, mRNA.
BC015668 - Homo sapiens chromosome 14 open reading frame 43, mRNA (cDNA clone IMAGE:4328688), partial cds.
BC009202 - Homo sapiens chromosome 14 open reading frame 43, mRNA (cDNA clone IMAGE:3614143), partial cds.
AK127785 - Homo sapiens cDNA FLJ45886 fis, clone OCBBF3021361.
LF209593 - JP 2014500723-A/17096: Polycomb-Associated Non-Coding RNAs.
AK308604 - Homo sapiens cDNA, FLJ98645.
JD479949 - Sequence 460973 from Patent EP1572962.
MA445170 - JP 2018138019-A/17096: Polycomb-Associated Non-Coding RNAs.
AK021512 - Homo sapiens cDNA FLJ11450 fis, clone HEMBA1001432.
AK090424 - Homo sapiens mRNA for FLJ00335 protein.
JD519887 - Sequence 500911 from Patent EP1572962.
JD064165 - Sequence 45189 from Patent EP1572962.
JD393152 - Sequence 374176 from Patent EP1572962.
JD522814 - Sequence 503838 from Patent EP1572962.
JD126367 - Sequence 107391 from Patent EP1572962.
JD052249 - Sequence 33273 from Patent EP1572962.
L37690 - Homo sapiens (clone 95) macronuclear mRNA.
L37692 - Homo sapiens (clone 88) macronuclear mRNA.
JD485531 - Sequence 466555 from Patent EP1572962.
JD395265 - Sequence 376289 from Patent EP1572962.
JD159798 - Sequence 140822 from Patent EP1572962.
JD400095 - Sequence 381119 from Patent EP1572962.
JD390960 - Sequence 371984 from Patent EP1572962.
JD260915 - Sequence 241939 from Patent EP1572962.
JD419362 - Sequence 400386 from Patent EP1572962.
JD261341 - Sequence 242365 from Patent EP1572962.
JD348437 - Sequence 329461 from Patent EP1572962.
JD436227 - Sequence 417251 from Patent EP1572962.
JD418885 - Sequence 399909 from Patent EP1572962.
JD132887 - Sequence 113911 from Patent EP1572962.
JD070743 - Sequence 51767 from Patent EP1572962.
JD103999 - Sequence 85023 from Patent EP1572962.
JD436458 - Sequence 417482 from Patent EP1572962.

-  Other Names for This Gene
  Alternate Gene Symbols: AB385229, C14orf117, C14orf43, EMSA1_HUMAN, NM_194278, NP_919254, Q6PJG2, Q6PK13, Q6PK59, Q6ZS23
UCSC ID: uc010tud.1
RefSeq Accession: NM_194278
Protein: Q6PJG2 (aka EMSA1_HUMAN)
CCDS: CCDS9819.1

-  Gene Model Information
category: coding nonsense-mediated-decay: no RNA accession: AB385229.1
exon count: 12CDS single in 3' UTR: no RNA size: 3314
ORF size: 3297CDS single in intron: no Alignment % ID: 99.94
txCdsPredict score: 6707.00frame shift in genome: no % Coverage: 99.52
has start codon: yes stop codon in genome: no # of Alignments: 1
has end codon: no retained intron: no # AT/AC introns 0
selenocysteine: no end bleed into intron: 0# strange splices: 0
Click here for a detailed description of the fields of the table above.

-  Methods, Credits, and Use Restrictions
  Click here for details on how this gene model was made and data restrictions if any.