FB2026_01 , released March 12, 2026
FB2026_01 , released March 12, 2026
Gene: Dmel\HmgD
Open Close
General Information
Symbol
Dmel\HmgD
Species
D. melanogaster
Name
High mobility group protein D
Annotation Symbol
CG17950
Feature Type
FlyBase ID
FBgn0004362
Gene Model Status
Stock Availability
Gene Summary
High mobility group protein D (HmgD) encodes a highly abundant chromosomal protein involved in DNA bending and chromatin organization. [Date last reviewed: 2018-09-06] (FlyBase Gene Snapshot)
Also Known As

HMG-D, HMG D, HMG1

Key Links
Genomic Location
Cytogenetic map
Sequence location
Recombination map
2-96
RefSeq locus
NT_033778 REGION:21701458..21716727
Sequence
Genomic Maps
Other Genome Views
The following external sites may use different assemblies or annotations than FlyBase.
Function
Gene Ontology (GO) Annotations (7 terms)
Molecular Function (3 terms)
Terms Based on Experimental Evidence (3 terms)
CV Term
Evidence
References
inferred from direct assay
enables DNA binding
inferred from direct assay
Terms Based on Predictions or Assertions (0 terms)
Biological Process (2 terms)
Terms Based on Experimental Evidence (1 term)
CV Term
Evidence
References
inferred from direct assay
Terms Based on Predictions or Assertions (1 term)
CV Term
Evidence
References
inferred from biological aspect of ancestor with PANTHER:PTN009229167
Cellular Component (2 terms)
Terms Based on Experimental Evidence (2 terms)
CV Term
Evidence
References
located_in chromatin
inferred from direct assay
located_in nucleus
inferred from direct assay
Terms Based on Predictions or Assertions (1 term)
CV Term
Evidence
References
is_active_in nucleus
inferred from biological aspect of ancestor with PANTHER:PTN008733575
Protein Family (UniProt)
Belongs to the HMGB family. (Q05783)
Summaries
Gene Snapshot
High mobility group protein D (HmgD) encodes a highly abundant chromosomal protein involved in DNA bending and chromatin organization. [Date last reviewed: 2018-09-06]
Gene Group (FlyBase)
HIGH MOBILITY GROUP BOX TRANSCRIPTION FACTORS -
The High mobility group box (HMGB) transcription factors are sequence-specific DNA binding proteins that regulate transcription. The HMGB proteins have a characteristic L-shaped HMGB domain of about 80 amino acid residues, which binds the DNA minor groove and induce DNA bending. The HMGB domains are found in one or more copies and are involved in the regulation of DNA-dependent processes such as transcription, replication and chromatin remodeling. (Adapted from FBrf0194706, FBrf0108466, PMID:24086078 and PMID:23153957).
Protein Function (UniProtKB)
Binds preferentially single-stranded DNA and unwinds double-stranded DNA. Prefers sites containing the sequence 5'-ttg-3'. Facilitates DNA bending. Associated with early embryonic chromatin in the absence of histone H1.
(UniProt, Q05783)
Summary (Interactive Fly)

transcription factor - HMG domain protein - responsible for transcriptional silencing during early development - required for proper morphogenesis of sensory neuron dendrites

Gene Model and Products
Number of Transcripts
4
Number of Unique Polypeptides
1

Please see the JBrowse view of Dmel\HmgD for information on other features

To submit a correction to a gene model please use the Contact FlyBase form

Protein Domains (via Pfam)
Isoform displayed:
Pfam protein domains
InterPro name
classification
start
end
Protein Domains (via SMART)
Isoform displayed:
SMART protein domains
InterPro name
classification
start
end
Structure
Protein 3D structure   (Predicted by AlphaFold)   (AlphaFold entry Q05783)

If you don't see a structure in the viewer, refresh your browser.
Model Confidence:
  • Very high (pLDDT > 90)
  • Confident (90 > pLDDT > 70)
  • Low (70 > pLDDT > 50)
  • Very low (pLDDT < 50)

AlphaFold produces a per-residue confidence score (pLDDT) between 0 and 100. Some regions with low pLDDT may be unstructured in isolation.

Experimentally Determined Structures
Crossreferences
PDB - An information portal to biological macromolecular structures
Comments on Gene Model

Apparent introns not annotated: probable artifacts due to repetitive sequence.

Gene model reviewed during 5.51

Low-frequency RNA-Seq exon junction(s) not annotated.

Gene model reviewed during 5.56

Transcript Data
Annotated Transcripts
Name
FlyBase ID
RefSeq ID
Length (nt)
Assoc. CDS (aa)
FBtr0071708
1280
112
FBtr0071709
1279
112
FBtr0301410
967
112
FBtr0345487
1495
112
Additional Transcript Data and Comments
Reported size (kB)

1.3, 0.9 (northern blot)

1.3, 0.95 (northern blot)

Comments
External Data
Crossreferences
Polypeptide Data
Annotated Polypeptides
Name
FlyBase ID
Predicted MW (kDa)
Length (aa)
Theoretical pI
UniProt
RefSeq ID
GenBank
FBpp0071625
12.4
112
9.90
FBpp0071626
12.4
112
9.90
FBpp0290624
12.4
112
9.90
FBpp0311598
12.4
112
9.90
Polypeptides with Identical Sequences

The group(s) of polypeptides indicated below share identical sequence to each other.

112 aa isoforms: HmgD-PA, HmgD-PB, HmgD-PC, HmgD-PD
Additional Polypeptide Data and Comments
Reported size (kDa)

112 (aa); 14.5 (kD observed); 12.3 (kD predicted)

Comments
External Data
Crossreferences
PDB - An information portal to biological macromolecular structures
Linkouts
Sequences Consistent with the Gene Model
Nucleotide / Polypeptide Records
 
Mapped Features

Click to get a list of regulatory features (enhancers, TFBS, etc.) and gene disruptions (point mutations, indels, etc.) within or overlapping Dmel\HmgD using the Feature Mapper tool.

External Data
Crossreferences
Eukaryotic Promoter Database - A collection of databases of experimentally validated promoters for selected model organisms.
Linkouts
Expression Data
Testis-specificity index

The testis specificity index was calculated from modENCODE tissue expression data by Vedelek et al., 2018 to indicate the degree of testis enrichment compared to other tissues. Scores range from -2.52 (underrepresented) to 5.2 (very high testis bias).

-0.46

Transcript Expression
in situ
Stage
Tissue/Position (including subcellular localization)
Reference
northern blot
Stage
Tissue/Position (including subcellular localization)
Reference
Additional Descriptive Data

HmgD transcripts are present throughout the germarium. They are also present in all of the germline nurse cells and the somatic follicle cells in stage 1-6 egg chambers. From stage 7, HmgD transcripts stop accumulating in the follicle cells but continue to be present in the nurse cells. They accumulate around the periphery of the oocyte and remain localized until the end of oogenesis. In early embryos, weak general expression is observed. HmgD expression resumes in stage 9 embryos where strong staining is seen in all neuroblasts. Starting in stage 11, staining is observed in a subset of cells of the PNS. Nervous system staining persists through most of embryogenesis. No staining is observed in commissures or longitudinal connectives.

HmgD transcripts are observed mainly in embryos and adult females on northern blots.

HmgD transcripts are abundant in embryos and adult females.

Marker for
 
Subcellular Localization
CV Term
Polypeptide Expression
enzyme assay or biochemical detection
Stage
Tissue/Position (including subcellular localization)
Reference
immunolocalization
Stage
Tissue/Position (including subcellular localization)
Reference
mass spectroscopy
Stage
Tissue/Position (including subcellular localization)
Reference
Additional Descriptive Data

HmgD protein is present at a constant level throughout embryogenesis but the number of molecules per nucleus declines with each cleavage cycle.

HmgD protein is associated with condensed chromatin structures during the first six nuclear cleavage cycles of the developing embryo. Histone H1 is absent from these same structures.

Marker for
 
Subcellular Localization
CV Term
Evidence
References
located_in chromatin
inferred from direct assay
located_in nucleus
inferred from direct assay
Expression Deduced from Reporters
High-Throughput Expression Data
Associated Tools

JBrowse - Visual display of RNA-Seq signals

View Dmel\HmgD in JBrowse
RNA-Seq by Region - Search RNA-Seq expression levels by exon or genomic region
Reference
See Gelbart and Emmert, 2013 for analysis details and data files for all genes.
Developmental Proteome: Life Cycle
Developmental Proteome: Embryogenesis
External Data and Images
Linkouts
BDGP expression data - Patterns of gene expression in Drosophila embryogenesis
DRscDB - A single-cell RNA-seq resource for data mining and data comparison across species
EMBL-EBI Single Cell Expression Atlas - Single cell expression across species
FlyAtlas - Adult expression by tissue, using Affymetrix Dros2 array
FlyAtlas2 - A Drosophila melanogaster expression atlas with RNA-Seq, miRNA-Seq and sex-specific data
Fly-FISH - A database of Drosophila embryo and larvae mRNA localization patterns
Flygut - An atlas of the Drosophila adult midgut
Images
Alleles, Insertions, Transgenic Constructs, and Aberrations
Classical and Insertion Alleles ( 51 )
For All Classical and Insertion Alleles Show
 
Other relevant insertions
Transgenic Constructs ( 11 )
For All Alleles Carried on Transgenic Constructs Show
Transgenic constructs containing/affecting coding region of HmgD
Transgenic constructs containing regulatory region of HmgD
Aberrations (Deficiencies and Duplications) ( 2 )
Variants
Variant Molecular Consequences
Alleles Representing Disease-Implicated Variants
Phenotypes
Orthologs
Human Orthologs (via DIOPT v9.1)
Species\Gene Symbol
Score
Best Score
Best Reverse Score
Alignment
Complementation?
Transgene?
Homo sapiens (Human) (39)
6 of 14
Yes
No
1  
6 of 14
Yes
No
1  
5 of 14
No
No
4 of 14
No
Yes
3 of 14
No
No
3 of 14
No
No
1  
2 of 14
No
Yes
2 of 14
No
No
1  
2 of 14
No
No
1  
2 of 14
No
No
2 of 14
No
No
2 of 14
No
No
1  
2 of 14
No
No
2 of 14
No
No
1 of 14
No
No
1  
1 of 14
No
No
1  
1 of 14
No
No
2  
1 of 14
No
No
1 of 14
No
No
1 of 14
No
No
1 of 14
No
No
1 of 14
No
No
1 of 14
No
No
1 of 14
No
No
1 of 14
No
No
1 of 14
No
No
1 of 14
No
No
1  
1 of 14
No
No
1 of 14
No
No
1 of 14
No
No
1  
1 of 14
No
No
1 of 14
No
Yes
1 of 14
No
No
1 of 14
No
No
1 of 14
No
No
1  
1 of 14
No
No
1 of 14
No
No
1  
1 of 14
No
No
3  
1 of 14
No
No
Model Organism Orthologs (via DIOPT v9.1)
Species\Gene Symbol
Score
Best Score
Best Reverse Score
Alignment
Complementation?
Transgene?
Rattus norvegicus (Norway rat) (36)
6 of 14
Yes
No
5 of 14
No
No
5 of 14
No
No
5 of 14
No
No
5 of 14
No
No
5 of 14
No
No
4 of 14
No
No
4 of 14
No
Yes
4 of 14
No
No
4 of 14
No
No
4 of 14
No
No
3 of 14
No
Yes
3 of 14
No
No
2 of 14
No
No
2 of 14
No
No
2 of 14
No
No
2 of 14
No
No
2 of 14
No
No
2 of 14
No
No
2 of 14
No
No
2 of 14
No
No
1 of 14
No
No
1 of 14
No
No
1 of 14
No
No
1 of 14
No
No
1 of 14
No
Yes
1 of 14
No
No
1 of 14
No
Yes
1 of 14
No
Yes
1 of 14
No
Yes
1 of 14
No
No
1 of 14
No
Yes
1 of 14
No
No
1 of 14
No
No
1 of 14
No
No
1 of 14
No
No
Mus musculus (laboratory mouse) (20)
6 of 14
Yes
No
6 of 14
Yes
No
5 of 14
No
No
5 of 14
No
Yes
4 of 14
No
No
2 of 14
No
No
2 of 14
No
No
2 of 14
No
No
2 of 14
No
No
2 of 14
No
No
2 of 14
No
Yes
1 of 14
No
No
1 of 14
No
No
1 of 14
No
No
1 of 14
No
No
1 of 14
No
No
1 of 14
No
No
1 of 14
No
No
1 of 14
No
No
1 of 14
No
No
Xenopus tropicalis (Western clawed frog) (34)
4 of 13
Yes
No
4 of 13
Yes
No
4 of 13
Yes
No
2 of 13
No
Yes
2 of 13
No
No
2 of 13
No
No
2 of 13
No
No
2 of 13
No
No
1 of 13
No
No
1 of 13
No
No
1 of 13
No
No
1 of 13
No
No
1 of 13
No
No
1 of 13
No
No
1 of 13
No
No
1 of 13
No
No
1 of 13
No
No
1 of 13
No
No
1 of 13
No
No
1 of 13
No
No
1 of 13
No
No
1 of 13
No
No
1 of 13
No
No
1 of 13
No
No
1 of 13
No
No
1 of 13
No
No
1 of 13
No
No
1 of 13
No
No
1 of 13
No
No
1 of 13
No
No
1 of 13
No
No
1 of 13
No
No
1 of 13
No
No
1 of 13
No
No
Danio rerio (Zebrafish) (26)
6 of 14
Yes
No
6 of 14
Yes
No
6 of 14
Yes
No
6 of 14
Yes
No
5 of 14
No
No
4 of 14
No
No
4 of 14
No
No
2 of 14
No
No
2 of 14
No
No
2 of 14
No
No
2 of 14
No
No
2 of 14
No
No
2 of 14
No
No
2 of 14
No
No
1 of 14
No
No
1 of 14
No
No
1 of 14
No
No
1 of 14
No
No
1 of 14
No
Yes
1 of 14
No
No
1 of 14
No
No
1 of 14
No
No
1 of 14
No
No
1 of 14
No
No
1 of 14
No
No
1 of 14
No
No
Caenorhabditis elegans (Nematode, roundworm) (16)
8 of 14
Yes
Yes
5 of 14
No
No
2 of 14
No
No
2 of 14
No
No
2 of 14
No
No
2 of 14
No
No
1 of 14
No
Yes
1 of 14
No
No
1 of 14
No
No
1 of 14
No
No
1 of 14
No
No
1 of 14
No
No
1 of 14
No
No
1 of 14
No
No
1 of 14
No
No
1 of 14
No
No
Anopheles gambiae (African malaria mosquito) (19)
12 of 12
Yes
Yes
Arabidopsis thaliana (thale-cress) (15)
8 of 13
Yes
Yes
8 of 13
Yes
Yes
8 of 13
Yes
Yes
7 of 13
No
Yes
7 of 13
No
Yes
6 of 13
No
No
6 of 13
No
No
5 of 13
No
No
5 of 13
No
No
2 of 13
No
No
2 of 13
No
No
2 of 13
No
No
1 of 13
No
No
1 of 13
No
No
1 of 13
No
No
Saccharomyces cerevisiae (Brewer's yeast) (6)
6 of 13
Yes
No
5 of 13
No
Yes
5 of 13
No
No
5 of 13
No
No
4 of 13
No
Yes
3 of 13
No
Yes
Schizosaccharomyces pombe (Fission yeast) (4)
6 of 12
Yes
No
3 of 12
No
Yes
2 of 12
No
No
1 of 12
No
Yes
Escherichia coli (enterobacterium) (0)
Other Organism Orthologs (via OrthoDB)
Data provided directly from OrthoDB:HmgD. Refer to their site for version information.
Paralogs
Paralogs (via DIOPT v9.1)
Drosophila melanogaster (Fruit fly) (21)
9 of 13
6 of 13
5 of 13
5 of 13
3 of 13
3 of 13
3 of 13
2 of 13
1 of 13
1 of 13
1 of 13
1 of 13
1 of 13
1 of 13
1 of 13
1 of 13
1 of 13
1 of 13
1 of 13
1 of 13
1 of 13
Human Disease Associations
FlyBase Human Disease Model Reports
    Disease Ontology (DO) Annotations
    Models Based on Experimental Evidence ( 0 )
    Allele
    Disease
    Evidence
    References
    Potential Models Based on Orthology ( 1 )
    Human Ortholog
    Disease
    Evidence
    References
    Modifiers Based on Experimental Evidence ( 0 )
    Allele
    Disease
    Interaction
    References
    Disease Associations of Human Orthologs (via DIOPT v9.1 and OMIM)
    Note that ortholog calls supported by only 1 or 2 algorithms (DIOPT score < 3) are not shown.
    Functional Complementation Data
    Functional complementation data is computed by FlyBase using a combination of the orthology data obtained from DIOPT and OrthoDB and the allele-level genetic interaction data curated from the literature.
    Interactions
    Summary of Physical Interactions
    Interaction Browsers

    Please see the Physical Interaction reports below for full details
    protein-protein
    Physical Interaction
    Assay
    References
    Summary of Genetic Interactions
    Interaction Browsers

    Please look at the allele data for full details of the genetic interactions
    Starting gene(s)
    Interaction type
    Interacting gene(s)
    Reference
    Starting gene(s)
    Interaction type
    Interacting gene(s)
    Reference
    External Data
    Linkouts
    BioGRID - A database of protein and genetic interactions.
    DroID - A comprehensive database of gene and protein interactions.
    MIST (protein-protein) - An integrated Molecular Interaction Database
    Pathways
    Signaling Pathways (FlyBase)
    Metabolic Pathways
    FlyBase
    External Links
    External Data
    Linkouts
    Reactome - An open-source, open access, manually curated and peer-reviewed pathway database.
    Class of Gene
    Genomic Location and Detailed Mapping Data
    Chromosome (arm)
    2R
    Recombination map
    2-96
    Cytogenetic map
    Sequence location
    FlyBase Computed Cytological Location
    Cytogenetic map
    Evidence for location
    57F9-57F10
    ; Limits computationally determined from genome sequence between P{EP}Tim10EP541&P{lacW}l(2)k09617k09617 and P{PZ}l(2)0783707837
    Experimentally Determined Cytological Location
    Cytogenetic map
    Notes
    References
    57F8-57F11
    (determined by in situ hybridisation)
    Experimentally Determined Recombination Data
    Location
    Left of (cM)
    Right of (cM)
    Notes
    Stocks and Reagents
    Stocks (30)
    Genomic Clones (23)
    cDNA Clones (233)
     

    Please Note This section lists cDNAs and ESTs that fall within the genomic extent of the gene model, which may include cDNAs and ESTs of genes within introns, or of overlapping genes. Please see JBrowse for alignment of the cDNAs and ESTs to the gene model.

    cDNA clones, fully sequenced
    BDGP DGC clones
      Other clones
        Drosophila Genomics Resource Center cDNA clones

        For each fully sequenced cDNA the DGRC maintains various forms of the cDNA (e.g tagged or untagged) in several different host vectors for subsequent cloning and expression in Drosophila and Drosophila cell lines.

        cDNA Clones, End Sequenced (ESTs)
        BDGP DGC clones
          Other clones
            RNAi and Array Information
            Linkouts
            DRSC - Results frm RNAi screens
            Antibody Information
            Laboratory Generated Antibodies
             
            Commercially Available Antibodies
             
            Cell Line Information
            Publicly Available Cell Lines
             
              Other Stable Cell Lines
               
                Other Comments

                Candidate stable intronic sequence RNA (sisRNA) identified within 5'UTR of this gene.

                dsRNA has been made from templates generated with primers directed against this gene. RNAi of HmgD results in an almost complete block of primary dendrite extension and lateral branching of ddaD and ddaE neurons. RNAi also causes defects in muscle and defects in dendrite morphogenesis.

                RNAi generated by PCR using primers directed to this gene causes a cell growth and viability phenotype when assayed in Kc167 and S2R+ cells.

                The strength and selectivity of DNA binding by the HmgD HMG domain is modulated by the C-terminal basic and acidic domains.

                HmgD is an architecture specific protein that prefers to bind to deformable DNA sites, A/T rich DNA containing the readily deformable dinucleotide sequence TG. HmgD binds preferentially to pre-bent DNA sites and induces substantial bends in DNA.

                Immunological studies reveal that HmgD, either by itself or in conjunction with other chromosomal proteins, induces a condensed state of chromatin that is distinct from and less compact than the His1 containing 30nm fibre and this chromatin could facilitate rapid nuclear cycles.

                HmgD transcripts display a complex pattern of temporal and spatial localisation implying a specialised rather than general role during early development.

                An HmgD cDNA has been cloned and sequenced.

                Relationship to Other Genes
                Source for database merge of
                Additional comments
                Nomenclature History
                Source for database identify of
                Nomenclature comments
                Etymology
                Synonyms and Secondary IDs (15)
                Reported As
                Symbol Synonym
                anon-EST:Liang-1.9
                Secondary FlyBase IDs
                  Datasets (0)
                  Study focus (0)
                  Experimental Role
                  Project
                  Project Type
                  Title
                  Study result (0)
                  Result
                  Result Type
                  Title
                  External Crossreferences and Linkouts ( 60 )
                  Sequence Crossreferences
                  NCBI Gene - Gene integrates information from a wide range of species. A record may include nomenclature, Reference Sequences (RefSeqs), maps, pathways, variations, phenotypes, and links to genome-, phenotype-, and locus-specific resources worldwide.
                  GenBank Nucleotide - A collection of sequences from several sources, including GenBank, RefSeq, TPA, and PDB.
                  GenBank Protein - A collection of sequences from several sources, including translations from annotated coding regions in GenBank, RefSeq and TPA, as well as records from SwissProt, PIR, PRF, and PDB.
                  RefSeq - A comprehensive, integrated, non-redundant, well-annotated set of reference sequences including genomic, transcript, and protein.
                  UniProt/GCRP - The gene-centric reference proteome (GCRP) provides a 1:1 mapping between genes and UniProt accessions in which a single 'canonical' isoform represents the product(s) of each protein-coding gene.
                  UniProt/Swiss-Prot - Manually annotated and reviewed records of protein sequence and functional information
                  UniProt/TrEMBL - Automatically annotated and unreviewed records of protein sequence and functional information
                  Other crossreferences
                  AlphaFold DB - AlphaFold provides open access to protein structure predictions for the human proteome and other key proteins of interest, to accelerate scientific research.
                  BDGP expression data - Patterns of gene expression in Drosophila embryogenesis
                  DRscDB - A single-cell RNA-seq resource for data mining and data comparison across species
                  EMBL-EBI Single Cell Expression Atlas - Single cell expression across species
                  FlyAtlas2 - A Drosophila melanogaster expression atlas with RNA-Seq, miRNA-Seq and sex-specific data
                  FlyMine - An integrated database for Drosophila genomics
                  KEGG Genes - Molecular building blocks of life in the genomic space.
                  MARRVEL_MODEL - MARRVEL (model organism gene)
                  PDB - An information portal to biological macromolecular structures
                  Linkouts
                  BioGRID - A database of protein and genetic interactions.
                  Drosophila Genomics Resource Center - Drosophila Genomics Resource Center (DGRC) cDNA clones
                  DroID - A comprehensive database of gene and protein interactions.
                  DRSC - Results frm RNAi screens
                  Eukaryotic Promoter Database - A collection of databases of experimentally validated promoters for selected model organisms.
                  FlyAtlas - Adult expression by tissue, using Affymetrix Dros2 array
                  FlyCyc Genes - Genes from a BioCyc PGDB for Dmel
                  Fly-FISH - A database of Drosophila embryo and larvae mRNA localization patterns
                  Flygut - An atlas of the Drosophila adult midgut
                  FlyMet - A comprehensive tissue-specific metabolomics resource for Drosophila.
                  Interactive Fly - A cyberspace guide to Drosophila development and metazoan evolution
                  MIST (protein-protein) - An integrated Molecular Interaction Database
                  Reactome - An open-source, open access, manually curated and peer-reviewed pathway database.
                  References (132)