FB2026_01 , released March 12, 2026
FB2026_01 , released March 12, 2026
Gene: Dmel\mil
Open Close
General Information
Symbol
Dmel\mil
Species
D. melanogaster
Name
milkah
Annotation Symbol
CG5017
Feature Type
FlyBase ID
FBgn0267366
Gene Model Status
Stock Availability
Gene Summary
milkah (mil) encodes a nucleosome assembly factor of the Nap family. It is involved in spermatogenesis, long-term memory formation and male courtship. [Date last reviewed: 2019-09-12] (FlyBase Gene Snapshot)
Also Known As

hanabi

Key Links
Genomic Location
Cytogenetic map
Sequence location
Recombination map
3-94
RefSeq locus
NT_033777 REGION:27912447..27913785
Sequence
Genomic Maps
Other Genome Views
The following external sites may use different assemblies or annotations than FlyBase.
Function
Gene Ontology (GO) Annotations (10 terms)
Molecular Function (2 terms)
Terms Based on Experimental Evidence (1 term)
CV Term
Evidence
References
inferred from direct assay
Terms Based on Predictions or Assertions (2 terms)
CV Term
Evidence
References
inferred from biological aspect of ancestor with PANTHER:PTN000221934
inferred from biological aspect of ancestor with PANTHER:PTN000221934
Biological Process (4 terms)
Terms Based on Experimental Evidence (4 terms)
CV Term
Evidence
References
inferred from mutant phenotype
involved_in memory
inferred from mutant phenotype
inferred from direct assay
involved_in spermatogenesis
inferred from mutant phenotype
Terms Based on Predictions or Assertions (1 term)
CV Term
Evidence
References
inferred from biological aspect of ancestor with PANTHER:PTN000221935
inferred from electronic annotation with InterPro:IPR002164
Cellular Component (4 terms)
Terms Based on Experimental Evidence (2 terms)
CV Term
Evidence
References
located_in cytosol
inferred from direct assay
located_in Nebenkern
inferred from direct assay
Terms Based on Predictions or Assertions (2 terms)
CV Term
Evidence
References
is_active_in chromatin
inferred from biological aspect of ancestor with PANTHER:PTN000221934
is_active_in nucleus
inferred from biological aspect of ancestor with PANTHER:PTN000221934
located_in nucleus
inferred from electronic annotation with InterPro:IPR002164
Gene Group (FlyBase)
Protein Family (UniProt)
-
Summaries
Gene Snapshot
milkah (mil) encodes a nucleosome assembly factor of the Nap family. It is involved in spermatogenesis, long-term memory formation and male courtship. [Date last reviewed: 2019-09-12]
Gene Model and Products
Number of Transcripts
1
Number of Unique Polypeptides
1

Please see the JBrowse view of Dmel\mil for information on other features

To submit a correction to a gene model please use the Contact FlyBase form

Protein Domains (via Pfam)
Isoform displayed:
Pfam protein domains
InterPro name
classification
start
end
Protein Domains (via SMART)
Isoform displayed:
SMART protein domains
InterPro name
classification
start
end
Structure
Protein 3D structure   (Predicted by AlphaFold)   (AlphaFold entry Q9VAZ1)

If you don't see a structure in the viewer, refresh your browser.
Model Confidence:
  • Very high (pLDDT > 90)
  • Confident (90 > pLDDT > 70)
  • Low (70 > pLDDT > 50)
  • Very low (pLDDT < 50)

AlphaFold produces a per-residue confidence score (pLDDT) between 0 and 100. Some regions with low pLDDT may be unstructured in isolation.

Experimentally Determined Structures
Crossreferences
Comments on Gene Model

Gene model reviewed during 5.49

Low-frequency RNA-Seq exon junction(s) not annotated.

Transcript Data
Annotated Transcripts
Name
FlyBase ID
RefSeq ID
Length (nt)
Assoc. CDS (aa)
FBtr0085246
1097
283
Additional Transcript Data and Comments
Reported size (kB)
Comments
External Data
Crossreferences
Polypeptide Data
Annotated Polypeptides
Name
FlyBase ID
Predicted MW (kDa)
Length (aa)
Theoretical pI
UniProt
RefSeq ID
GenBank
FBpp0084615
33.2
283
4.28
Polypeptides with Identical Sequences

There is only one protein coding transcript and one polypeptide associated with this gene

Additional Polypeptide Data and Comments
Reported size (kDa)
Comments
External Data
Crossreferences
InterPro - A database of protein families, domains and functional sites
Linkouts
Sequences Consistent with the Gene Model
Mapped Features

Click to get a list of regulatory features (enhancers, TFBS, etc.) and gene disruptions (point mutations, indels, etc.) within or overlapping Dmel\mil using the Feature Mapper tool.

External Data
Crossreferences
Eukaryotic Promoter Database - A collection of databases of experimentally validated promoters for selected model organisms.
Linkouts
Expression Data
Testis-specificity index

The testis specificity index was calculated from modENCODE tissue expression data by Vedelek et al., 2018 to indicate the degree of testis enrichment compared to other tissues. Scores range from -2.52 (underrepresented) to 5.2 (very high testis bias).

4.09

Transcript Expression
Additional Descriptive Data
Marker for
 
Subcellular Localization
CV Term
Polypeptide Expression
immunolocalization
Stage
Tissue/Position (including subcellular localization)
Reference
mass spectroscopy
Stage
Tissue/Position (including subcellular localization)
Reference
Additional Descriptive Data

mil localizes to the apical tip of mature sperm cells.

Marker for
 
Subcellular Localization
CV Term
Evidence
References
located_in cytosol
inferred from direct assay
located_in Nebenkern
inferred from direct assay
Expression Deduced from Reporters
High-Throughput Expression Data
Associated Tools

JBrowse - Visual display of RNA-Seq signals

View Dmel\mil in JBrowse
RNA-Seq by Region - Search RNA-Seq expression levels by exon or genomic region
Reference
See Gelbart and Emmert, 2013 for analysis details and data files for all genes.
Developmental Proteome: Life Cycle
Developmental Proteome: Embryogenesis
External Data and Images
Linkouts
BDGP expression data - Patterns of gene expression in Drosophila embryogenesis
DRscDB - A single-cell RNA-seq resource for data mining and data comparison across species
EMBL-EBI Single Cell Expression Atlas - Single cell expression across species
FlyAtlas - Adult expression by tissue, using Affymetrix Dros2 array
FlyAtlas2 - A Drosophila melanogaster expression atlas with RNA-Seq, miRNA-Seq and sex-specific data
Fly-FISH - A database of Drosophila embryo and larvae mRNA localization patterns
Images
Alleles, Insertions, Transgenic Constructs, and Aberrations
Classical and Insertion Alleles ( 5 )
For All Classical and Insertion Alleles Show
 
Other relevant insertions
Transgenic Constructs ( 5 )
For All Alleles Carried on Transgenic Constructs Show
Transgenic constructs containing/affecting coding region of mil
Transgenic constructs containing regulatory region of mil
Aberrations (Deficiencies and Duplications) ( 0 )
Inferred from experimentation ( 0 )
Inferred from location ( 0 )
    Variants
    Variant Molecular Consequences
    Alleles Representing Disease-Implicated Variants
    Phenotypes
    Orthologs
    Human Orthologs (via DIOPT v9.1)
    Species\Gene Symbol
    Score
    Best Score
    Best Reverse Score
    Alignment
    Complementation?
    Transgene?
    Homo sapiens (Human) (20)
    6 of 14
    Yes
    No
    1  
    6 of 14
    Yes
    No
    5 of 14
    No
    No
    5 of 14
    No
    No
    3 of 14
    No
    No
    2 of 14
    No
    No
    2 of 14
    No
    No
    2 of 14
    No
    No
    2 of 14
    No
    No
    2 of 14
    No
    No
    2 of 14
    No
    No
    2 of 14
    No
    No
    2 of 14
    No
    No
    2 of 14
    No
    No
    2 of 14
    No
    No
    2 of 14
    No
    No
    2 of 14
    No
    No
    1 of 14
    No
    No
    1 of 14
    No
    No
    1 of 14
    No
    No
    Model Organism Orthologs (via DIOPT v9.1)
    Species\Gene Symbol
    Score
    Best Score
    Best Reverse Score
    Alignment
    Complementation?
    Transgene?
    Rattus norvegicus (Norway rat) (16)
    6 of 14
    Yes
    No
    5 of 14
    No
    No
    5 of 14
    No
    No
    4 of 14
    No
    No
    2 of 14
    No
    No
    2 of 14
    No
    No
    2 of 14
    No
    No
    2 of 14
    No
    No
    2 of 14
    No
    No
    2 of 14
    No
    No
    2 of 14
    No
    No
    2 of 14
    No
    No
    1 of 14
    No
    No
    1 of 14
    No
    No
    1 of 14
    No
    Yes
    1 of 14
    No
    No
    Mus musculus (laboratory mouse) (11)
    6 of 14
    Yes
    No
    5 of 14
    No
    No
    5 of 14
    No
    No
    4 of 14
    No
    No
    2 of 14
    No
    No
    2 of 14
    No
    No
    2 of 14
    No
    No
    2 of 14
    No
    No
    2 of 14
    No
    No
    2 of 14
    No
    No
    2 of 14
    No
    No
    Xenopus tropicalis (Western clawed frog) (5)
    5 of 13
    Yes
    No
    2 of 13
    No
    No
    1 of 13
    No
    No
    1 of 13
    No
    No
    1 of 13
    No
    No
    Danio rerio (Zebrafish) (6)
    6 of 14
    Yes
    No
    5 of 14
    No
    No
    5 of 14
    No
    No
    2 of 14
    No
    No
    1 of 14
    No
    No
    1 of 14
    No
    No
    Caenorhabditis elegans (Nematode, roundworm) (2)
    6 of 14
    Yes
    No
    1 of 14
    No
    No
    Anopheles gambiae (African malaria mosquito) (3)
    Arabidopsis thaliana (thale-cress) (6)
    7 of 13
    Yes
    No
    7 of 13
    Yes
    No
    7 of 13
    Yes
    No
    7 of 13
    Yes
    No
    1 of 13
    No
    No
    1 of 13
    No
    No
    Saccharomyces cerevisiae (Brewer's yeast) (2)
    7 of 13
    Yes
    No
    1 of 13
    No
    No
    Schizosaccharomyces pombe (Fission yeast) (3)
    5 of 12
    Yes
    No
    5 of 12
    Yes
    No
    1 of 12
    No
    No
    Escherichia coli (enterobacterium) (0)
    Other Organism Orthologs (via OrthoDB)
    Data provided directly from OrthoDB:mil. Refer to their site for version information.
    Paralogs
    Paralogs (via DIOPT v9.1)
    Drosophila melanogaster (Fruit fly) (3)
    7 of 13
    7 of 13
    3 of 13
    Human Disease Associations
    FlyBase Human Disease Model Reports
      Disease Ontology (DO) Annotations
      Models Based on Experimental Evidence ( 0 )
      Allele
      Disease
      Evidence
      References
      Potential Models Based on Orthology ( 0 )
      Human Ortholog
      Disease
      Evidence
      References
      Modifiers Based on Experimental Evidence ( 0 )
      Allele
      Disease
      Interaction
      References
      Disease Associations of Human Orthologs (via DIOPT v9.1 and OMIM)
      Note that ortholog calls supported by only 1 or 2 algorithms (DIOPT score < 3) are not shown.
      Functional Complementation Data
      Functional complementation data is computed by FlyBase using a combination of the orthology data obtained from DIOPT and OrthoDB and the allele-level genetic interaction data curated from the literature.
      Interactions
      Summary of Physical Interactions
      Interaction Browsers

      Please see the Physical Interaction reports below for full details
      protein-protein
      Physical Interaction
      Assay
      References
      Summary of Genetic Interactions
      Interaction Browsers
      Starting gene(s)
      Interaction type
      Interacting gene(s)
      Reference
      Starting gene(s)
      Interaction type
      Interacting gene(s)
      Reference
      External Data
      Linkouts
      DroID - A comprehensive database of gene and protein interactions.
      MIST (protein-protein) - An integrated Molecular Interaction Database
      Pathways
      Signaling Pathways (FlyBase)
      Metabolic Pathways
      FlyBase
      External Links
      External Data
      Linkouts
      Class of Gene
      Genomic Location and Detailed Mapping Data
      Chromosome (arm)
      3R
      Recombination map
      3-94
      Cytogenetic map
      Sequence location
      FlyBase Computed Cytological Location
      Cytogenetic map
      Evidence for location
      98B5-98B5
      Limits computationally determined from genome sequence between P{PZ}btzrL203 and P{PZ}l(3)0648706487
      Experimentally Determined Cytological Location
      Cytogenetic map
      Notes
      References
      Experimentally Determined Recombination Data
      Location
      Left of (cM)
      Right of (cM)
      Notes
      Stocks and Reagents
      Stocks (23)
      Genomic Clones (19)
      cDNA Clones (10)
       

      Please Note This section lists cDNAs and ESTs that fall within the genomic extent of the gene model, which may include cDNAs and ESTs of genes within introns, or of overlapping genes. Please see JBrowse for alignment of the cDNAs and ESTs to the gene model.

      cDNA clones, fully sequenced
      BDGP DGC clones
      Other clones
      Drosophila Genomics Resource Center cDNA clones

      For each fully sequenced cDNA the DGRC maintains various forms of the cDNA (e.g tagged or untagged) in several different host vectors for subsequent cloning and expression in Drosophila and Drosophila cell lines.

      cDNA Clones, End Sequenced (ESTs)
      BDGP DGC clones
      RNAi and Array Information
      Linkouts
      Antibody Information
      Laboratory Generated Antibodies
       

      polyclonal

      Commercially Available Antibodies
       
      Cell Line Information
      Publicly Available Cell Lines
       
        Other Stable Cell Lines
         
          Other Comments
          Relationship to Other Genes
          Source for database merge of

          Source for merge of: mil hanabi

          Additional comments
          Nomenclature History
          Source for database identify of

          Source for identity of: hanabi CG5017

          Nomenclature comments
          Etymology

          The gene is named "Hanabi", the Japanese term for "fireworks" based on the mutant phenotype of scattered nuclei during spermatid elongation.

          Given the name "milkah", after one of Pavlov's dogs.

          Synonyms and Secondary IDs (9)
          Datasets (0)
          Study focus (0)
          Experimental Role
          Project
          Project Type
          Title
          Study result (0)
          Result
          Result Type
          Title
          External Crossreferences and Linkouts ( 28 )
          Sequence Crossreferences
          NCBI Gene - Gene integrates information from a wide range of species. A record may include nomenclature, Reference Sequences (RefSeqs), maps, pathways, variations, phenotypes, and links to genome-, phenotype-, and locus-specific resources worldwide.
          GenBank Nucleotide - A collection of sequences from several sources, including GenBank, RefSeq, TPA, and PDB.
          GenBank Protein - A collection of sequences from several sources, including translations from annotated coding regions in GenBank, RefSeq and TPA, as well as records from SwissProt, PIR, PRF, and PDB.
          RefSeq - A comprehensive, integrated, non-redundant, well-annotated set of reference sequences including genomic, transcript, and protein.
          UniProt/GCRP - The gene-centric reference proteome (GCRP) provides a 1:1 mapping between genes and UniProt accessions in which a single 'canonical' isoform represents the product(s) of each protein-coding gene.
          UniProt/TrEMBL - Automatically annotated and unreviewed records of protein sequence and functional information
          Other crossreferences
          AlphaFold DB - AlphaFold provides open access to protein structure predictions for the human proteome and other key proteins of interest, to accelerate scientific research.
          BDGP expression data - Patterns of gene expression in Drosophila embryogenesis
          DRscDB - A single-cell RNA-seq resource for data mining and data comparison across species
          EMBL-EBI Single Cell Expression Atlas - Single cell expression across species
          FlyAtlas2 - A Drosophila melanogaster expression atlas with RNA-Seq, miRNA-Seq and sex-specific data
          InterPro - A database of protein families, domains and functional sites
          KEGG Genes - Molecular building blocks of life in the genomic space.
          MARRVEL_MODEL - MARRVEL (model organism gene)
          Linkouts
          Drosophila Genomics Resource Center - Drosophila Genomics Resource Center (DGRC) cDNA clones
          DroID - A comprehensive database of gene and protein interactions.
          Eukaryotic Promoter Database - A collection of databases of experimentally validated promoters for selected model organisms.
          FlyAtlas - Adult expression by tissue, using Affymetrix Dros2 array
          FlyCyc Genes - Genes from a BioCyc PGDB for Dmel
          Fly-FISH - A database of Drosophila embryo and larvae mRNA localization patterns
          MIST (protein-protein) - An integrated Molecular Interaction Database
          References (42)