FB2026_01 , released March 12, 2026
FB2026_01 , released March 12, 2026
Gene: Dmel\nau
Open Close
General Information
Symbol
Dmel\nau
Species
D. melanogaster
Name
nautilus
Annotation Symbol
CG10250
Feature Type
FlyBase ID
FBgn0002922
Gene Model Status
Stock Availability
Gene Summary
nautilus (nau) encodes a protein that belongs to the bHLH family of transcription factors. It is the unique Drosophila ortholog of the mammalian family of bHLH Myogenic Regulatory Factors. It is involved in somatic muscle myogenesis. [Date last reviewed: 2019-03-14] (FlyBase Gene Snapshot)
Also Known As

MyoD, Dmyd

Key Links
Genomic Location
Cytogenetic map
Sequence location
Recombination map
3-80
RefSeq locus
NT_033777 REGION:23712814..23718357
Sequence
Genomic Maps
Other Genome Views
The following external sites may use different assemblies or annotations than FlyBase.
Function
Gene Ontology (GO) Annotations (10 terms)
Molecular Function (3 terms)
Terms Based on Experimental Evidence (0 terms)
Terms Based on Predictions or Assertions (3 terms)
CV Term
Evidence
References
Biological Process (5 terms)
Terms Based on Experimental Evidence (2 terms)
CV Term
Evidence
References
inferred from mutant phenotype
inferred from mutant phenotype
Terms Based on Predictions or Assertions (4 terms)
CV Term
Evidence
References
inferred from electronic annotation with InterPro:IPR039704
traceable author statement
Cellular Component (2 terms)
Terms Based on Experimental Evidence (2 terms)
CV Term
Evidence
References
located_in cytoplasm
inferred from direct assay
located_in nucleus
inferred from direct assay
Terms Based on Predictions or Assertions (1 term)
CV Term
Evidence
References
is_active_in nucleus
inferred from biological aspect of ancestor with PANTHER:PTN000164853
Protein Family (UniProt)
-
Summaries
Gene Snapshot
nautilus (nau) encodes a protein that belongs to the bHLH family of transcription factors. It is the unique Drosophila ortholog of the mammalian family of bHLH Myogenic Regulatory Factors. It is involved in somatic muscle myogenesis. [Date last reviewed: 2019-03-14]
Gene Group (FlyBase)
BASIC HELIX-LOOP-HELIX TRANSCRIPTION FACTORS -
Basic helix-loop-helix (bHLH) transcription factors are sequence-specific DNA-binding proteins that regulate transcription. They are characterized by a 60 amino acid region comprising a basic DNA binding domain followed by a HLH motif formed from two amphipathic α-helices connected by a loop. bHLH transcription factors form homo- and hetero-dimeric complexes, which bind to a E box consensus sequence. (Adapted from PMID:15186484).
Protein Function (UniProtKB)
May play an important role in the early development of muscle.
(UniProt, P22816)
Summary (Interactive Fly)

transcription factor - myogenic bHLH factor - ortholog of the mammalian family of bHLH Myogenic Regulatory Factors - involved in somatic muscle myogenesis

Gene Model and Products
Number of Transcripts
2
Number of Unique Polypeptides
2

Please see the JBrowse view of Dmel\nau for information on other features

To submit a correction to a gene model please use the Contact FlyBase form

Protein Domains (via Pfam)
Isoform displayed:
Pfam protein domains
InterPro name
classification
start
end
Protein Domains (via SMART)
Isoform displayed:
SMART protein domains
InterPro name
classification
start
end
Structure
Protein 3D structure   (Predicted by AlphaFold)   (AlphaFold entry P22816)

If you don't see a structure in the viewer, refresh your browser.
Model Confidence:
  • Very high (pLDDT > 90)
  • Confident (90 > pLDDT > 70)
  • Low (70 > pLDDT > 50)
  • Very low (pLDDT < 50)

AlphaFold produces a per-residue confidence score (pLDDT) between 0 and 100. Some regions with low pLDDT may be unstructured in isolation.

Experimentally Determined Structures
Crossreferences
Comments on Gene Model

Gene model reviewed during 5.47

Transcript Data
Annotated Transcripts
Name
FlyBase ID
RefSeq ID
Length (nt)
Assoc. CDS (aa)
FBtr0084472
1532
332
FBtr0301445
1520
328
Additional Transcript Data and Comments
Reported size (kB)

1.5 (northern blot)

Comments
External Data
Crossreferences
Polypeptide Data
Annotated Polypeptides
Name
FlyBase ID
Predicted MW (kDa)
Length (aa)
Theoretical pI
UniProt
RefSeq ID
GenBank
FBpp0083863
36.2
332
8.20
FBpp0290660
35.8
328
8.20
Polypeptides with Identical Sequences

None of the polypeptides share 100% sequence identity.

Additional Polypeptide Data and Comments
Reported size (kDa)

332 (aa); 36 (kD predicted)

Comments
External Data
Subunit Structure (UniProtKB)

Efficient DNA binding requires dimerization with another bHLH protein.

(UniProt, P22816)
Linkouts
Sequences Consistent with the Gene Model
Mapped Features

Click to get a list of regulatory features (enhancers, TFBS, etc.) and gene disruptions (point mutations, indels, etc.) within or overlapping Dmel\nau using the Feature Mapper tool.

External Data
Crossreferences
Eukaryotic Promoter Database - A collection of databases of experimentally validated promoters for selected model organisms.
Linkouts
Expression Data
Testis-specificity index

The testis specificity index was calculated from modENCODE tissue expression data by Vedelek et al., 2018 to indicate the degree of testis enrichment compared to other tissues. Scores range from -2.52 (underrepresented) to 5.2 (very high testis bias).

-0.66

Transcript Expression
No Assay Recorded
Stage
Tissue/Position (including subcellular localization)
Reference
in situ
Stage
Tissue/Position (including subcellular localization)
Reference
embryonic/larval hypodermal muscle cell | precursor | subset

Comment: dorsal, lateral, and ventral muscle precursors

northern blot
Stage
Tissue/Position (including subcellular localization)
Reference
Additional Descriptive Data

nau is expressed in clusters of cells in the outermost layer of the somatic mesoderm. In neurogenic mutants, nau continues to be expressed in these clusters, but the clusters contain many more cells.

nau transcripts are detected in embryos and larvae by northern blot analysis. They peak in 9-12hr embryos and then drastically decline.

Marker for
 
Subcellular Localization
CV Term
Polypeptide Expression
immunolocalization
Stage
Tissue/Position (including subcellular localization)
Reference
Additional Descriptive Data

nau protein is first detected at embryonic stage 11 by immunolocalization. In each segment, clusters of nuclei on either side of the midline and a smaller lateral cluster are stained. Another small cluster appears later in a dorsal position in the segments from the labium to A7. During stage 11, the number of nau-expressing cells increases 2-3 fold to a total of 30-40 per hemisegment. Differences in the number and pattern of stained cells between the abdominal and more anterior segments are observed. As dorsal closure begins, the level of staining decreases rapidly. Only the nuclei of some muscle precursors and some clusters in the head continue to show high levels of nau protein. After dorsal closure, weak staining is seen in a variety of muscles. Stronger staining persists in the pharyngeal muscles and in muscles of the anterior spiracles and telson.

Marker for
 
Subcellular Localization
CV Term
Evidence
References
located_in cytoplasm
inferred from direct assay
located_in nucleus
inferred from direct assay
Expression Deduced from Reporters
High-Throughput Expression Data
Associated Tools

JBrowse - Visual display of RNA-Seq signals

View Dmel\nau in JBrowse
RNA-Seq by Region - Search RNA-Seq expression levels by exon or genomic region
Reference
See Gelbart and Emmert, 2013 for analysis details and data files for all genes.
Developmental Proteome: Life Cycle
Developmental Proteome: Embryogenesis
External Data and Images
Linkouts
BDGP expression data - Patterns of gene expression in Drosophila embryogenesis
DRscDB - A single-cell RNA-seq resource for data mining and data comparison across species
EMBL-EBI Single Cell Expression Atlas - Single cell expression across species
FlyAtlas - Adult expression by tissue, using Affymetrix Dros2 array
FlyAtlas2 - A Drosophila melanogaster expression atlas with RNA-Seq, miRNA-Seq and sex-specific data
Flygut - An atlas of the Drosophila adult midgut
Images
Alleles, Insertions, Transgenic Constructs, and Aberrations
Classical and Insertion Alleles ( 5 )
For All Classical and Insertion Alleles Show
 
Other relevant insertions
Transgenic Constructs ( 14 )
For All Alleles Carried on Transgenic Constructs Show
Transgenic constructs containing/affecting coding region of nau
Transgenic constructs containing regulatory region of nau
Aberrations (Deficiencies and Duplications) ( 8 )
Variants
Variant Molecular Consequences
Alleles Representing Disease-Implicated Variants
Phenotypes
For more details about a specific phenotype click on the relevant allele symbol.
Lethality
Allele
Sterility
Allele
Other Phenotypes
Allele
Phenotype manifest in
Allele
Orthologs
Human Orthologs (via DIOPT v9.1)
Species\Gene Symbol
Score
Best Score
Best Reverse Score
Alignment
Complementation?
Transgene?
Homo sapiens (Human) (49)
11 of 14
Yes
Yes
10 of 14
No
Yes
6 of 14
No
Yes
1  
6 of 14
No
Yes
1  
1 of 14
No
No
2  
1 of 14
No
No
1 of 14
No
No
1 of 14
No
No
1  
1 of 14
No
No
1 of 14
No
No
2  
1 of 14
No
No
1 of 14
No
No
1 of 14
No
No
1 of 14
No
No
1  
1 of 14
No
No
1  
1 of 14
No
No
1  
1 of 14
No
No
1 of 14
No
No
1 of 14
No
No
1  
1 of 14
No
No
1 of 14
No
No
1 of 14
No
No
1 of 14
No
No
1 of 14
No
No
1 of 14
No
No
1 of 14
No
No
1 of 14
No
No
1 of 14
No
No
1 of 14
No
No
1 of 14
No
No
1 of 14
No
No
2  
1 of 14
No
No
1 of 14
No
No
1  
1 of 14
No
No
1 of 14
No
No
1  
1 of 14
No
No
1 of 14
No
No
1 of 14
No
No
1 of 14
No
No
1 of 14
No
No
5  
1 of 14
No
No
1  
1 of 14
No
No
1  
1 of 14
No
No
1 of 14
No
No
1 of 14
No
No
1 of 14
No
No
1 of 14
No
No
1 of 14
No
No
1  
1 of 14
No
No
1  
Model Organism Orthologs (via DIOPT v9.1)
Species\Gene Symbol
Score
Best Score
Best Reverse Score
Alignment
Complementation?
Transgene?
Rattus norvegicus (Norway rat) (43)
11 of 14
Yes
Yes
10 of 14
No
Yes
7 of 14
No
Yes
6 of 14
No
Yes
1 of 14
No
No
1 of 14
No
No
1 of 14
No
No
1 of 14
No
No
1 of 14
No
No
1 of 14
No
No
1 of 14
No
No
1 of 14
No
No
1 of 14
No
No
1 of 14
No
No
1 of 14
No
No
1 of 14
No
No
1 of 14
No
No
1 of 14
No
No
1 of 14
No
No
1 of 14
No
No
1 of 14
No
No
1 of 14
No
No
1 of 14
No
No
1 of 14
No
No
1 of 14
No
No
1 of 14
No
Yes
1 of 14
No
No
1 of 14
No
No
1 of 14
No
No
1 of 14
No
No
1 of 14
No
No
1 of 14
No
No
1 of 14
No
No
1 of 14
No
No
1 of 14
No
No
1 of 14
No
No
1 of 14
No
No
1 of 14
No
No
1 of 14
No
No
1 of 14
No
No
1 of 14
No
No
1 of 14
No
No
1 of 14
No
No
Mus musculus (laboratory mouse) (44)
11 of 14
Yes
Yes
10 of 14
No
Yes
7 of 14
No
Yes
6 of 14
No
Yes
1 of 14
No
No
1 of 14
No
No
1 of 14
No
No
1 of 14
No
No
1 of 14
No
No
1 of 14
No
No
6  
1 of 14
No
No
4  
1 of 14
No
No
1 of 14
No
No
1 of 14
No
No
1 of 14
No
No
1 of 14
No
No
1 of 14
No
No
1 of 14
No
No
1 of 14
No
No
1 of 14
No
No
1 of 14
No
No
1 of 14
No
No
1 of 14
No
No
1 of 14
No
No
1 of 14
No
No
1 of 14
No
No
1 of 14
No
No
1 of 14
No
No
1 of 14
No
No
1  
1 of 14
No
No
1 of 14
No
No
3  
1 of 14
No
No
1  
1 of 14
No
No
1 of 14
No
No
1 of 14
No
No
1 of 14
No
No
1 of 14
No
No
1 of 14
No
No
1 of 14
No
No
1 of 14
No
No
1 of 14
No
No
1 of 14
No
No
1 of 14
No
No
1 of 14
No
No
Xenopus tropicalis (Western clawed frog) (34)
5 of 13
Yes
Yes
4 of 13
No
Yes
2 of 13
No
Yes
1 of 13
No
No
1 of 13
No
No
1 of 13
No
No
1 of 13
No
No
1 of 13
No
No
1 of 13
No
No
1 of 13
No
No
1 of 13
No
Yes
1 of 13
No
No
1 of 13
No
No
1 of 13
No
No
1 of 13
No
No
1 of 13
No
No
1 of 13
No
No
1 of 13
No
No
1 of 13
No
No
1 of 13
No
Yes
1 of 13
No
Yes
1 of 13
No
Yes
1 of 13
No
No
1 of 13
No
No
1 of 13
No
No
1 of 13
No
No
1 of 13
No
No
1 of 13
No
No
1 of 13
No
No
1 of 13
No
No
1 of 13
No
No
1 of 13
No
No
1 of 13
No
No
1 of 13
No
No
Danio rerio (Zebrafish) (51)
11 of 14
Yes
Yes
10 of 14
No
Yes
6 of 14
No
Yes
6 of 14
No
Yes
1 of 14
No
No
1 of 14
No
No
1 of 14
No
No
1 of 14
No
No
1 of 14
No
No
1 of 14
No
No
1 of 14
No
No
1 of 14
No
No
1 of 14
No
No
1 of 14
No
No
1 of 14
No
No
1 of 14
No
No
1 of 14
No
No
1 of 14
No
No
1 of 14
No
No
1 of 14
No
No
1 of 14
No
No
1 of 14
No
No
1 of 14
No
No
1 of 14
No
No
1 of 14
No
No
1 of 14
No
No
1 of 14
No
No
1 of 14
No
No
1 of 14
No
No
1 of 14
No
No
1 of 14
No
No
1 of 14
No
No
1 of 14
No
No
1 of 14
No
No
1 of 14
No
No
1 of 14
No
No
1 of 14
No
No
1 of 14
No
No
1 of 14
No
No
1 of 14
No
No
1 of 14
No
No
1 of 14
No
No
1 of 14
No
No
1 of 14
No
No
1 of 14
No
No
1 of 14
No
No
1 of 14
No
No
1 of 14
No
No
1 of 14
No
No
Caenorhabditis elegans (Nematode, roundworm) (17)
7 of 14
Yes
Yes
1 of 14
No
No
1 of 14
No
No
1 of 14
No
No
1 of 14
No
No
1 of 14
No
No
1 of 14
No
No
1 of 14
No
No
1 of 14
No
No
1 of 14
No
No
1 of 14
No
No
1 of 14
No
No
1 of 14
No
No
1 of 14
No
No
1 of 14
No
No
1 of 14
No
No
1 of 14
No
No
Anopheles gambiae (African malaria mosquito) (22)
Arabidopsis thaliana (thale-cress) (1)
1 of 13
Yes
No
Saccharomyces cerevisiae (Brewer's yeast) (0)
Schizosaccharomyces pombe (Fission yeast) (0)
Escherichia coli (enterobacterium) (0)
Other Organism Orthologs (via OrthoDB)
Data provided directly from OrthoDB:nau. Refer to their site for version information.
Paralogs
Paralogs (via DIOPT v9.1)
Drosophila melanogaster (Fruit fly) (22)
1 of 13
1 of 13
1 of 13
1 of 13
1 of 13
1 of 13
1 of 13
1 of 13
1 of 13
1 of 13
1 of 13
1 of 13
1 of 13
1 of 13
1 of 13
1 of 13
1 of 13
1 of 13
1 of 13
1 of 13
1 of 13
1 of 13
Human Disease Associations
FlyBase Human Disease Model Reports
    Disease Ontology (DO) Annotations
    Models Based on Experimental Evidence ( 0 )
    Allele
    Disease
    Evidence
    References
    Potential Models Based on Orthology ( 1 )
    Human Ortholog
    Disease
    Evidence
    References
    Modifiers Based on Experimental Evidence ( 0 )
    Allele
    Disease
    Interaction
    References
    Disease Associations of Human Orthologs (via DIOPT v9.1 and OMIM)
    Note that ortholog calls supported by only 1 or 2 algorithms (DIOPT score < 3) are not shown.
    Functional Complementation Data
    Functional complementation data is computed by FlyBase using a combination of the orthology data obtained from DIOPT and OrthoDB and the allele-level genetic interaction data curated from the literature.
    Interactions
    Summary of Physical Interactions
    Summary of Genetic Interactions
    Interaction Browsers

    Please look at the allele data for full details of the genetic interactions
    Starting gene(s)
    Interaction type
    Interacting gene(s)
    Reference
    Starting gene(s)
    Interaction type
    Interacting gene(s)
    Reference
    External Data
    Subunit Structure (UniProtKB)
    Efficient DNA binding requires dimerization with another bHLH protein.
    (UniProt, P22816 )
    Linkouts
    BioGRID - A database of protein and genetic interactions.
    DroID - A comprehensive database of gene and protein interactions.
    MIST (protein-protein) - An integrated Molecular Interaction Database
    Pathways
    Signaling Pathways (FlyBase)
    Metabolic Pathways
    FlyBase
    External Links
    External Data
    Linkouts
    Reactome - An open-source, open access, manually curated and peer-reviewed pathway database.
    Class of Gene
    Genomic Location and Detailed Mapping Data
    Chromosome (arm)
    3R
    Recombination map
    3-80
    Cytogenetic map
    Sequence location
    FlyBase Computed Cytological Location
    Cytogenetic map
    Evidence for location
    95A10-95B1
    Limits computationally determined from genome sequence between P{PZ}l(3)0690606906 and P{PZ}l(3)0468404684
    Experimentally Determined Cytological Location
    Cytogenetic map
    Notes
    References
    95A1-95A10
    (determined by in situ hybridisation)
    95A-95A
    (determined by in situ hybridisation)
    95A-95B
    (determined by in situ hybridisation)
    Experimentally Determined Recombination Data
    Location
    Left of (cM)
    Right of (cM)
    Notes
    Stocks and Reagents
    Stocks (13)
    Genomic Clones (24)
    cDNA Clones (15)
     

    Please Note This section lists cDNAs and ESTs that fall within the genomic extent of the gene model, which may include cDNAs and ESTs of genes within introns, or of overlapping genes. Please see JBrowse for alignment of the cDNAs and ESTs to the gene model.

    cDNA clones, fully sequenced
    BDGP DGC clones
    Other clones
      Drosophila Genomics Resource Center cDNA clones

      For each fully sequenced cDNA the DGRC maintains various forms of the cDNA (e.g tagged or untagged) in several different host vectors for subsequent cloning and expression in Drosophila and Drosophila cell lines.

      cDNA Clones, End Sequenced (ESTs)
      BDGP DGC clones
        RNAi and Array Information
        Linkouts
        DRSC - Results frm RNAi screens
        Antibody Information
        Laboratory Generated Antibodies
         
        Commercially Available Antibodies
         
        Cell Line Information
        Publicly Available Cell Lines
         
          Other Stable Cell Lines
           
            Other Comments

            Embryos lacking both maternal and zygotic nau function lack a distinct set of muscle fibres. This muscle loss is tolerated, however, such that the loss of both maternal and zygotic nau function does not result in lethality at any stage of development.

            nau has a crucial role in embryonic muscle formation.

            nau is not required for the formation of muscle precursors, but rather plays a role in the differentiation of a subset of muscle precursors into mature muscle fibres in the developing embryo.

            Ectopic expression of nau in developing cardioblasts suggests nau induces myogenic differentiation in vivo and may affect the myogenic differentiation program of specific muscle fibres.

            wg acts as an inductive signal that influences the expression of nau in the ventral mesoderm.

            nau RNA expression during development has been studied.

            Abnormalities in clusters of nau-producing cells appear in the neurogenic mutants, E(spl), N, Dl, bib, neu, mam, and amx, nau and βTub85D, during stage 11, the time of first detectable nau expression. The clusters arise at the correct developmental time and position but contain more cells than wild type clusters. This causes strong ectodermal abnormalities and has effects on mesoderm development.

            Ecol\lacZ reporter gene constructs have demonstrated that the nau gene product is a transiently expressed nuclear protein that serves as a marker for myogenic precursor cells, or a subset of them, in development. These cells may represent muscle founder cells that are crucial in organising and establishing the precise muscle pattern in each segment of the body plan. nau is a distinctive gene and is not a member of a multigene family.

            Isolated from a genomic library at low stringency using a mouse MyoD and rat myogenin probe.

            nau has been cloned and sequenced and the pattern of RNA expression analysed.

            Relationship to Other Genes
            Source for database merge of
            Additional comments
            Nomenclature History
            Source for database identify of
            Nomenclature comments
            Etymology
            Synonyms and Secondary IDs (11)
            Reported As
            Secondary FlyBase IDs
              Datasets (0)
              Study focus (0)
              Experimental Role
              Project
              Project Type
              Title
              Study result (0)
              Result
              Result Type
              Title
              External Crossreferences and Linkouts ( 40 )
              Sequence Crossreferences
              NCBI Gene - Gene integrates information from a wide range of species. A record may include nomenclature, Reference Sequences (RefSeqs), maps, pathways, variations, phenotypes, and links to genome-, phenotype-, and locus-specific resources worldwide.
              GenBank Nucleotide - A collection of sequences from several sources, including GenBank, RefSeq, TPA, and PDB.
              GenBank Protein - A collection of sequences from several sources, including translations from annotated coding regions in GenBank, RefSeq and TPA, as well as records from SwissProt, PIR, PRF, and PDB.
              RefSeq - A comprehensive, integrated, non-redundant, well-annotated set of reference sequences including genomic, transcript, and protein.
              UniProt/GCRP - The gene-centric reference proteome (GCRP) provides a 1:1 mapping between genes and UniProt accessions in which a single 'canonical' isoform represents the product(s) of each protein-coding gene.
              UniProt/Swiss-Prot - Manually annotated and reviewed records of protein sequence and functional information
              UniProt/TrEMBL - Automatically annotated and unreviewed records of protein sequence and functional information
              Other crossreferences
              AlphaFold DB - AlphaFold provides open access to protein structure predictions for the human proteome and other key proteins of interest, to accelerate scientific research.
              BDGP expression data - Patterns of gene expression in Drosophila embryogenesis
              DRscDB - A single-cell RNA-seq resource for data mining and data comparison across species
              EMBL-EBI Single Cell Expression Atlas - Single cell expression across species
              FlyAtlas2 - A Drosophila melanogaster expression atlas with RNA-Seq, miRNA-Seq and sex-specific data
              FlyMine - An integrated database for Drosophila genomics
              KEGG Genes - Molecular building blocks of life in the genomic space.
              MARRVEL_MODEL - MARRVEL (model organism gene)
              Linkouts
              BioGRID - A database of protein and genetic interactions.
              Drosophila Genomics Resource Center - Drosophila Genomics Resource Center (DGRC) cDNA clones
              DroID - A comprehensive database of gene and protein interactions.
              DRSC - Results frm RNAi screens
              Eukaryotic Promoter Database - A collection of databases of experimentally validated promoters for selected model organisms.
              FlyAtlas - Adult expression by tissue, using Affymetrix Dros2 array
              FlyCyc Genes - Genes from a BioCyc PGDB for Dmel
              Flygut - An atlas of the Drosophila adult midgut
              FlyMet - A comprehensive tissue-specific metabolomics resource for Drosophila.
              iBeetle-Base - RNAi phenotypes in the red flour beetle (Tribolium castaneum)
              Interactive Fly - A cyberspace guide to Drosophila development and metazoan evolution
              MIST (protein-protein) - An integrated Molecular Interaction Database
              Reactome - An open-source, open access, manually curated and peer-reviewed pathway database.
              References (167)