FB2026_01 , released March 12, 2026
FB2026_01 , released March 12, 2026
Gene: Dmel\ara
Open Close
General Information
Symbol
Dmel\ara
Species
D. melanogaster
Name
araucan
Annotation Symbol
CG10571
Feature Type
FlyBase ID
FBgn0015904
Gene Model Status
Stock Availability
Gene Summary
araucan (ara) encodes a homeodomain-containing protein of the TALE subfamily and one of the three components of the Iroquois gene complex. ara product acts as a transcriptional regulator to control territorial and cell fate specification, cell sorting behaviour and pattern formation. It also plays a transcription independent cell-autonomous role in cell proliferation control. [Date last reviewed: 2018-09-20] (FlyBase Gene Snapshot)
Also Known As

iro, iroquois, Iro-C, IroC

Key Links
Genomic Location
Cytogenetic map
Sequence location
Recombination map
3-38
RefSeq locus
NT_037436 REGION:12580672..12596018
Sequence
Genomic Maps
Other Genome Views
The following external sites may use different assemblies or annotations than FlyBase.
Function
Gene Ontology (GO) Annotations (19 terms)
Molecular Function (3 terms)
Terms Based on Experimental Evidence (1 term)
CV Term
Evidence
References
Terms Based on Predictions or Assertions (3 terms)
CV Term
Evidence
References
Biological Process (14 terms)
Terms Based on Experimental Evidence (8 terms)
CV Term
Evidence
References
inferred from mutant phenotype
inferred from mutant phenotype
inferred from genetic interaction with FLYBASE:caup; FB:FBgn0015919
inferred from mutant phenotype
inferred from mutant phenotype
Terms Based on Predictions or Assertions (6 terms)
CV Term
Evidence
References
involved_in cell development
inferred from biological aspect of ancestor with PANTHER:PTN002257217
traceable author statement
inferred from biological aspect of ancestor with PANTHER:PTN000129978
Cellular Component (2 terms)
Terms Based on Experimental Evidence (1 term)
CV Term
Evidence
References
located_in cytoplasm
inferred from high throughput direct assay
Terms Based on Predictions or Assertions (1 term)
CV Term
Evidence
References
is_active_in nucleus
inferred from biological aspect of ancestor with PANTHER:PTN002257217
located_in nucleus
inferred from sequence or structural similarity with UniProtKB:P40424
inferred by curator from GO:0000981
inferred from electronic annotation with InterPro:IPR003893
Protein Family (UniProt)
Belongs to the TALE/IRO homeobox family. (Q24248)
Summaries
Gene Snapshot
araucan (ara) encodes a homeodomain-containing protein of the TALE subfamily and one of the three components of the Iroquois gene complex. ara product acts as a transcriptional regulator to control territorial and cell fate specification, cell sorting behaviour and pattern formation. It also plays a transcription independent cell-autonomous role in cell proliferation control. [Date last reviewed: 2018-09-20]
Gene Group (FlyBase)
TALE HOMEOBOX TRANSCRIPTION FACTORS -
TALE homeobox transcription factors are sequence-specific DNA binding proteins that regulate transcription. These proteins are characterized by an atypical homeodomain that possesses a three amino acid insertion in the first loop. (Adapted from FBrf0232555 and PMID:18797923).
Protein Function (UniProtKB)
Controls proneural and vein forming genes. Positive transcriptional controller of AC-SC (achaete-scute). May act as an activator that interacts with the transcriptional complex assembled on the AC and SC promoters and participates in transcription initiation.
(UniProt, Q24248)
Summary (Interactive Fly)

transcription factor in the Iroquois Complex- homeodomain Pbx class - a prepattern gene that determines the size, shape and number of sensory organ mother cells in imaginal discs - required in the dorsal mesoderm to ensure normal heart development

Gene Model and Products
Number of Transcripts
2
Number of Unique Polypeptides
2

Please see the JBrowse view of Dmel\ara for information on other features

To submit a correction to a gene model please use the Contact FlyBase form

Protein Domains (via Pfam)
Isoform displayed:
Pfam protein domains
InterPro name
classification
start
end
Protein Domains (via SMART)
Isoform displayed:
SMART protein domains
InterPro name
classification
start
end
Structure
Protein 3D structure   (Predicted by AlphaFold)   (AlphaFold entry Q24248)

If you don't see a structure in the viewer, refresh your browser.
Model Confidence:
  • Very high (pLDDT > 90)
  • Confident (90 > pLDDT > 70)
  • Low (70 > pLDDT > 50)
  • Very low (pLDDT < 50)

AlphaFold produces a per-residue confidence score (pLDDT) between 0 and 100. Some regions with low pLDDT may be unstructured in isolation.

Experimentally Determined Structures
Crossreferences
Comments on Gene Model

Gene model reviewed during 5.45

Transcript Data
Annotated Transcripts
Name
FlyBase ID
RefSeq ID
Length (nt)
Assoc. CDS (aa)
FBtr0075908
2942
717
FBtr0300551
2918
709
Additional Transcript Data and Comments
Reported size (kB)

3.0, 1.8, 0.6 (northern blot)

Comments
External Data
Crossreferences
Polypeptide Data
Annotated Polypeptides
Name
FlyBase ID
Predicted MW (kDa)
Length (aa)
Theoretical pI
UniProt
RefSeq ID
GenBank
FBpp0075640
75.4
717
6.39
FBpp0289778
74.7
709
6.39
Polypeptides with Identical Sequences

None of the polypeptides share 100% sequence identity.

Additional Polypeptide Data and Comments
Reported size (kDa)
Comments
External Data
Crossreferences
Linkouts
Sequences Consistent with the Gene Model
Mapped Features

Click to get a list of regulatory features (enhancers, TFBS, etc.) and gene disruptions (point mutations, indels, etc.) within or overlapping Dmel\ara using the Feature Mapper tool.

External Data
Crossreferences
Eukaryotic Promoter Database - A collection of databases of experimentally validated promoters for selected model organisms.
Linkouts
Expression Data
Testis-specificity index

The testis specificity index was calculated from modENCODE tissue expression data by Vedelek et al., 2018 to indicate the degree of testis enrichment compared to other tissues. Scores range from -2.52 (underrepresented) to 5.2 (very high testis bias).

-0.71

Transcript Expression
in situ
Stage
Tissue/Position (including subcellular localization)
Reference
northern blot
Stage
Tissue/Position (including subcellular localization)
Reference
Additional Descriptive Data

At stage 10 of embryogenesis, ara mRNA transcripts are detected in a similar pattern albeit at a lower level compared to the mesodermal expression of caup mRNA transcripts.

Marker for
 
Subcellular Localization
CV Term
Polypeptide Expression
immunolocalization
Stage
Tissue/Position (including subcellular localization)
Reference
Additional Descriptive Data

Mirzoyan and Pandur (FBrf0222881) use an antibody which detects both ara and caup proteins, which have similar expression patterns. During stage 13, single ara/caup-expressing cells form alongside the row of Mef2-positive myocardial cells, and are between the eve-positive pericardial cells by stage 15.

Using an anti-caup antibody that recognizes both ara and caup, expression is detected in groups of cells in the presumptive visceral trunk mesoderm at early embryonic stage 11. By mid stage 11, they are expressed at the same dorsoventral level in the visceral mesoderm and the in the dorso-lateral ectoderm. Expression declines in the visceral mesoderm at late stage 11 and then is detected in the somatic mesoderm in promuscular clusters from which a subset of muscle progenitors expressing ara and caup will segregate in stage 12. Expression is maintained in sibling muscle founder myoblasts derived from araFBtr0091350:pb-XRcaup-expressing progenitors and in the muscles they give rise to, namely LT1-4, dorsal transverse 1, and segment border muscle. ara and caup muscle expression was compared that of several muscle identity genes.

Marker for
 
Subcellular Localization
CV Term
Evidence
References
located_in cytoplasm
inferred from high throughput direct assay
Expression Deduced from Reporters
High-Throughput Expression Data
Associated Tools

JBrowse - Visual display of RNA-Seq signals

View Dmel\ara in JBrowse
RNA-Seq by Region - Search RNA-Seq expression levels by exon or genomic region
Reference
See Gelbart and Emmert, 2013 for analysis details and data files for all genes.
Developmental Proteome: Life Cycle
Developmental Proteome: Embryogenesis
External Data and Images
Linkouts
BDGP expression data - Patterns of gene expression in Drosophila embryogenesis
DRscDB - A single-cell RNA-seq resource for data mining and data comparison across species
EMBL-EBI Single Cell Expression Atlas - Single cell expression across species
FlyAtlas - Adult expression by tissue, using Affymetrix Dros2 array
FlyAtlas2 - A Drosophila melanogaster expression atlas with RNA-Seq, miRNA-Seq and sex-specific data
Fly-FISH - A database of Drosophila embryo and larvae mRNA localization patterns
Flygut - An atlas of the Drosophila adult midgut
Images
FlyExpress - Embryonic expression images (BDGP data)
  • Stages(s) 4-6
  • Stages(s) 7-8
  • Stages(s) 9-10
  • Stages(s) 11-12
  • Stages(s) 13-16
Alleles, Insertions, Transgenic Constructs, and Aberrations
Classical and Insertion Alleles ( 25 )
For All Classical and Insertion Alleles Show
 
Other relevant insertions
Transgenic Constructs ( 14 )
For All Alleles Carried on Transgenic Constructs Show
Transgenic constructs containing/affecting coding region of ara
Transgenic constructs containing regulatory region of ara
Aberrations (Deficiencies and Duplications) ( 11 )
Variants
Variant Molecular Consequences
Alleles Representing Disease-Implicated Variants
Phenotypes
For more details about a specific phenotype click on the relevant allele symbol.
Lethality
Allele
Sterility
Allele
Other Phenotypes
Allele
Phenotype manifest in
Allele
macrochaeta & adult thorax
macrochaeta & scutum
scutum & microchaeta | medial, with Scer\GAL4sca-537.4
sense organ & wing | anterior, with Scer\GAL4C-765
Orthologs
Human Orthologs (via DIOPT v9.1)
Species\Gene Symbol
Score
Best Score
Best Reverse Score
Alignment
Complementation?
Transgene?
Homo sapiens (Human) (27)
9 of 14
Yes
Yes
9 of 14
Yes
Yes
6 of 14
No
Yes
5 of 14
No
Yes
4 of 14
No
Yes
4 of 14
No
Yes
2 of 14
No
No
2 of 14
No
No
1  
2 of 14
No
No
2 of 14
No
No
1  
2 of 14
No
No
2 of 14
No
No
1 of 14
No
No
1 of 14
No
No
1 of 14
No
No
1  
1 of 14
No
No
1 of 14
No
No
1 of 14
No
No
1 of 14
No
No
1  
1 of 14
No
No
1 of 14
No
No
1 of 14
No
No
1 of 14
No
No
2  
1 of 14
No
No
2  
1 of 14
No
No
1  
1 of 14
No
No
2  
1 of 14
No
No
Model Organism Orthologs (via DIOPT v9.1)
Species\Gene Symbol
Score
Best Score
Best Reverse Score
Alignment
Complementation?
Transgene?
Rattus norvegicus (Norway rat) (18)
9 of 14
Yes
Yes
9 of 14
Yes
Yes
6 of 14
No
Yes
4 of 14
No
No
3 of 14
No
No
2 of 14
No
Yes
2 of 14
No
No
2 of 14
No
No
2 of 14
No
No
2 of 14
No
No
2 of 14
No
No
2 of 14
No
No
1 of 14
No
No
1 of 14
No
Yes
1 of 14
No
No
1 of 14
No
No
1 of 14
No
No
1 of 14
No
No
Mus musculus (laboratory mouse) (20)
8 of 14
Yes
Yes
8 of 14
Yes
Yes
6 of 14
No
Yes
5 of 14
No
Yes
5 of 14
No
Yes
5 of 14
No
Yes
2 of 14
No
No
2 of 14
No
No
2 of 14
No
No
2 of 14
No
No
2 of 14
No
No
2 of 14
No
No
2 of 14
No
No
1 of 14
No
No
1 of 14
No
Yes
1 of 14
No
No
1 of 14
No
No
1 of 14
No
No
1 of 14
No
No
1 of 14
No
No
Xenopus tropicalis (Western clawed frog) (15)
7 of 13
Yes
No
6 of 13
No
Yes
6 of 13
No
Yes
3 of 13
No
No
2 of 13
No
No
1 of 13
No
Yes
1 of 13
No
No
1 of 13
No
No
1 of 13
No
No
1 of 13
No
No
1 of 13
No
No
1 of 13
No
No
1 of 13
No
Yes
1 of 13
No
No
1 of 13
No
No
Danio rerio (Zebrafish) (26)
9 of 14
Yes
Yes
8 of 14
No
Yes
7 of 14
No
Yes
7 of 14
No
No
6 of 14
No
Yes
6 of 14
No
Yes
6 of 14
No
Yes
5 of 14
No
Yes
4 of 14
No
Yes
3 of 14
No
Yes
3 of 14
No
No
2 of 14
No
No
2 of 14
No
No
2 of 14
No
No
2 of 14
No
No
2 of 14
No
No
2 of 14
No
No
1 of 14
No
No
1 of 14
No
No
1 of 14
No
No
1 of 14
No
No
1 of 14
No
No
1 of 14
No
No
1 of 14
No
No
1 of 14
No
No
1 of 14
No
No
Caenorhabditis elegans (Nematode, roundworm) (2)
6 of 14
Yes
No
1 of 14
No
No
Anopheles gambiae (African malaria mosquito) (9)
11 of 12
Yes
Yes
1 of 12
No
No
Arabidopsis thaliana (thale-cress) (21)
3 of 13
Yes
No
2 of 13
No
No
2 of 13
No
No
2 of 13
No
No
2 of 13
No
No
2 of 13
No
No
2 of 13
No
No
2 of 13
No
No
2 of 13
No
No
1 of 13
No
No
1 of 13
No
No
1 of 13
No
No
1 of 13
No
No
1 of 13
No
No
1 of 13
No
No
1 of 13
No
No
1 of 13
No
No
1 of 13
No
No
1 of 13
No
No
1 of 13
No
No
1 of 13
No
No
Saccharomyces cerevisiae (Brewer's yeast) (5)
2 of 13
Yes
No
1 of 13
No
No
1 of 13
No
No
1 of 13
No
No
1 of 13
No
No
Schizosaccharomyces pombe (Fission yeast) (0)
Escherichia coli (enterobacterium) (0)
Other Organism Orthologs (via OrthoDB)
Data provided directly from OrthoDB:ara. Refer to their site for version information.
Paralogs
Paralogs (via DIOPT v9.1)
Drosophila melanogaster (Fruit fly) (8)
11 of 13
10 of 13
3 of 13
2 of 13
1 of 13
1 of 13
1 of 13
1 of 13
Human Disease Associations
FlyBase Human Disease Model Reports
    Disease Ontology (DO) Annotations
    Models Based on Experimental Evidence ( 0 )
    Allele
    Disease
    Evidence
    References
    Potential Models Based on Orthology ( 0 )
    Human Ortholog
    Disease
    Evidence
    References
    Modifiers Based on Experimental Evidence ( 0 )
    Allele
    Disease
    Interaction
    References
    Disease Associations of Human Orthologs (via DIOPT v9.1 and OMIM)
    Note that ortholog calls supported by only 1 or 2 algorithms (DIOPT score < 3) are not shown.
    Functional Complementation Data
    Functional complementation data is computed by FlyBase using a combination of the orthology data obtained from DIOPT and OrthoDB and the allele-level genetic interaction data curated from the literature.
    Interactions
    Summary of Physical Interactions
    Summary of Genetic Interactions
    Interaction Browsers

    Please look at the allele data for full details of the genetic interactions
    Starting gene(s)
    Interaction type
    Interacting gene(s)
    Reference
    Starting gene(s)
    Interaction type
    Interacting gene(s)
    Reference
    External Data
    Linkouts
    BioGRID - A database of protein and genetic interactions.
    DroID - A comprehensive database of gene and protein interactions.
    MIST (protein-protein) - An integrated Molecular Interaction Database
    Pathways
    Signaling Pathways (FlyBase)
    Metabolic Pathways
    FlyBase
    External Links
    External Data
    Linkouts
    Class of Gene
    Genomic Location and Detailed Mapping Data
    Chromosome (arm)
    3L
    Recombination map
    3-38
    Cytogenetic map
    Sequence location
    FlyBase Computed Cytological Location
    Cytogenetic map
    Evidence for location
    69C8-69C10
    Limits computationally determined from genome sequence between P{PZ}l(3)0508805088 and P{PZ}l(3)0692406924
    Experimentally Determined Cytological Location
    Cytogenetic map
    Notes
    References
    69D-69D
    (determined by in situ hybridisation)
    Distance between ara and mirr is less than 50kb.
    Experimentally Determined Recombination Data
    Location
    Right of (cM)
    Notes

    Can be 1.5% recombination between ara and mirr, with none between ara and caup.

    Stocks and Reagents
    Stocks (51)
    Genomic Clones (33)
    cDNA Clones (24)
     

    Please Note This section lists cDNAs and ESTs that fall within the genomic extent of the gene model, which may include cDNAs and ESTs of genes within introns, or of overlapping genes. Please see JBrowse for alignment of the cDNAs and ESTs to the gene model.

    cDNA clones, fully sequenced
    BDGP DGC clones
    Other clones
      Drosophila Genomics Resource Center cDNA clones

      For each fully sequenced cDNA the DGRC maintains various forms of the cDNA (e.g tagged or untagged) in several different host vectors for subsequent cloning and expression in Drosophila and Drosophila cell lines.

      cDNA Clones, End Sequenced (ESTs)
      BDGP DGC clones
      RNAi and Array Information
      Linkouts
      DRSC - Results frm RNAi screens
      Antibody Information
      Laboratory Generated Antibodies
       
      Commercially Available Antibodies
       
      Cell Line Information
      Publicly Available Cell Lines
       
        Other Stable Cell Lines
         
          Other Comments

          The acquisition of lateral transverse muscle fate requires the repression mediated by ara/caup of other muscle identity genes (such as slou and vg).

          RNAi generated by PCR using primers directed to this gene causes a cell growth and viability phenotype when assayed in Kc167 and S2R+ cells.

          Iroquois complex genes (ara, caup and mirr) are required for the specification of dorsal head structures.

          The localised expression of the three genes of the iroquois complex - IRO-C (ara, caup and mirr) specifies the identity of dorsal cells in the eye. Juxtaposition of IRO-C-expressing and non-expressing cells forms a straight border that promotes growth and serves as a pattern-organising centre in the eye disc.

          The genes of the Iroquois complex (ara, caup and mirr) are necessary for notum specification. The Iro-C genes establish a signalling system that appears to organize development in the notum and dorsal wing hinge (tegula and sclerite) territories.

          When the Iroquois complex is absent, notum cells are transformed into hinge (tegula and sclerite) cells.

          A member of the IRO-C (Iroquois complex).

          The genes of the IRO-C are coordinately regulated and implicated in similar developmental processes. They are regulated in the eye by the Pc-group and trx-group genes, but not by classical modifiers of position effect variegation. Ventral silencing of the whole IRO-C in the eye occurs at the level of chromatin structure in a manner similar to that of the homeotic gene complexes, perhaps by local compaction of the region into a heterochromatin-like structure involving Pc-group products.

          The ara gene maps to the Iroquois complex and contributes to the prepattern that regulates expression of ac-sc complex genes and rho. ara is a positive, direct regulator of the ac-sc genes.

          Gene titration method has been used to search for genes involved in the determination of sense organs. A mutation of ara and caup (formerly iro) demonstrates an interaction with Df(4)M101-62f, a chromosome known to alter the development of the PNS.

          In mutants for the Iroquois gene complex, two lateral bands of the notum are completely devoid of sense organs.

          Relationship to Other Genes
          Source for database merge of
          Additional comments
          Nomenclature History
          Source for database identify of

          Source for identity of: ara CG10571

          Nomenclature comments
          Etymology

          The "ara" gene is named after the Araucanian American-Indian tribe.

          The 'Iroquois' gene complex is named after the native American Iroquois tribe, also called Mohawks, which shaved all but a medial stripe of hairs on the head.

          "Iroquois" is the name given to the gene complex by analogy with the American tribe that have a haircut ascribed to the Mohawk nation.

          Synonyms and Secondary IDs (18)
          Reported As
          Symbol Synonym
          Secondary FlyBase IDs
          • FBgn0014195
          Datasets (0)
          Study focus (0)
          Experimental Role
          Project
          Project Type
          Title
          Study result (0)
          Result
          Result Type
          Title
          External Crossreferences and Linkouts ( 45 )
          Sequence Crossreferences
          NCBI Gene - Gene integrates information from a wide range of species. A record may include nomenclature, Reference Sequences (RefSeqs), maps, pathways, variations, phenotypes, and links to genome-, phenotype-, and locus-specific resources worldwide.
          GenBank Nucleotide - A collection of sequences from several sources, including GenBank, RefSeq, TPA, and PDB.
          GenBank Protein - A collection of sequences from several sources, including translations from annotated coding regions in GenBank, RefSeq and TPA, as well as records from SwissProt, PIR, PRF, and PDB.
          RefSeq - A comprehensive, integrated, non-redundant, well-annotated set of reference sequences including genomic, transcript, and protein.
          UniProt/GCRP - The gene-centric reference proteome (GCRP) provides a 1:1 mapping between genes and UniProt accessions in which a single 'canonical' isoform represents the product(s) of each protein-coding gene.
          UniProt/Swiss-Prot - Manually annotated and reviewed records of protein sequence and functional information
          UniProt/TrEMBL - Automatically annotated and unreviewed records of protein sequence and functional information
          Other crossreferences
          AlphaFold DB - AlphaFold provides open access to protein structure predictions for the human proteome and other key proteins of interest, to accelerate scientific research.
          BDGP expression data - Patterns of gene expression in Drosophila embryogenesis
          DRscDB - A single-cell RNA-seq resource for data mining and data comparison across species
          EMBL-EBI Single Cell Expression Atlas - Single cell expression across species
          FlyAtlas2 - A Drosophila melanogaster expression atlas with RNA-Seq, miRNA-Seq and sex-specific data
          FlyMine - An integrated database for Drosophila genomics
          KEGG Genes - Molecular building blocks of life in the genomic space.
          MARRVEL_MODEL - MARRVEL (model organism gene)
          Linkouts
          BioGRID - A database of protein and genetic interactions.
          Drosophila Genomics Resource Center - Drosophila Genomics Resource Center (DGRC) cDNA clones
          DroID - A comprehensive database of gene and protein interactions.
          DRSC - Results frm RNAi screens
          Eukaryotic Promoter Database - A collection of databases of experimentally validated promoters for selected model organisms.
          FlyAtlas - Adult expression by tissue, using Affymetrix Dros2 array
          FlyCyc Genes - Genes from a BioCyc PGDB for Dmel
          Fly-FISH - A database of Drosophila embryo and larvae mRNA localization patterns
          Flygut - An atlas of the Drosophila adult midgut
          iBeetle-Base - RNAi phenotypes in the red flour beetle (Tribolium castaneum)
          Interactive Fly - A cyberspace guide to Drosophila development and metazoan evolution
          MIST (protein-protein) - An integrated Molecular Interaction Database
          References (243)