Open Close
General Information
Symbol
Dmel\caz
Species
D. melanogaster
Name
cabeza
Annotation Symbol
CG3606
Feature Type
FlyBase ID
FBgn0285954
Gene Model Status
Stock Availability
Gene Snapshot
cabeza (caz) encodes a chromatin binding protein involved in locomotion, synaptic growth at the neuromuscular junction and eye development. [Date last reviewed: 2019-03-28]
Also Known As
dFUS, SARFH, P19, Sarcoma-associated RNA-binding fly homolog
Key Links
Genomic Location
Cytogenetic map
Sequence location
X:16,288,810..16,293,520 [+]
Recombination map
1-53
Sequence
Other Genome Views
The following external sites may use different assemblies or annotations than FlyBase.
Function
GO Summary Ribbons
Gene Group (FlyBase)
Protein Family (UniProt)
Belongs to the RRM TET family. (Q27294)
Molecular Function (GO)
[Detailed GO annotations]
Experimental Evidence
Predictions / Assertions
Summaries
Gene Group (FlyBase)
TRANSCRIPTION FACTOR II D -
The general transcription factor complex Transcription factor II D (TFIID) plays an important role in recruiting the transcription machinery to core promoters. TFIID is a complex composed of TBP and several TBP-associated factors. (Adapted from FBrf0125116 and FBrf0156065).
Protein Function (UniProtKB)
May participate in a function common to the expression of most genes transcribed by RNA polymerase II.
(UniProt, Q27294)
Gene Model and Products
Number of Transcripts
4
Number of Unique Polypeptides
4

Please see the GBrowse view of Dmel\caz or the JBrowse view of Dmel\caz for information on other features

To submit a correction to a gene model please use the Contact FlyBase form

Protein Domains (via Pfam)
Isoform displayed:
Pfam protein domains
InterPro name
classification
start
end
Protein Domains (via SMART)
Isoform displayed:
SMART protein domains
InterPro name
classification
start
end
Comments on Gene Model
Variable use of small exon; supported combination results in frameshift and premature stop in downstream exon.
Gene model reviewed during 5.47
Multiphase exon postulated: exon reading frame differs in alternative transcripts; overlap >20aa.
Sequence Ontology: Class of Gene
Transcript Data
Annotated Transcripts
Name
FlyBase ID
RefSeq ID
Length (nt)
Assoc. CDS (aa)
FBtr0074217
1666
399
FBtr0334990
1621
384
FBtr0334991
1534
355
FBtr0334992
1631
124
Additional Transcript Data and Comments
Reported size (kB)
Comments
External Data
Crossreferences
Polypeptide Data
Annotated Polypeptides
Name
FlyBase ID
Predicted MW (kDa)
Length (aa)
Theoretical pI
RefSeq ID
GenBank
FBpp0073996
38.8
399
9.72
FBpp0307000
37.3
384
9.78
FBpp0307001
34.3
355
9.78
FBpp0307002
13.7
124
11.71
Polypeptides with Identical Sequences

None of the polypeptides share 100% sequence identity.

Additional Polypeptide Data and Comments
Reported size (kDa)
Comments
External Data
Linkouts
Sequences Consistent with the Gene Model
Mapped Features

Click to get a list of regulatory features (enhancers, TFBS, etc.) and gene disruptions (point mutations, indels, etc.) within or overlapping Dmel\caz using the Feature Mapper tool.

External Data
Crossreferences
Linkouts
Gene Ontology (16 terms)
Molecular Function (3 terms)
Terms Based on Experimental Evidence (1 term)
CV Term
Evidence
References
inferred from direct assay
Terms Based on Predictions or Assertions (2 terms)
CV Term
Evidence
References
inferred from sequence or structural similarity
inferred from electronic annotation with InterPro:IPR034870
(assigned by InterPro )
Biological Process (8 terms)
Terms Based on Experimental Evidence (5 terms)
CV Term
Evidence
References
Terms Based on Predictions or Assertions (3 terms)
CV Term
Evidence
References
Cellular Component (5 terms)
Terms Based on Experimental Evidence (4 terms)
CV Term
Evidence
References
inferred from high throughput direct assay
inferred from direct assay
Terms Based on Predictions or Assertions (2 terms)
CV Term
Evidence
References
inferred from biological aspect of ancestor with PANTHER:PTN000579998
(assigned by GO_Central )
inferred from sequence or structural similarity with HGNC:11545
Expression Data
Expression Summary Ribbons
Colored tiles in ribbon indicate that expression data has been curated by FlyBase for that anatomical location. Colorless tiles indicate that there is no curated data for that location.
For complete stage-specific expression data, view the modENCODE Development RNA-Seq section under High-Throughput Expression below.
Transcript Expression
in situ
Stage
Tissue/Position (including subcellular localization)
Reference
organism

Comment: maternally deposited

Additional Descriptive Data
Marker for
 
Subcellular Localization
CV Term
Polypeptide Expression
immunolocalization
Stage
Tissue/Position (including subcellular localization)
Reference
mass spectroscopy
Stage
Tissue/Position (including subcellular localization)
Reference
Additional Descriptive Data
Marker for
 
Subcellular Localization
CV Term
Evidence
References
inferred from high throughput direct assay
inferred from direct assay
Expression Deduced from Reporters
High-Throughput Expression Data
Associated Tools

GBrowse - Visual display of RNA-Seq signals

View Dmel\caz in GBrowse 2
RNA-Seq by Region - Search RNA-Seq expression levels by exon or genomic region
Reference
See Gelbart and Emmert, 2013 for analysis details and data files for all genes.
Developmental Proteome: Life Cycle
Developmental Proteome: Embryogenesis
External Data and Images
Linkouts
FlyAtlas - Adult expression by tissue, using Affymetrix Dros2 array
Images
Alleles, Insertions, and Transgenic Constructs
Classical and Insertion Alleles ( 10 )
For All Classical and Insertion Alleles Show
 
Other relevant insertions
Transgenic Constructs ( 27 )
For All Alleles Carried on Transgenic Constructs Show
Transgenic constructs containing/affecting coding region of caz
Transgenic constructs containing regulatory region of caz
Deletions and Duplications ( 8 )
Phenotypes
For more details about a specific phenotype click on the relevant allele symbol.
Lethality
Allele
Other Phenotypes
Allele
Phenotype manifest in
Allele
Orthologs
Human Orthologs (via DIOPT v7.1)
Homo sapiens (Human) (15)
Species\Gene Symbol
Score
Best Score
Best Reverse Score
Alignment
Complementation?
Transgene?
10 of 15
Yes
Yes
 
 
9 of 15
No
Yes
 
7 of 15
No
Yes
 
1 of 15
No
No
1 of 15
No
Yes
1 of 15
No
No
1 of 15
No
Yes
1 of 15
No
Yes
1 of 15
No
Yes
1 of 15
No
Yes
1 of 15
No
Yes
1 of 15
No
Yes
1 of 15
No
Yes
1 of 15
No
Yes
1 of 15
No
Yes
Model Organism Orthologs (via DIOPT v7.1)
Mus musculus (laboratory mouse) (9)
Species\Gene Symbol
Score
Best Score
Best Reverse Score
Alignment
Complementation?
Transgene?
12 of 15
Yes
Yes
8 of 15
No
Yes
7 of 15
No
Yes
1 of 15
No
No
1 of 15
No
Yes
1 of 15
No
No
1 of 15
No
Yes
1 of 15
No
No
1 of 15
No
No
Rattus norvegicus (Norway rat) (11)
10 of 13
Yes
Yes
4 of 13
No
Yes
3 of 13
No
Yes
3 of 13
No
Yes
3 of 13
No
Yes
1 of 13
No
Yes
1 of 13
No
Yes
1 of 13
No
Yes
1 of 13
No
No
1 of 13
No
Yes
1 of 13
No
No
Xenopus tropicalis (Western clawed frog) (6)
7 of 12
Yes
Yes
7 of 12
Yes
Yes
2 of 12
No
Yes
1 of 12
No
Yes
1 of 12
No
No
1 of 12
No
Yes
Danio rerio (Zebrafish) (6)
9 of 15
Yes
Yes
9 of 15
Yes
Yes
9 of 15
Yes
Yes
6 of 15
No
Yes
1 of 15
No
No
1 of 15
No
No
Caenorhabditis elegans (Nematode, roundworm) (4)
6 of 15
Yes
Yes
1 of 15
No
No
1 of 15
No
No
1 of 15
No
Yes
Arabidopsis thaliana (thale-cress) (7)
4 of 9
Yes
Yes
2 of 9
No
Yes
1 of 9
No
Yes
1 of 9
No
No
1 of 9
No
Yes
1 of 9
No
No
1 of 9
No
No
Saccharomyces cerevisiae (Brewer's yeast) (0)
No records found.
Schizosaccharomyces pombe (Fission yeast) (0)
No records found.
Orthologs in Drosophila Species (via OrthoDB v9.1) ( EOG09190GR8 )
Organism
Common Name
Gene
AAA Syntenic Ortholog
Multiple Dmel Genes in this Orthologous Group
Drosophila melanogaster
fruit fly
Drosophila suzukii
Spotted wing Drosophila
Drosophila sechellia
Drosophila erecta
Drosophila ananassae
Drosophila pseudoobscura pseudoobscura
Drosophila persimilis
Drosophila willistoni
Drosophila virilis
Drosophila mojavensis
Drosophila grimshawi
Orthologs in non-Drosophila Dipterans (via OrthoDB v9.1) ( EOG09150DSR )
Organism
Common Name
Gene
Multiple Dmel Genes in this Orthologous Group
Musca domestica
House fly
Glossina morsitans
Tsetse fly
Lucilia cuprina
Australian sheep blowfly
Mayetiola destructor
Hessian fly
Aedes aegypti
Yellow fever mosquito
Aedes aegypti
Yellow fever mosquito
Anopheles darlingi
American malaria mosquito
Anopheles gambiae
Malaria mosquito
Culex quinquefasciatus
Southern house mosquito
Orthologs in non-Dipteran Insects (via OrthoDB v9.1) ( EOG090W0JKA )
Organism
Common Name
Gene
Multiple Dmel Genes in this Orthologous Group
Bombyx mori
Silkmoth
Danaus plexippus
Monarch butterfly
Apis florea
Little honeybee
Apis mellifera
Western honey bee
Bombus impatiens
Common eastern bumble bee
Bombus terrestris
Buff-tailed bumblebee
Linepithema humile
Argentine ant
Megachile rotundata
Alfalfa leafcutting bee
Nasonia vitripennis
Parasitic wasp
Dendroctonus ponderosae
Mountain pine beetle
Tribolium castaneum
Red flour beetle
Pediculus humanus
Human body louse
Rhodnius prolixus
Kissing bug
Cimex lectularius
Bed bug
Acyrthosiphon pisum
Pea aphid
Zootermopsis nevadensis
Nevada dampwood termite
Orthologs in non-Insect Arthropods (via OrthoDB v9.1) ( EOG090X0IPT )
Organism
Common Name
Gene
Multiple Dmel Genes in this Orthologous Group
Strigamia maritima
European centipede
Ixodes scapularis
Black-legged tick
Stegodyphus mimosarum
African social velvet spider
Tetranychus urticae
Two-spotted spider mite
Daphnia pulex
Water flea
Orthologs in non-Arthropod Metazoa (via OrthoDB v9.1) ( EOG091G0U8W )
Organism
Common Name
Gene
Multiple Dmel Genes in this Orthologous Group
Strongylocentrotus purpuratus
Purple sea urchin
Strongylocentrotus purpuratus
Purple sea urchin
Ciona intestinalis
Vase tunicate
Gallus gallus
Domestic chicken
Gallus gallus
Domestic chicken
Paralogs
Paralogs (via DIOPT v7.1)
Drosophila melanogaster (Fruit fly) (2)
4 of 10
1 of 10
Human Disease Associations
FlyBase Human Disease Model Reports
Disease Model Summary Ribbon
Disease Ontology (DO) Annotations
Models Based on Experimental Evidence ( 13 )
Potential Models Based on Orthology ( 4 )
Modifiers Based on Experimental Evidence ( 7 )
Comments on Models/Modifiers Based on Experimental Evidence ( 1 )
 
The Q349X mutation of caz is analogous to the R495X mutation of Hsap\FUS that frequently occurs in both familial and sporadic cases of amyotrophic lateral sclerosis and frontotemporal dementia, but almost no neurodegenerative phenotype is observed when this mutation is introduced into flies.
Disease Associations of Human Orthologs (via DIOPT v7.1 and OMIM)
Note that ortholog calls supported by only 1 or 2 algorithms (DIOPT score < 3) are not shown.
Functional Complementation Data
Functional complementation data is computed by FlyBase using a combination of the orthology data obtained from DIOPT and OrthoDB and the allele-level genetic interaction data curated from the literature.
Dmel gene
Ortholog showing functional complementation
Supporting References
Interactions
Summary of Physical Interactions
Summary of Genetic Interactions
esyN Network Diagram
esyN Network Key:
Suppression
Enhancement

Please look at the allele data for full details of the genetic interactions
Starting gene(s)
Interaction type
Interacting gene(s)
Reference
Starting gene(s)
Interaction type
Interacting gene(s)
Reference
External Data
Linkouts
InterologFinder - Protein-protein interactions (PPI) from both known and predicted PPI data sets.
Pathways
Genomic Location and Detailed Mapping Data
Chromosome (arm)
X
Recombination map
1-53
Cytogenetic map
Sequence location
X:16,288,810..16,293,520 [+]
FlyBase Computed Cytological Location
Cytogenetic map
Evidence for location
14B8-14B12
Left limit from inclusion within Df(1)4b18 (FBrf0083559) Right limit from non-inclusion within Df(1)l32 (FBrf0083559)
Experimentally Determined Cytological Location
Cytogenetic map
Notes
References
14B-14B
(determined by in situ hybridisation)
Experimentally Determined Recombination Data
Left of (cM)
Right of (cM)
Notes
Stocks and Reagents
Stocks (9)
Genomic Clones (19)
 
cDNA Clones (42)
 

Please Note This section lists cDNAs and ESTs that fall within the genomic extent of the gene model, which may include cDNAs and ESTs of genes within introns, or of overlapping genes. Please see GBrowse for alignment of the cDNAs and ESTs to the gene model.

cDNA clones, fully sequences
BDGP DGC clones
Other clones
    Drosophila Genomics Resource Center cDNA clones

    For each fully sequenced cDNA the DGRC maintains various forms of the cDNA (e.g tagged or untagged) in several different host vectors for subsequent cloning and expression in Drosophila and Drosophila cell lines.

      cDNA Clones, End Sequenced (ESTs)
      RNAi and Array Information
      Linkouts
      Antibody Information
      Laboratory Generated Antibodies
       
      Commercially Available Antibodies
       
      Other Information
      Relationship to Other Genes
      Source for database identify of
      Source for database merge of
      Source for merge of: caz BcDNA:GM09207
      Source for merge of: caz l(1)5g11
      Additional comments
      Source for merge of caz BcDNA:GM09207 was a shared cDNA ( date:030728 ).
      Other Comments
      Synaptic transmission at the larval neuromuscular junction is severely impaired in caz mutant flies.
      caz has been cloned and sequenced, and its expression pattern has been analysed.
      caz gene product contains a glycine rich C-terminal domain, but only one RNA recognition motif (RRM) located in the central portion. Intron and exon borders of full length cDNA and genomic clones have been determined and the predicted protein has 403 amino acids and is 48.4kD. UV crosslinking experiments demonstrate general RNA binding activity for the full length protein and the RRM domain.
      Isolated from a pupal cDNA library, using a fragment of the fs(1)h coding region containing "pen" repetitive sequences (GGN, where N is any nucleotide) as the probe.
      Origin and Etymology
      Discoverer
      Etymology
      Identification
      External Crossreferences and Linkouts ( 51 )
      Sequence Crossreferences
      NCBI Gene - Gene integrates information from a wide range of species. A record may include nomenclature, Reference Sequences (RefSeqs), maps, pathways, variations, phenotypes, and links to genome-, phenotype-, and locus-specific resources worldwide.
      GenBank Nucleotide - A collection of sequences from several sources, including GenBank, RefSeq, TPA, and PDB.
      GenBank Protein - A collection of sequences from several sources, including translations from annotated coding regions in GenBank, RefSeq and TPA, as well as records from SwissProt, PIR, PRF, and PDB.
      RefSeq - A comprehensive, integrated, non-redundant, well-annotated set of reference sequences including genomic, transcript, and protein.
      UniProt/Swiss-Prot - Manually annotated and reviewed records of protein sequence and functional information
      UniProt/TrEMBL - Automatically annotated and unreviewed records of protein sequence and functional information
      Linkouts
      FlyAtlas - Adult expression by tissue, using Affymetrix Dros2 array
      InterologFinder - Protein-protein interactions (PPI) from both known and predicted PPI data sets.
      Synonyms and Secondary IDs (21)
      Reported As
      Secondary FlyBase IDs
      • FBgn0011571
      • FBgn0003059
      • FBgn0011830
      • FBgn0062479
      • FBgn0019764
      Datasets (0)
      Study focus (0)
      Experimental Role
      Project
      Project Type
      Title
      References (97)