Open Close
General Information
Symbol
Dmel\Gsc
Species
D. melanogaster
Name
Goosecoid
Annotation Symbol
CG2851
Feature Type
FlyBase ID
FBgn0010323
Gene Model Status
Stock Availability
Gene Snapshot
Goosecoid (Gsc) encodes a homeoprotein that shares DNA binding sequence specificity with the products of bcd and oc. It regulates cell fate in the brain and stomatogastric nervous system. [Date last reviewed: 2019-03-07]
Also Known As

D-gsc, 60Mun2, l(2)05341, Muenster 72

Key Links
Genomic Location
Cytogenetic map
Sequence location
2L:583,540..594,685 [-]
Recombination map

2-1.5

RefSeq locus
NT_033779 REGION:583540..594685
Sequence
Other Genome Views
The following external sites may use different assemblies or annotations than FlyBase.
Function
GO Summary Ribbons
Protein Family (UniProt)
Belongs to the paired homeobox family. Bicoid subfamily. (P54366)
Summaries
Gene Group (FlyBase)
PAIRED-LIKE HOMEOBOX TRANSCRIPTION FACTORS -
Paired-like (PRD-like) homeobox transcription factors are sequence-specific DNA binding proteins that regulate transcription. PRD-like transcription factors lack a paired domain but possess a homeodomain characteristic of the paired homeobox transcription factors. Some members also possess an Engrailed Homology 1 (EH1) and/or an OAR motif near the C-terminus. (Adapted from FBrf0232555 and FBrf0106517).
Protein Function (UniProtKB)
Appears to regulate regional development of specific tissues. Can rescue axis polarity in UV-radiated Xenopus embryos.
(UniProt, P54366)
Summary (Interactive Fly)

transcription factor - homeodomain - paired-like - regulates cell fate in the brain and stomatogastric nervous system

Gene Model and Products
Number of Transcripts
3
Number of Unique Polypeptides
3

Please see the GBrowse view of Dmel\Gsc or the JBrowse view of Dmel\Gsc for information on other features

To submit a correction to a gene model please use the Contact FlyBase form

Protein Domains (via Pfam)
Isoform displayed:
Pfam protein domains
InterPro name
classification
start
end
Protein Domains (via SMART)
Isoform displayed:
SMART protein domains
InterPro name
classification
start
end
Comments on Gene Model

Unconventional translation start (AUU) postulated; FlyBase analysis.

gene_with_non_canonical_start_codon ; SO:0001739

Gene model reviewed during 5.52

Low-frequency RNA-Seq exon junction(s) not annotated.

Gene model reviewed during 6.03

Sequence Ontology: Class of Gene
Transcript Data
Annotated Transcripts
Name
FlyBase ID
RefSeq ID
Length (nt)
Assoc. CDS (aa)
FBtr0343220
2362
415
FBtr0343221
2362
473
FBtr0347398
1124
107
Additional Transcript Data and Comments
Reported size (kB)

2.4 (longest cDNA)

Comments
External Data
Crossreferences
Polypeptide Data
Annotated Polypeptides
Name
FlyBase ID
Predicted MW (kDa)
Length (aa)
Theoretical pI
RefSeq ID
GenBank
FBpp0309909
44.5
415
8.52
FBpp0309910
50.2
473
9.04
FBpp0312549
12.0
107
9.99
Polypeptides with Identical Sequences

None of the polypeptides share 100% sequence identity.

Additional Polypeptide Data and Comments
Reported size (kDa)
Comments
External Data
Crossreferences
InterPro - A database of protein families, domains and functional sites
Linkouts
Sequences Consistent with the Gene Model
Mapped Features

Click to get a list of regulatory features (enhancers, TFBS, etc.) and gene disruptions (point mutations, indels, etc.) within or overlapping Dmel\Gsc using the Feature Mapper tool.

External Data
Crossreferences
Eukaryotic Promoter Database - A collection of databases of experimentally validated promoters for selected model organisms.
Linkouts
Gene Ontology (8 terms)
Molecular Function (5 terms)
Terms Based on Experimental Evidence (3 terms)
CV Term
Evidence
References
Terms Based on Predictions or Assertions (2 terms)
CV Term
Evidence
References
inferred from biological aspect of ancestor with PANTHER:PTN002518769
(assigned by GO_Central )
inferred from biological aspect of ancestor with PANTHER:PTN002518769
(assigned by GO_Central )
Biological Process (2 terms)
Terms Based on Experimental Evidence (0 terms)
Terms Based on Predictions or Assertions (2 terms)
CV Term
Evidence
References
Cellular Component (1 term)
Terms Based on Experimental Evidence (0 terms)
Terms Based on Predictions or Assertions (1 term)
CV Term
Evidence
References
inferred by curator from GO:0001227
inferred from biological aspect of ancestor with PANTHER:PTN002518769
(assigned by GO_Central )
Expression Data
Expression Summary Ribbons
Colored tiles in ribbon indicate that expression data has been curated by FlyBase for that anatomical location. Colorless tiles indicate that there is no curated data for that location.
For complete stage-specific expression data, view the modENCODE Development RNA-Seq section under High-Throughput Expression below.
Transcript Expression
in situ
Stage
Tissue/Position (including subcellular localization)
Reference
organism | faint | ubiquitous

Comment: maternally deposited

Additional Descriptive Data

Gsc transcript is expressed in two domains during embryogenesis. At stage 4, Gsc transcript is expressed as a narrow dorsoanterior stripe, which assumes a horseshoe shape by stage 6. The horseshoe splits dorsally at stage 8, and by stage 14, cells expressing Gsc transcript are located in the brain hemispheres. The second domain of expression begins at stage 9 at the anterior tip of the embryo, in cells which invaginate into the stomodeum. These cells abut the anterior midgut region at the end of germband retraction, and might contribute to the anterior foregut, the ring gland, and the stomodeal nervous system.

Gsc transcripts are first detected in a horseshoe pattern in the procephalic neurogenic region of blastoderm embryos. Later these cells delaminate in two groups and form a ventro-lateral sector of each brain hemisphere. At stage 10, a new expression domain appears in the foregut anlage and shortly after marks the stomodeum. This expression domain gives rise to a specific region of the foregut at the junction of the pharynx and esophagus, a part of the stomatogastric nervous system, and a part of the ring gland.

Marker for
 
Subcellular Localization
CV Term
Polypeptide Expression
Additional Descriptive Data
Marker for
 
Subcellular Localization
CV Term
Evidence
References
Expression Deduced from Reporters
High-Throughput Expression Data
Associated Tools

GBrowse - Visual display of RNA-Seq signals

View Dmel\Gsc in GBrowse 2
RNA-Seq by Region - Search RNA-Seq expression levels by exon or genomic region
Reference
See Gelbart and Emmert, 2013 for analysis details and data files for all genes.
Developmental Proteome: Life Cycle
Developmental Proteome: Embryogenesis
External Data and Images
Linkouts
BDGP expression data - Patterns of gene expression in Drosophila embryogenesis
FLIGHT - Cell culture data for RNAi and other high-throughput technologies
FlyAtlas - Adult expression by tissue, using Affymetrix Dros2 array
Flygut - An atlas of the Drosophila adult midgut
Images
Alleles, Insertions, and Transgenic Constructs
Classical and Insertion Alleles ( 14 )
For All Classical and Insertion Alleles Show
 
Other relevant insertions
Transgenic Constructs ( 10 )
For All Alleles Carried on Transgenic Constructs Show
Transgenic constructs containing/affecting coding region of Gsc
Transgenic constructs containing regulatory region of Gsc
Deletions and Duplications ( 8 )
Phenotypes
Orthologs
Human Orthologs (via DIOPT v7.1)
Homo sapiens (Human) (16)
Species\Gene Symbol
Score
Best Score
Best Reverse Score
Alignment
Complementation?
Transgene?
6 of 15
Yes
Yes
4 of 15
No
Yes
1 of 15
No
Yes
1 of 15
No
No
1 of 15
No
No
1 of 15
No
No
 
1 of 15
No
No
1 of 15
No
No
 
1 of 15
No
No
 
1 of 15
No
No
1 of 15
No
No
1 of 15
No
No
1 of 15
No
No
 
1 of 15
No
No
1 of 15
No
No
1 of 15
No
No
 
Model Organism Orthologs (via DIOPT v7.1)
Mus musculus (laboratory mouse) (14)
Species\Gene Symbol
Score
Best Score
Best Reverse Score
Alignment
Complementation?
Transgene?
5 of 15
Yes
Yes
5 of 15
Yes
Yes
1 of 15
No
Yes
1 of 15
No
No
1 of 15
No
No
1 of 15
No
No
1 of 15
No
No
 
1 of 15
No
No
1 of 15
No
No
1 of 15
No
No
1 of 15
No
No
1 of 15
No
No
1 of 15
No
No
1 of 15
No
No
Rattus norvegicus (Norway rat) (2)
5 of 13
Yes
Yes
3 of 13
No
Yes
Xenopus tropicalis (Western clawed frog) (7)
5 of 12
Yes
Yes
1 of 12
No
No
1 of 12
No
No
1 of 12
No
No
1 of 12
No
No
1 of 12
No
No
1 of 12
No
No
Danio rerio (Zebrafish) (16)
6 of 15
Yes
Yes
1 of 15
No
Yes
1 of 15
No
Yes
1 of 15
No
Yes
1 of 15
No
No
1 of 15
No
No
1 of 15
No
No
1 of 15
No
No
1 of 15
No
No
1 of 15
No
No
1 of 15
No
No
1 of 15
No
No
1 of 15
No
No
1 of 15
No
No
1 of 15
No
No
1 of 15
No
No
Caenorhabditis elegans (Nematode, roundworm) (6)
6 of 15
Yes
Yes
1 of 15
No
No
1 of 15
No
Yes
1 of 15
No
No
1 of 15
No
No
1 of 15
No
No
Arabidopsis thaliana (thale-cress) (0)
No records found.
Saccharomyces cerevisiae (Brewer's yeast) (3)
1 of 15
Yes
No
1 of 15
Yes
No
1 of 15
Yes
Yes
Schizosaccharomyces pombe (Fission yeast) (2)
1 of 12
Yes
No
1 of 12
Yes
No
Orthologs in Drosophila Species (via OrthoDB v9.1) ( EOG09190I5F )
Organism
Common Name
Gene
AAA Syntenic Ortholog
Multiple Dmel Genes in this Orthologous Group
Drosophila melanogaster
fruit fly
Drosophila suzukii
Spotted wing Drosophila
Drosophila simulans
Drosophila sechellia
Drosophila erecta
Drosophila yakuba
Drosophila ananassae
Drosophila pseudoobscura pseudoobscura
Drosophila persimilis
Drosophila willistoni
Drosophila virilis
Drosophila mojavensis
Drosophila grimshawi
Drosophila grimshawi
Orthologs in non-Drosophila Dipterans (via OrthoDB v9.1) ( EOG09150DLK )
Organism
Common Name
Gene
Multiple Dmel Genes in this Orthologous Group
Musca domestica
House fly
Glossina morsitans
Tsetse fly
Lucilia cuprina
Australian sheep blowfly
Orthologs in non-Dipteran Insects (via OrthoDB v9.1) ( EOG090W0A1F )
Organism
Common Name
Gene
Multiple Dmel Genes in this Orthologous Group
Bombyx mori
Silkmoth
Danaus plexippus
Monarch butterfly
Heliconius melpomene
Postman butterfly
Apis florea
Little honeybee
Apis mellifera
Western honey bee
Bombus impatiens
Common eastern bumble bee
Bombus terrestris
Buff-tailed bumblebee
Linepithema humile
Argentine ant
Megachile rotundata
Alfalfa leafcutting bee
Nasonia vitripennis
Parasitic wasp
Dendroctonus ponderosae
Mountain pine beetle
Pediculus humanus
Human body louse
Rhodnius prolixus
Kissing bug
Cimex lectularius
Bed bug
Acyrthosiphon pisum
Pea aphid
Zootermopsis nevadensis
Nevada dampwood termite
Orthologs in non-Insect Arthropods (via OrthoDB v9.1) ( EOG090X09X4 )
Organism
Common Name
Gene
Multiple Dmel Genes in this Orthologous Group
Strigamia maritima
European centipede
Strigamia maritima
European centipede
Ixodes scapularis
Black-legged tick
Tetranychus urticae
Two-spotted spider mite
Daphnia pulex
Water flea
Orthologs in non-Arthropod Metazoa (via OrthoDB v9.1) ( EOG091G0OQ6 )
Organism
Common Name
Gene
Multiple Dmel Genes in this Orthologous Group
Strongylocentrotus purpuratus
Purple sea urchin
Ciona intestinalis
Vase tunicate
Ciona intestinalis
Vase tunicate
Gallus gallus
Domestic chicken
Gallus gallus
Domestic chicken
Paralogs
Paralogs (via DIOPT v7.1)
Drosophila melanogaster (Fruit fly) (23)
2 of 10
2 of 10
1 of 10
1 of 10
1 of 10
1 of 10
1 of 10
1 of 10
1 of 10
1 of 10
1 of 10
1 of 10
1 of 10
1 of 10
1 of 10
1 of 10
1 of 10
1 of 10
1 of 10
1 of 10
1 of 10
1 of 10
1 of 10
Human Disease Associations
FlyBase Human Disease Model Reports
    Disease Model Summary Ribbon
    Disease Ontology (DO) Annotations
    Models Based on Experimental Evidence ( 0 )
    Allele
    Disease
    Evidence
    References
    Potential Models Based on Orthology ( 0 )
    Human Ortholog
    Disease
    Evidence
    References
    Modifiers Based on Experimental Evidence ( 0 )
    Allele
    Disease
    Interaction
    References
    Disease Associations of Human Orthologs (via DIOPT v7.1 and OMIM)
    Note that ortholog calls supported by only 1 or 2 algorithms (DIOPT score < 3) are not shown.
    Functional Complementation Data
    Functional complementation data is computed by FlyBase using a combination of the orthology data obtained from DIOPT and OrthoDB and the allele-level genetic interaction data curated from the literature.
    Interactions
    Summary of Physical Interactions
    esyN Network Diagram
    Show neighbor-neighbor interactions:
    Select Layout:
    Legend:
    Protein
    RNA
    Selected Interactor(s)
    Interactions Browser

    Please see the Physical Interaction reports below for full details
    protein-protein
    Physical Interaction
    Assay
    References
    Summary of Genetic Interactions
    esyN Network Diagram
    Starting gene(s)
    Interaction type
    Interacting gene(s)
    Reference
    Starting gene(s)
    Interaction type
    Interacting gene(s)
    Reference
    External Data
    Linkouts
    BioGRID - A database of protein and genetic interactions.
    DroID - A comprehensive database of gene and protein interactions.
    InterologFinder - Protein-protein interactions (PPI) from both known and predicted PPI data sets.
    MIST (protein-protein) - An integrated Molecular Interaction Database
    Pathways
    Signaling Pathways (FlyBase)
    Metabolic Pathways
    External Data
    Linkouts
    Genomic Location and Detailed Mapping Data
    Chromosome (arm)
    2L
    Recombination map

    2-1.5

    Cytogenetic map
    Sequence location
    2L:583,540..594,685 [-]
    FlyBase Computed Cytological Location
    Cytogenetic map
    Evidence for location
    21E2-21E2
    Limits computationally determined from genome sequence between P{lacW}l(2)k09610k09610&P{lacW}Sk09538a and P{PZ}l(2)1068510685&P{lacW}Tango14k00619
    Experimentally Determined Cytological Location
    Cytogenetic map
    Notes
    References
    21C7-21D1
    (determined by in situ hybridisation)
    21D1-21D2
    (determined by in situ hybridisation)
    21C5-21C6
    (determined by in situ hybridisation)
    Gsc and Pph13 map within 10kb of each other. The cytological location has been previously, and erroneously, reported as 60C (FBrf0065373).
    Experimentally Determined Recombination Data
    Location
    Left of (cM)
    Right of (cM)
    Notes
    Stocks and Reagents
    Stocks (13)
    Genomic Clones (18)
     

    Please Note FlyBase no longer curates genomic clone accessions so this list may not be complete

    cDNA Clones (3)
     

    Please Note This section lists cDNAs and ESTs that fall within the genomic extent of the gene model, which may include cDNAs and ESTs of genes within introns, or of overlapping genes. Please see GBrowse for alignment of the cDNAs and ESTs to the gene model.

    cDNA clones, fully sequences
    BDGP DGC clones
      Other clones
      Drosophila Genomics Resource Center cDNA clones

      For each fully sequenced cDNA the DGRC maintains various forms of the cDNA (e.g tagged or untagged) in several different host vectors for subsequent cloning and expression in Drosophila and Drosophila cell lines.

      cDNA Clones, End Sequenced (ESTs)
      BDGP DGC clones
        Other clones
          RNAi and Array Information
          Linkouts
          DRSC - Results frm RNAi screens
          GenomeRNAi - A database for cell-based and in vivo RNAi phenotypes and reagents
          Antibody Information
          Laboratory Generated Antibodies
           
          Commercially Available Antibodies
           
          Other Information
          Relationship to Other Genes
          Source for database identify of

          Source for identity of: Gsc CG2851

          Source for database merge of
          Additional comments

          Gsc can substitute for Xenopus goosecoid in a functional assay.

          Gsc can mimic Xenopus Goosecoid function by rescuing axis formation when injected into Xenopus embryos.

          Pph13 and Gsc are closely linked.

          Other Comments

          Gsc protein behaves as a transcriptional repressor, acting through specific palindromic homeodomain sites (P3K sites). Gsc protein is a "passive repressor" of activator homeoproteins binding to the same sites and an "active repressor" of activators binding to distinct sites. Gsc protein is also able to strongly repress transcription activated by Paired-class homeoproteins through P3K sites, via specific protein-protein interactions. This "interactive repression" requires the conserved GEH/eh-1 domain.

          The sequence used to clone Gsc as reported in FBrf0087230 was '60Mun2', not Pph13 (60Mun1).

          '60Mun1' as stated in FBrf0087230 should actually have read '60Mun2' (see FBrf0100819).

          Gsc expression is controlled by dpp, oc, slp1/slp2 and tll.

          Gsc is required for the birth, survival or invagination (detachment from the foregut anlage) of EG1 precursor cells.

          Mutant analysis suggests that Gsc protein regulates regional development of specific tissues.

          Both Gsc and Xenopus Goosecoid are expressed in cells undergoing morphogenetic movements. Gsc acts as an active repressor of transcription.

          Gsc gene product contains an otd-like homeodomain (oc class) and has an RNA expression pattern that overlaps with that of otd.

          Origin and Etymology
          Discoverer
          Etymology
          Identification
          External Crossreferences and Linkouts ( 48 )
          Sequence Crossreferences
          NCBI Gene - Gene integrates information from a wide range of species. A record may include nomenclature, Reference Sequences (RefSeqs), maps, pathways, variations, phenotypes, and links to genome-, phenotype-, and locus-specific resources worldwide.
          GenBank Nucleotide - A collection of sequences from several sources, including GenBank, RefSeq, TPA, and PDB.
          GenBank Protein - A collection of sequences from several sources, including translations from annotated coding regions in GenBank, RefSeq and TPA, as well as records from SwissProt, PIR, PRF, and PDB.
          RefSeq - A comprehensive, integrated, non-redundant, well-annotated set of reference sequences including genomic, transcript, and protein.
          UniProt/Swiss-Prot - Manually annotated and reviewed records of protein sequence and functional information
          UniProt/TrEMBL - Automatically annotated and unreviewed records of protein sequence and functional information
          Other crossreferences
          BDGP expression data - Patterns of gene expression in Drosophila embryogenesis
          Drosophila Genomics Resource Center - Drosophila Genomics Resource Center (DGRC) cDNA clones
          Eukaryotic Promoter Database - A collection of databases of experimentally validated promoters for selected model organisms.
          Flygut - An atlas of the Drosophila adult midgut
          GenomeRNAi - A database for cell-based and in vivo RNAi phenotypes and reagents
          InterPro - A database of protein families, domains and functional sites
          KEGG Genes - Molecular building blocks of life in the genomic space.
          modMine - A data warehouse for the modENCODE project
          Linkouts
          BioGRID - A database of protein and genetic interactions.
          DroID - A comprehensive database of gene and protein interactions.
          DRSC - Results frm RNAi screens
          FLIGHT - Cell culture data for RNAi and other high-throughput technologies
          FlyAtlas - Adult expression by tissue, using Affymetrix Dros2 array
          FlyCyc Genes - Genes from a BioCyc PGDB for Dmel
          FlyMine - An integrated database for Drosophila genomics
          Interactive Fly - A cyberspace guide to Drosophila development and metazoan evolution
          InterologFinder - Protein-protein interactions (PPI) from both known and predicted PPI data sets.
          MIST (protein-protein) - An integrated Molecular Interaction Database
          Synonyms and Secondary IDs (15)
          Datasets (0)
          Study focus (0)
          Experimental Role
          Project
          Project Type
          Title
          References (85)