FB2026_01 , released March 12, 2026
FB2026_01 , released March 12, 2026
Gene: Dmel\shep
Open Close
General Information
Symbol
Dmel\shep
Species
D. melanogaster
Name
alan shepard
Annotation Symbol
CG32423
Feature Type
FlyBase ID
FBgn0052423
Gene Model Status
Stock Availability
Gene Summary
alan shepard (shep) encodes an evolutionarily conserved RNA/DNA binding protein that regulates alternative splicing and gypsy insulator activities. It regulates neural development during the embryonic and larval stages, and neuronal remodeling during metamorphosis. [Date last reviewed: 2019-03-14] (FlyBase Gene Snapshot)
Also Known As

anon- EST:Posey83

Key Links
Genomic Location
Cytogenetic map
Sequence location
Recombination map
3-13
RefSeq locus
NT_037436 REGION:5155821..5277944
Sequence
Genomic Maps
Other Genome Views
The following external sites may use different assemblies or annotations than FlyBase.
Function
Gene Ontology (GO) Annotations (16 terms)
Molecular Function (6 terms)
Terms Based on Experimental Evidence (0 terms)
Terms Based on Predictions or Assertions (6 terms)
CV Term
Evidence
References
inferred from biological aspect of ancestor with PANTHER:PTN000610044
enables mRNA binding
inferred from sequence or structural similarity
inferred from electronic annotation with InterPro:IPR035979
inferred from biological aspect of ancestor with PANTHER:PTN000610044
inferred from biological aspect of ancestor with PANTHER:PTN000610044
enables RNA binding
inferred from electronic annotation with InterPro:IPR000504, InterPro:IPR002343
Biological Process (7 terms)
Terms Based on Experimental Evidence (7 terms)
CV Term
Evidence
References
inferred from mutant phenotype
inferred from mutant phenotype
involved_in gravitaxis
inferred from mutant phenotype
involved_in metamorphosis
inferred from mutant phenotype
involved_in neuron remodeling
inferred from mutant phenotype
inferred from high throughput mutant phenotype
inferred from mutant phenotype
Terms Based on Predictions or Assertions (0 terms)
Cellular Component (3 terms)
Terms Based on Experimental Evidence (1 term)
CV Term
Evidence
References
located_in cytosol
inferred from high throughput direct assay
Terms Based on Predictions or Assertions (3 terms)
CV Term
Evidence
References
is_active_in cytosol
inferred from biological aspect of ancestor with PANTHER:PTN000610044
is_active_in nucleus
inferred from biological aspect of ancestor with PANTHER:PTN000610044
inferred from electronic annotation with InterPro:IPR002343
inferred from biological aspect of ancestor with PANTHER:PTN002768817
Gene Group (FlyBase)
Protein Family (UniProt)
-
Summaries
Gene Snapshot
alan shepard (shep) encodes an evolutionarily conserved RNA/DNA binding protein that regulates alternative splicing and gypsy insulator activities. It regulates neural development during the embryonic and larval stages, and neuronal remodeling during metamorphosis. [Date last reviewed: 2019-03-14]
Protein Function (UniProtKB)
Has a role in the perception of gravity.
(UniProt, Q8MSV2)
Summary (Interactive Fly)

RNA-binding protein - regulates insulator activity - regulates neuronal remodeling during metamorphosis, regulates gravitaxis

Gene Model and Products
Number of Transcripts
8
Number of Unique Polypeptides
6

Please see the JBrowse view of Dmel\shep for information on other features

To submit a correction to a gene model please use the Contact FlyBase form

Protein Domains (via Pfam)
Isoform displayed:
Pfam protein domains
InterPro name
classification
start
end
Protein Domains (via SMART)
Isoform displayed:
SMART protein domains
InterPro name
classification
start
end
Structure
Protein 3D structure   (Predicted by AlphaFold)   (AlphaFold entry Q8MSV2)

If you don't see a structure in the viewer, refresh your browser.
Model Confidence:
  • Very high (pLDDT > 90)
  • Confident (90 > pLDDT > 70)
  • Low (70 > pLDDT > 50)
  • Very low (pLDDT < 50)

AlphaFold produces a per-residue confidence score (pLDDT) between 0 and 100. Some regions with low pLDDT may be unstructured in isolation.

Experimentally Determined Structures
Crossreferences
Comments on Gene Model

Stage-specific extension of 3' UTRs observed during embryogenesis (FBrf0215804); all variants may not be annotated.

Gene model reviewed during 5.40

Annotated transcripts do not represent all possible combinations of alternative exons and/or alternative promoters.

Low-frequency RNA-Seq exon junction(s) not annotated.

Gene model reviewed during 5.46

Tissue-specific extension of 3' UTRs observed during later stages (FBrf0218523, FBrf0219848); all variants may not be annotated

A non-AUG start codon may be used for translation of one or more transcripts of this gene; based on the presence of conserved protein signatures within the 5' UTR without an in-frame AUG (FBrf0243886).

Transcript Data
Annotated Transcripts
Name
FlyBase ID
RefSeq ID
Length (nt)
Assoc. CDS (aa)
FBtr0077172
3036
578
FBtr0077174
7989
499
FBtr0077173
4580
499
FBtr0301599
2776
379
FBtr0308859
5850
590
FBtr0333287
2541
379
FBtr0333288
7617
371
FBtr0333289
4661
377
Additional Transcript Data and Comments
Reported size (kB)
Comments
External Data
Crossreferences
Polypeptide Data
Annotated Polypeptides
Name
FlyBase ID
Predicted MW (kDa)
Length (aa)
Theoretical pI
UniProt
RefSeq ID
GenBank
FBpp0076875
60.7
578
9.32
FBpp0076877
53.0
499
8.47
FBpp0076876
53.0
499
8.47
FBpp0290814
41.3
379
7.90
FBpp0301014
62.2
590
9.20
FBpp0305482
41.3
379
7.90
FBpp0305483
40.3
371
7.90
FBpp0305484
41.0
377
7.90
Polypeptides with Identical Sequences

The group(s) of polypeptides indicated below share identical sequence to each other.

499 aa isoforms: shep-PB, shep-PD
379 aa isoforms: shep-PE, shep-PG
Additional Polypeptide Data and Comments
Reported size (kDa)
Comments
External Data
Linkouts
Sequences Consistent with the Gene Model
Nucleotide / Polypeptide Records
 
Mapped Features

Click to get a list of regulatory features (enhancers, TFBS, etc.) and gene disruptions (point mutations, indels, etc.) within or overlapping Dmel\shep using the Feature Mapper tool.

External Data
Crossreferences
Eukaryotic Promoter Database - A collection of databases of experimentally validated promoters for selected model organisms.
Linkouts
Expression Data
Testis-specificity index

The testis specificity index was calculated from modENCODE tissue expression data by Vedelek et al., 2018 to indicate the degree of testis enrichment compared to other tissues. Scores range from -2.52 (underrepresented) to 5.2 (very high testis bias).

-0.93

Transcript Expression
in situ
Stage
Tissue/Position (including subcellular localization)
Reference
organism

Comment: maternally deposited

organism

Comment: extended 3' UTR isoform

Additional Descriptive Data

Zygotic-specific isoforms of shep with long 3' UTR extensions were observed.

Expression pattern inferred from unspecified enhancer trap line.

Marker for
 
Subcellular Localization
CV Term
Polypeptide Expression
immunolocalization
Stage
Tissue/Position (including subcellular localization)
Reference
mass spectroscopy
Stage
Tissue/Position (including subcellular localization)
Reference
Additional Descriptive Data

High levels of shep protein are observed in neurons marked by elav and low levels are seen in glial cells marked by repo in third instar larvae. It is absent in neuroblasts marked by dpn.

shep protein is enriched in the embryonic CNS including the brain and ventral nerve cord, areas that are also positive for the neuron-specific protein elav. The overlap between shep and elav is partial in that shep is also expressed in glial cells. shep levels are low but detectable in non-CNS tissues

Marker for
 
Subcellular Localization
CV Term
Evidence
References
located_in cytosol
inferred from high throughput direct assay
Expression Deduced from Reporters
Reporter: P{GT1}shepBG01322
Stage
Tissue/Position (including subcellular localization)
Reference
Stage
Tissue/Position (including subcellular localization)
Reference
High-Throughput Expression Data
Associated Tools

JBrowse - Visual display of RNA-Seq signals

View Dmel\shep in JBrowse
RNA-Seq by Region - Search RNA-Seq expression levels by exon or genomic region
Reference
See Gelbart and Emmert, 2013 for analysis details and data files for all genes.
Developmental Proteome: Life Cycle
Developmental Proteome: Embryogenesis
External Data and Images
Linkouts
BDGP expression data - Patterns of gene expression in Drosophila embryogenesis
DRscDB - A single-cell RNA-seq resource for data mining and data comparison across species
EMBL-EBI Single Cell Expression Atlas - Single cell expression across species
FlyAtlas - Adult expression by tissue, using Affymetrix Dros2 array
FlyAtlas2 - A Drosophila melanogaster expression atlas with RNA-Seq, miRNA-Seq and sex-specific data
Fly-FISH - A database of Drosophila embryo and larvae mRNA localization patterns
Flygut - An atlas of the Drosophila adult midgut
Images
FlyExpress - Embryonic expression images (BDGP data)
  • Stages(s) 1-3
  • Stages(s) 4-6
  • Stages(s) 7-8
  • Stages(s) 9-10
  • Stages(s) 11-12
  • Stages(s) 13-16
Alleles, Insertions, Transgenic Constructs, and Aberrations
Classical and Insertion Alleles ( 53 )
For All Classical and Insertion Alleles Show
 
Other relevant insertions
Transgenic Constructs ( 12 )
For All Alleles Carried on Transgenic Constructs Show
Transgenic constructs containing/affecting coding region of shep
Transgenic constructs containing regulatory region of shep
Aberrations (Deficiencies and Duplications) ( 0 )
Inferred from experimentation ( 0 )
Inferred from location ( 8 )
Variants
Variant Molecular Consequences
Alleles Representing Disease-Implicated Variants
Phenotypes
For more details about a specific phenotype click on the relevant allele symbol.
Lethality
Allele
Sterility
Allele
Other Phenotypes
Allele
Phenotype manifest in
Allele
Orthologs
Human Orthologs (via DIOPT v9.1)
Species\Gene Symbol
Score
Best Score
Best Reverse Score
Alignment
Complementation?
Transgene?
Homo sapiens (Human) (34)
10 of 14
Yes
Yes
1  
9 of 14
No
Yes
9 of 14
No
Yes
2 of 14
No
No
3  
2 of 14
No
No
3  
2 of 14
No
No
2  
1 of 14
No
No
3  
1 of 14
No
No
1 of 14
No
No
1 of 14
No
No
1 of 14
No
No
1  
1 of 14
No
No
1 of 14
No
No
1 of 14
No
No
2  
1 of 14
No
No
1 of 14
No
No
1  
1 of 14
No
No
1 of 14
No
No
1 of 14
No
No
2  
1 of 14
No
No
1 of 14
No
No
1 of 14
No
No
1 of 14
No
No
1  
1 of 14
No
No
1  
1 of 14
No
No
1 of 14
No
No
1 of 14
No
No
1 of 14
No
No
1 of 14
No
No
1  
1 of 14
No
No
1 of 14
No
No
1 of 14
No
No
1  
1 of 14
No
No
1 of 14
No
No
1  
Model Organism Orthologs (via DIOPT v9.1)
Species\Gene Symbol
Score
Best Score
Best Reverse Score
Alignment
Complementation?
Transgene?
Rattus norvegicus (Norway rat) (15)
10 of 14
Yes
Yes
9 of 14
No
Yes
9 of 14
No
Yes
2 of 14
No
No
2 of 14
No
No
2 of 14
No
No
1 of 14
No
No
1 of 14
No
No
1 of 14
No
No
1 of 14
No
No
1 of 14
No
No
1 of 14
No
No
1 of 14
No
No
1 of 14
No
No
1 of 14
No
No
Mus musculus (laboratory mouse) (18)
10 of 14
Yes
Yes
9 of 14
No
Yes
9 of 14
No
Yes
2 of 14
No
No
2 of 14
No
No
1 of 14
No
No
1 of 14
No
No
1 of 14
No
No
1 of 14
No
No
1 of 14
No
No
1 of 14
No
No
1 of 14
No
No
1 of 14
No
No
1 of 14
No
No
1 of 14
No
No
1 of 14
No
No
1 of 14
No
No
1 of 14
No
No
Xenopus tropicalis (Western clawed frog) (21)
6 of 13
Yes
Yes
5 of 13
No
Yes
5 of 13
No
Yes
2 of 13
No
No
2 of 13
No
No
2 of 13
No
No
1 of 13
No
No
1 of 13
No
No
1 of 13
No
No
1 of 13
No
No
1 of 13
No
No
1 of 13
No
No
1 of 13
No
No
1 of 13
No
No
1 of 13
No
No
1 of 13
No
No
1 of 13
No
No
1 of 13
No
No
1 of 13
No
No
1 of 13
No
No
1 of 13
No
No
Danio rerio (Zebrafish) (20)
10 of 14
Yes
Yes
10 of 14
Yes
Yes
9 of 14
No
Yes
8 of 14
No
Yes
6 of 14
No
Yes
2 of 14
No
No
2 of 14
No
No
2 of 14
No
No
2 of 14
No
No
1 of 14
No
No
1 of 14
No
No
1 of 14
No
No
1 of 14
No
No
1 of 14
No
No
1 of 14
No
No
1 of 14
No
No
1 of 14
No
No
1 of 14
No
No
1 of 14
No
No
1 of 14
No
No
Caenorhabditis elegans (Nematode, roundworm) (14)
7 of 14
Yes
Yes
2 of 14
No
No
1 of 14
No
No
1 of 14
No
No
1 of 14
No
No
1 of 14
No
No
1 of 14
No
No
1 of 14
No
No
1 of 14
No
No
1 of 14
No
No
1 of 14
No
No
1 of 14
No
No
1 of 14
No
No
1 of 14
No
No
Anopheles gambiae (African malaria mosquito) (16)
10 of 12
Yes
Yes
Arabidopsis thaliana (thale-cress) (49)
2 of 13
Yes
No
2 of 13
Yes
No
2 of 13
Yes
No
2 of 13
Yes
No
2 of 13
Yes
No
2 of 13
Yes
No
1 of 13
No
No
1 of 13
No
No
1 of 13
No
No
1 of 13
No
No
1 of 13
No
No
1 of 13
No
No
1 of 13
No
No
1 of 13
No
No
1 of 13
No
No
1 of 13
No
No
1 of 13
No
No
1 of 13
No
No
1 of 13
No
No
1 of 13
No
No
1 of 13
No
Yes
1 of 13
No
No
1 of 13
No
No
1 of 13
No
No
1 of 13
No
No
1 of 13
No
No
1 of 13
No
No
1 of 13
No
No
1 of 13
No
No
1 of 13
No
No
1 of 13
No
No
1 of 13
No
No
1 of 13
No
No
1 of 13
No
No
1 of 13
No
No
1 of 13
No
No
1 of 13
No
No
1 of 13
No
No
1 of 13
No
No
1 of 13
No
No
1 of 13
No
No
1 of 13
No
No
1 of 13
No
No
1 of 13
No
No
1 of 13
No
No
1 of 13
No
No
1 of 13
No
No
1 of 13
No
No
1 of 13
No
No
Saccharomyces cerevisiae (Brewer's yeast) (6)
1 of 13
Yes
No
1 of 13
Yes
No
1 of 13
Yes
No
1 of 13
Yes
Yes
1 of 13
Yes
No
1 of 13
Yes
No
Schizosaccharomyces pombe (Fission yeast) (4)
7 of 12
Yes
Yes
1 of 12
No
No
1 of 12
No
No
1 of 12
No
No
Escherichia coli (enterobacterium) (0)
Other Organism Orthologs (via OrthoDB)
Data provided directly from OrthoDB:shep. Refer to their site for version information.
Paralogs
Paralogs (via DIOPT v9.1)
Drosophila melanogaster (Fruit fly) (29)
3 of 13
3 of 13
3 of 13
3 of 13
3 of 13
2 of 13
2 of 13
2 of 13
2 of 13
2 of 13
2 of 13
2 of 13
2 of 13
1 of 13
1 of 13
1 of 13
1 of 13
1 of 13
1 of 13
1 of 13
1 of 13
1 of 13
1 of 13
1 of 13
1 of 13
1 of 13
1 of 13
1 of 13
1 of 13
Human Disease Associations
FlyBase Human Disease Model Reports
    Disease Ontology (DO) Annotations
    Models Based on Experimental Evidence ( 0 )
    Allele
    Disease
    Evidence
    References
    Potential Models Based on Orthology ( 0 )
    Human Ortholog
    Disease
    Evidence
    References
    Modifiers Based on Experimental Evidence ( 1 )
    Disease Associations of Human Orthologs (via DIOPT v9.1 and OMIM)
    Note that ortholog calls supported by only 1 or 2 algorithms (DIOPT score < 3) are not shown.
    Functional Complementation Data
    Functional complementation data is computed by FlyBase using a combination of the orthology data obtained from DIOPT and OrthoDB and the allele-level genetic interaction data curated from the literature.
    Interactions
    Summary of Physical Interactions
    Interaction Browsers

    Please see the Physical Interaction reports below for full details
    RNA-protein
    Physical Interaction
    Assay
    References
    protein-protein
    Physical Interaction
    Assay
    References
    Summary of Genetic Interactions
    Interaction Browsers

    Please look at the allele data for full details of the genetic interactions
    Starting gene(s)
    Interaction type
    Interacting gene(s)
    Reference
    Starting gene(s)
    Interaction type
    Interacting gene(s)
    Reference
    External Data
    Linkouts
    BioGRID - A database of protein and genetic interactions.
    DroID - A comprehensive database of gene and protein interactions.
    MIST (protein-protein) - An integrated Molecular Interaction Database
    Pathways
    Signaling Pathways (FlyBase)
    Metabolic Pathways
    FlyBase
    External Links
    External Data
    Linkouts
    Class of Gene
    Genomic Location and Detailed Mapping Data
    Chromosome (arm)
    3L
    Recombination map
    3-13
    Cytogenetic map
    Sequence location
    FlyBase Computed Cytological Location
    Cytogenetic map
    Evidence for location
    64C7-64C11
    ; Limits computationally determined from genome sequence between P{PZ}l(3)rG166rG166 and P{PZ}sinu06524
    Experimentally Determined Cytological Location
    Cytogenetic map
    Notes
    References
    Experimentally Determined Recombination Data
    Location
    Left of (cM)
    Right of (cM)
    Notes
    Stocks and Reagents
    Stocks (110)
    Genomic Clones (94)
    cDNA Clones (138)
     

    Please Note This section lists cDNAs and ESTs that fall within the genomic extent of the gene model, which may include cDNAs and ESTs of genes within introns, or of overlapping genes. Please see JBrowse for alignment of the cDNAs and ESTs to the gene model.

    cDNA clones, fully sequenced
    BDGP DGC clones
    Drosophila Genomics Resource Center cDNA clones

    For each fully sequenced cDNA the DGRC maintains various forms of the cDNA (e.g tagged or untagged) in several different host vectors for subsequent cloning and expression in Drosophila and Drosophila cell lines.

    cDNA Clones, End Sequenced (ESTs)
    RNAi and Array Information
    Linkouts
    DRSC - Results frm RNAi screens
    Antibody Information
    Laboratory Generated Antibodies
    Commercially Available Antibodies
     
    Cell Line Information
    Publicly Available Cell Lines
     
      Other Stable Cell Lines
       
        Other Comments

        Identified as a candidate gene for hypoxia-specific selection (via an experimental evolution paradigm) that is also differentially expressed between control and hypoxia-adapted larvae.

        Nonsense-mediated mRNA decay (NMD) down-regulates a distinct splice isoform(s) of this gene.

        Area matching Drosophila EST AA949873.

        Relationship to Other Genes
        Source for database merge of

        Source for merge of: CG32423 anon- EST:Posey83

        Source for merge of: CG32423 BcDNA:RH63980

        Source for merge of: CG10668 CG10649 CG10647

        Source for merge of: CG32423 anon-WO0118547.198 anon-WO0172774.41

        Source for merge of: CG32423 BcDNA:LD40028

        Additional comments

        Annotations CG10668, CG10649 and CG10647 merged as CG32423 in release 3 of the genome annotation.

        Source for merge of CG32423 BcDNA:RH63980 was a shared cDNA ( date:030728 ).

        Source for merge of CG32423 anon-WO0118547.198 anon-WO0172774.41 was sequence comparison ( date:051113 ).

        Source for merge of CG32423 BcDNA:LD40028 was a shared cDNA ( date:030728 ).

        Nomenclature History
        Source for database identify of
        Nomenclature comments
        Etymology

        The full name of this gene, 'alan shepard' commemorates one astronaut. The symbol of this gene, 'shep' commemorates another, William 'Shep' Shepherd.

        The 'alan' gene is named after Alan Shepard, the first American in space.

        Synonyms and Secondary IDs (16)
        Reported As
        Symbol Synonym
        BcDNA:LD40028
        BcDNA:RH63980
        CG10647
        CG10649
        alan-shepard
        anon-EST:Posey83
        anon-WO0118547.198
        anon-WO0172774.41
        cg10668
        Secondary FlyBase IDs
        • FBgn0027668
        • FBgn0035595
        • FBgn0035597
        • FBgn0035599
        • FBgn0046389
        • FBgn0062176
        • FBgn0062425
        • FBgn0063142
        Datasets (0)
        Study focus (0)
        Experimental Role
        Project
        Project Type
        Title
        Study result (0)
        Result
        Result Type
        Title
        External Crossreferences and Linkouts ( 106 )
        Sequence Crossreferences
        NCBI Gene - Gene integrates information from a wide range of species. A record may include nomenclature, Reference Sequences (RefSeqs), maps, pathways, variations, phenotypes, and links to genome-, phenotype-, and locus-specific resources worldwide.
        GenBank Protein - A collection of sequences from several sources, including translations from annotated coding regions in GenBank, RefSeq and TPA, as well as records from SwissProt, PIR, PRF, and PDB.
        RefSeq - A comprehensive, integrated, non-redundant, well-annotated set of reference sequences including genomic, transcript, and protein.
        UniProt/GCRP - The gene-centric reference proteome (GCRP) provides a 1:1 mapping between genes and UniProt accessions in which a single 'canonical' isoform represents the product(s) of each protein-coding gene.
        UniProt/Swiss-Prot - Manually annotated and reviewed records of protein sequence and functional information
        Other crossreferences
        AlphaFold DB - AlphaFold provides open access to protein structure predictions for the human proteome and other key proteins of interest, to accelerate scientific research.
        BDGP expression data - Patterns of gene expression in Drosophila embryogenesis
        DRscDB - A single-cell RNA-seq resource for data mining and data comparison across species
        EMBL-EBI Single Cell Expression Atlas - Single cell expression across species
        FlyAtlas2 - A Drosophila melanogaster expression atlas with RNA-Seq, miRNA-Seq and sex-specific data
        FlyMine - An integrated database for Drosophila genomics
        KEGG Genes - Molecular building blocks of life in the genomic space.
        MARRVEL_MODEL - MARRVEL (model organism gene)
        Linkouts
        BioGRID - A database of protein and genetic interactions.
        Drosophila Genomics Resource Center - Drosophila Genomics Resource Center (DGRC) cDNA clones
        DroID - A comprehensive database of gene and protein interactions.
        DRSC - Results frm RNAi screens
        Eukaryotic Promoter Database - A collection of databases of experimentally validated promoters for selected model organisms.
        FlyAtlas - Adult expression by tissue, using Affymetrix Dros2 array
        FlyCyc Genes - Genes from a BioCyc PGDB for Dmel
        Fly-FISH - A database of Drosophila embryo and larvae mRNA localization patterns
        Flygut - An atlas of the Drosophila adult midgut
        iBeetle-Base - RNAi phenotypes in the red flour beetle (Tribolium castaneum)
        Interactive Fly - A cyberspace guide to Drosophila development and metazoan evolution
        MIST (protein-protein) - An integrated Molecular Interaction Database
        References (135)