Open Close
General Information
Symbol
Dmel\CG10104
Species
D. melanogaster
Name
Annotation Symbol
CG10104
Feature Type
FlyBase ID
FBgn0033933
Gene Model Status
Stock Availability
Enzyme Name (EC)
Pepsin A (3.4.23.1)
Renin (3.4.23.15)
Penicillopepsin (3.4.23.20)
Saccharopepsin (3.4.23.25)
Scytalidopepsin B (3.4.23.32)
Cathepsin E (3.4.23.34)
Barrierpepsin (3.4.23.35)
Signal peptidase II (3.4.23.36)
Chymosin (3.4.23.4)
Prepilin peptidase (3.4.23.43)
Cathepsin D (3.4.23.5)
Gene Snapshot
Key Links
Genomic Location
Cytogenetic map
Sequence location
2R:14,390,696..14,392,209 [+]
Recombination map

2-71

RefSeq locus
NT_033778 REGION:14390696..14392209
Sequence
Other Genome Views
The following external sites may use different assemblies or annotations than FlyBase.
Function
GO Summary Ribbons
Gene Ontology (GO) Annotations (4 terms)
Molecular Function (1 term)
Terms Based on Experimental Evidence (0 terms)
Terms Based on Predictions or Assertions (1 term)
CV Term
Evidence
References
inferred from electronic annotation with InterPro:IPR001461, InterPro:IPR001969
(assigned by InterPro )
Biological Process (2 terms)
Terms Based on Experimental Evidence (0 terms)
Terms Based on Predictions or Assertions (2 terms)
CV Term
Evidence
References
inferred from biological aspect of ancestor with PANTHER:PTN002671475
(assigned by GO_Central )
inferred from electronic annotation with InterPro:IPR001461, InterPro:IPR001969
(assigned by InterPro )
Cellular Component (1 term)
Terms Based on Experimental Evidence (0 terms)
Terms Based on Predictions or Assertions (1 term)
CV Term
Evidence
References
inferred from biological aspect of ancestor with PANTHER:PTN002671475
(assigned by GO_Central )
Gene Group (FlyBase)
Protein Family (UniProt)
-
Catalytic Activity (EC)
Experimental Evidence
-
Predictions / Assertions
Preferential cleavage: hydrophobic, preferably aromatic, residues in P1 and P1' positions (3.4.23.1)
Cleaves 1-Phe-|-Val-2, 4-Gln-|-His-5, 13-Glu-|-Ala-14, 14-Ala-|-Leu-15, 15-Leu-|-Tyr-16, 16-Tyr-|-Leu-17, 23-Gly-|-Phe-24, 24-Phe-|-Phe-25 and 25-Phe-|-Tyr-26 bonds in the B chain of insulin (3.4.23.1)
Cleavage of Leu-|-Xaa bond in angiotensinogen to generate angiotensin I (3.4.23.15)
Hydrolysis of proteins with broad specificity similar to that of pepsin A, preferring hydrophobic residues at P1 and P1', but also cleaving 20-Gly-|-Glu-21 in the B chain of insulin (3.4.23.20)
Clots milk, and activates trypsinogen (3.4.23.20)
Hydrolysis of proteins with broad specificity for peptide bonds (3.4.23.25)
Cleaves -Leu-Leu-|-Val-Tyr- bond in a synthetic substrate (3.4.23.25)
Does not act on esters of Tyr or Arg (3.4.23.25)
Hydrolysis of proteins with broad specificity, cleaving 24-Phe-|-Phe-25, but not 15-Leu-|-Tyr-16 and 25-Phe-|-Tyr-26 in the B chain of insulin (3.4.23.32)
Similar to cathepsin D, but slightly broader specificity (3.4.23.34)
Selective cleavage of 6-Leu-|-Lys-7 bond in the pheromone alpha-mating factor (3.4.23.35)
Release of signal peptides from bacterial membrane prolipoproteins (3.4.23.36)
Hydrolyzes -Xaa-Yaa-Zaa-|-(S,diacylglyceryl)Cys-, in which Xaa is hydrophobic (preferably Leu), and Yaa (Ala or Ser) and Zaa (Gly or Ala) have small, neutral side chains (3.4.23.36)
Broad specificity similar to that of pepsin A (3.4.23.4)
Clots milk by cleavage of a single 104-Ser-Phe-|-Met-Ala-107 bond in kappa-chain of casein (3.4.23.4)
Typically cleaves a -Gly-|-Phe- bond to release an N-terminal, basic peptide of 5-8 residues from type IV prepilin, and then N-methylates the new N-terminal amino group, the methyl donor being S-adenosyl-L- methionine (3.4.23.43)
Specificity similar to, but narrower than, that of pepsin A (3.4.23.5)
Does not cleave the 4-Gln-|-His-5 bond in B chain of insulin (3.4.23.5)
Summaries
Gene Group (FlyBase)
A1 ASPARTIC ENDOPEPTIDASES -
A1 aspartic endopeptidases belong to MEROPS family A1, most of which are most active at acidic pH.. They catalyse the hydrolysis of internal, alpha-peptide bonds in a polypeptide chain by a mechanism in which a water molecule bound by the side chains of aspartic residues at the active center acts as a nucleophile.
Gene Model and Products
Number of Transcripts
1
Number of Unique Polypeptides
1

Please see the JBrowse view of Dmel\CG10104 for information on other features

To submit a correction to a gene model please use the Contact FlyBase form

Protein Domains (via Pfam)
Isoform displayed:
Pfam protein domains
InterPro name
classification
start
end
Protein Domains (via SMART)
Isoform displayed:
SMART protein domains
InterPro name
classification
start
end
Comments on Gene Model

Gene model reviewed during 5.50

Sequence Ontology: Class of Gene
Transcript Data
Annotated Transcripts
Name
FlyBase ID
RefSeq ID
Length (nt)
Assoc. CDS (aa)
FBtr0087486
1514
404
Additional Transcript Data and Comments
Reported size (kB)
Comments
External Data
Crossreferences
Polypeptide Data
Annotated Polypeptides
Name
FlyBase ID
Predicted MW (kDa)
Length (aa)
Theoretical pI
RefSeq ID
GenBank
FBpp0086615
45.7
404
8.77
Polypeptides with Identical Sequences

There is only one protein coding transcript and one polypeptide associated with this gene

Additional Polypeptide Data and Comments
Reported size (kDa)
Comments
External Data
Linkouts
Sequences Consistent with the Gene Model
Mapped Features

Click to get a list of regulatory features (enhancers, TFBS, etc.) and gene disruptions (point mutations, indels, etc.) within or overlapping Dmel\CG10104 using the Feature Mapper tool.

External Data
Crossreferences
Eukaryotic Promoter Database - A collection of databases of experimentally validated promoters for selected model organisms.
Linkouts
Expression Data
Expression Summary Ribbons
Colored tiles in ribbon indicate that expression data has been curated by FlyBase for that anatomical location. Colorless tiles indicate that there is no curated data for that location.
For complete stage-specific expression data, view the modENCODE Development RNA-Seq section under High-Throughput Expression below.
Transcript Expression
Additional Descriptive Data
Marker for
 
Subcellular Localization
CV Term
Polypeptide Expression
Additional Descriptive Data
Marker for
 
Subcellular Localization
CV Term
Evidence
References
Expression Deduced from Reporters
High-Throughput Expression Data
Associated Tools

GBrowse - Visual display of RNA-Seq signals

View Dmel\CG10104 in GBrowse 2
RNA-Seq by Region - Search RNA-Seq expression levels by exon or genomic region
Reference
See Gelbart and Emmert, 2013 for analysis details and data files for all genes.
Developmental Proteome: Life Cycle
Developmental Proteome: Embryogenesis
External Data and Images
Linkouts
BDGP expression data - Patterns of gene expression in Drosophila embryogenesis
FLIGHT - Cell culture data for RNAi and other high-throughput technologies
FlyAtlas - Adult expression by tissue, using Affymetrix Dros2 array
Fly-FISH - A database of Drosophila embryo and larvae mRNA localization patterns
Flygut - An atlas of the Drosophila adult midgut
Images
Alleles, Insertions, and Transgenic Constructs
Classical and Insertion Alleles ( 0 )
For All Classical and Insertion Alleles Show
 
Other relevant insertions
Transgenic Constructs ( 6 )
For All Alleles Carried on Transgenic Constructs Show
Transgenic constructs containing/affecting coding region of CG10104
Transgenic constructs containing regulatory region of CG10104
Deletions and Duplications ( 0 )
Phenotypes
For more details about a specific phenotype click on the relevant allele symbol.
Lethality
Allele
Phenotype manifest in
Allele
Orthologs
Human Orthologs (via DIOPT v8.0)
Homo sapiens (Human) (12)
Species\Gene Symbol
Score
Best Score
Best Reverse Score
Alignment
Complementation?
Transgene?
6 of 15
Yes
No
1  
6 of 15
Yes
No
6 of 15
Yes
No
3 of 15
No
No
3 of 15
No
No
3 of 15
No
No
3 of 15
No
No
2 of 15
No
No
1 of 15
No
No
6  
1 of 15
No
No
1 of 15
No
No
1 of 15
No
No
Model Organism Orthologs (via DIOPT v8.0)
Mus musculus (laboratory mouse) (10)
Species\Gene Symbol
Score
Best Score
Best Reverse Score
Alignment
Complementation?
Transgene?
7 of 15
Yes
No
6 of 15
No
No
6 of 15
No
Yes
3 of 15
No
No
3 of 15
No
No
3 of 15
No
No
3 of 15
No
No
3 of 15
No
Yes
1 of 15
No
No
1 of 15
No
No
Rattus norvegicus (Norway rat) (11)
6 of 13
Yes
No
6 of 13
Yes
No
5 of 13
No
No
3 of 13
No
No
3 of 13
No
No
2 of 13
No
No
2 of 13
No
No
2 of 13
No
No
1 of 13
No
No
1 of 13
No
No
1 of 13
No
No
Xenopus tropicalis (Western clawed frog) (19)
6 of 12
Yes
Yes
5 of 12
No
No
5 of 12
No
No
3 of 12
No
No
3 of 12
No
No
3 of 12
No
No
3 of 12
No
No
3 of 12
No
No
2 of 12
No
No
2 of 12
No
No
2 of 12
No
Yes
2 of 12
No
Yes
2 of 12
No
Yes
2 of 12
No
Yes
2 of 12
No
No
2 of 12
No
No
1 of 12
No
No
1 of 12
No
No
1 of 12
No
Yes
Danio rerio (Zebrafish) (5)
6 of 15
Yes
No
6 of 15
Yes
No
5 of 15
No
No
5 of 15
No
Yes
1 of 15
No
No
Caenorhabditis elegans (Nematode, roundworm) (20)
8 of 15
Yes
No
6 of 15
No
No
5 of 15
No
Yes
4 of 15
No
No
4 of 15
No
No
4 of 15
No
No
4 of 15
No
No
4 of 15
No
No
3 of 15
No
No
3 of 15
No
Yes
3 of 15
No
Yes
3 of 15
No
No
3 of 15
No
No
2 of 15
No
No
2 of 15
No
No
2 of 15
No
No
2 of 15
No
Yes
1 of 15
No
Yes
1 of 15
No
No
1 of 15
No
No
Arabidopsis thaliana (thale-cress) (5)
6 of 9
Yes
No
6 of 9
Yes
No
6 of 9
Yes
No
3 of 9
No
Yes
3 of 9
No
Yes
Saccharomyces cerevisiae (Brewer's yeast) (9)
7 of 15
Yes
No
4 of 15
No
Yes
3 of 15
No
No
3 of 15
No
No
3 of 15
No
No
3 of 15
No
Yes
2 of 15
No
Yes
2 of 15
No
Yes
1 of 15
No
No
Schizosaccharomyces pombe (Fission yeast) (2)
4 of 12
Yes
No
2 of 12
No
No
Ortholog(s) in Drosophila Species (via OrthoDB v9.1) ( EOG09190B1F )
Organism
Common Name
Gene
AAA Syntenic Ortholog
Multiple Dmel Genes in this Orthologous Group
Drosophila suzukii
Spotted wing Drosophila
Drosophila simulans
Drosophila sechellia
Drosophila erecta
Drosophila yakuba
Drosophila ananassae
Drosophila pseudoobscura pseudoobscura
Drosophila persimilis
Drosophila willistoni
Drosophila virilis
Drosophila mojavensis
Drosophila grimshawi
Orthologs in non-Drosophila Dipterans (via OrthoDB v9.1) ( None identified )
No non-Drosophilid orthologies identified
Orthologs in non-Dipteran Insects (via OrthoDB v9.1) ( EOG090W07EL )
Organism
Common Name
Gene
Multiple Dmel Genes in this Orthologous Group
Bombyx mori
Silkmoth
Danaus plexippus
Monarch butterfly
Heliconius melpomene
Postman butterfly
Apis florea
Little honeybee
Apis mellifera
Western honey bee
Bombus impatiens
Common eastern bumble bee
Bombus terrestris
Buff-tailed bumblebee
Linepithema humile
Argentine ant
Megachile rotundata
Alfalfa leafcutting bee
Nasonia vitripennis
Parasitic wasp
Dendroctonus ponderosae
Mountain pine beetle
Dendroctonus ponderosae
Mountain pine beetle
Tribolium castaneum
Red flour beetle
Pediculus humanus
Human body louse
Rhodnius prolixus
Kissing bug
Rhodnius prolixus
Kissing bug
Rhodnius prolixus
Kissing bug
Cimex lectularius
Bed bug
Cimex lectularius
Bed bug
Acyrthosiphon pisum
Pea aphid
Zootermopsis nevadensis
Nevada dampwood termite
Orthologs in non-Insect Arthropods (via OrthoDB v9.1) ( EOG090X0850 )
Organism
Common Name
Gene
Multiple Dmel Genes in this Orthologous Group
Strigamia maritima
European centipede
Ixodes scapularis
Black-legged tick
Ixodes scapularis
Black-legged tick
Ixodes scapularis
Black-legged tick
Stegodyphus mimosarum
African social velvet spider
Stegodyphus mimosarum
African social velvet spider
Tetranychus urticae
Two-spotted spider mite
Daphnia pulex
Water flea
Daphnia pulex
Water flea
Daphnia pulex
Water flea
Orthologs in non-Arthropod Metazoa (via OrthoDB v9.1) ( EOG091G0JP7 )
Organism
Common Name
Gene
Multiple Dmel Genes in this Orthologous Group
Strongylocentrotus purpuratus
Purple sea urchin
Strongylocentrotus purpuratus
Purple sea urchin
Ciona intestinalis
Vase tunicate
Ciona intestinalis
Vase tunicate
Gallus gallus
Domestic chicken
Gallus gallus
Domestic chicken
Gallus gallus
Domestic chicken
Gallus gallus
Domestic chicken
Gallus gallus
Domestic chicken
Gallus gallus
Domestic chicken
Gallus gallus
Domestic chicken
Gallus gallus
Domestic chicken
Paralogs
Paralogs (via DIOPT v8.0)
Drosophila melanogaster (Fruit fly) (12)
6 of 10
6 of 10
6 of 10
6 of 10
6 of 10
6 of 10
6 of 10
6 of 10
6 of 10
6 of 10
6 of 10
5 of 10
Human Disease Associations
FlyBase Human Disease Model Reports
    Disease Model Summary Ribbon
    Disease Ontology (DO) Annotations
    Models Based on Experimental Evidence ( 0 )
    Allele
    Disease
    Evidence
    References
    Potential Models Based on Orthology ( 2 )
    Modifiers Based on Experimental Evidence ( 0 )
    Allele
    Disease
    Interaction
    References
    Disease Associations of Human Orthologs (via DIOPT v8.0 and OMIM)
    Note that ortholog calls supported by only 1 or 2 algorithms (DIOPT score < 3) are not shown.
    Functional Complementation Data
    Functional complementation data is computed by FlyBase using a combination of the orthology data obtained from DIOPT and OrthoDB and the allele-level genetic interaction data curated from the literature.
    Interactions
    Summary of Physical Interactions
    esyN Network Diagram
    Interactions Browser
    Summary of Genetic Interactions
    esyN Network Diagram
    Starting gene(s)
    Interaction type
    Interacting gene(s)
    Reference
    Starting gene(s)
    Interaction type
    Interacting gene(s)
    Reference
    External Data
    Linkouts
    BioGRID - A database of protein and genetic interactions.
    DroID - A comprehensive database of gene and protein interactions.
    InterologFinder - Protein-protein interactions (PPI) from both known and predicted PPI data sets.
    Pathways
    Signaling Pathways (FlyBase)
    Metabolic Pathways
    External Data
    Linkouts
    KEGG Pathways - Wiring diagrams of molecular interactions, reactions and relations.
    Reactome - An open-source, open access, manually curated and peer-reviewed pathway database.
    Genomic Location and Detailed Mapping Data
    Chromosome (arm)
    2R
    Recombination map

    2-71

    Cytogenetic map
    Sequence location
    2R:14,390,696..14,392,209 [+]
    FlyBase Computed Cytological Location
    Cytogenetic map
    Evidence for location
    51A1-51A1
    Limits computationally determined from genome sequence between P{PZ}Tfb106949&P{PZ}l(2)0356303563 and P{lacW}l(2)k16805k16805
    Experimentally Determined Cytological Location
    Cytogenetic map
    Notes
    References
    Experimentally Determined Recombination Data
    Location
    Left of (cM)
    Right of (cM)
    Notes
    Stocks and Reagents
    Stocks (5)
    Genomic Clones (29)
    cDNA Clones (6)
     

    Please Note This section lists cDNAs and ESTs that fall within the genomic extent of the gene model, which may include cDNAs and ESTs of genes within introns, or of overlapping genes. Please see GBrowse for alignment of the cDNAs and ESTs to the gene model.

    cDNA clones, fully sequenced
    BDGP DGC clones
    Other clones
      Drosophila Genomics Resource Center cDNA clones

      For each fully sequenced cDNA the DGRC maintains various forms of the cDNA (e.g tagged or untagged) in several different host vectors for subsequent cloning and expression in Drosophila and Drosophila cell lines.

      cDNA Clones, End Sequenced (ESTs)
      BDGP DGC clones
      Other clones
      RNAi and Array Information
      Linkouts
      DRSC - Results frm RNAi screens
      GenomeRNAi - A database for cell-based and in vivo RNAi phenotypes and reagents
      Antibody Information
      Laboratory Generated Antibodies
       
      Commercially Available Antibodies
       
      Other Information
      Relationship to Other Genes
      Source for database identify of
      Source for database merge of
      Additional comments

      The CG10104 gene may have been derived from the cathD gene by retroposition.

      Other Comments
      Origin and Etymology
      Discoverer
      Etymology
      Identification
      External Crossreferences and Linkouts ( 37 )
      Sequence Crossreferences
      NCBI Gene - Gene integrates information from a wide range of species. A record may include nomenclature, Reference Sequences (RefSeqs), maps, pathways, variations, phenotypes, and links to genome-, phenotype-, and locus-specific resources worldwide.
      GenBank Nucleotide - A collection of sequences from several sources, including GenBank, RefSeq, TPA, and PDB.
      GenBank Protein - A collection of sequences from several sources, including translations from annotated coding regions in GenBank, RefSeq and TPA, as well as records from SwissProt, PIR, PRF, and PDB.
      RefSeq - A comprehensive, integrated, non-redundant, well-annotated set of reference sequences including genomic, transcript, and protein.
      UniProt/TrEMBL - Automatically annotated and unreviewed records of protein sequence and functional information
      Other crossreferences
      BDGP expression data - Patterns of gene expression in Drosophila embryogenesis
      Drosophila Genomics Resource Center - Drosophila Genomics Resource Center (DGRC) cDNA clones
      Eukaryotic Promoter Database - A collection of databases of experimentally validated promoters for selected model organisms.
      Fly-FISH - A database of Drosophila embryo and larvae mRNA localization patterns
      Flygut - An atlas of the Drosophila adult midgut
      GenomeRNAi - A database for cell-based and in vivo RNAi phenotypes and reagents
      iBeetle-Base - RNAi phenotypes in the red flour beetle (Tribolium castaneum)
      KEGG Genes - Molecular building blocks of life in the genomic space.
      modMine - A data warehouse for the modENCODE project
      Linkouts
      BioGRID - A database of protein and genetic interactions.
      DroID - A comprehensive database of gene and protein interactions.
      DRSC - Results frm RNAi screens
      FLIGHT - Cell culture data for RNAi and other high-throughput technologies
      FlyAtlas - Adult expression by tissue, using Affymetrix Dros2 array
      FlyCyc Genes - Genes from a BioCyc PGDB for Dmel
      FlyMine - An integrated database for Drosophila genomics
      InterologFinder - Protein-protein interactions (PPI) from both known and predicted PPI data sets.
      KEGG Pathways - Wiring diagrams of molecular interactions, reactions and relations.
      Reactome - An open-source, open access, manually curated and peer-reviewed pathway database.
      Synonyms and Secondary IDs (1)
      Datasets (0)
      Study focus (0)
      Experimental Role
      Project
      Project Type
      Title
      References (31)