Open Close
General Information
Symbol
Dmel\Clp
Species
D. melanogaster
Name
Clipper
Annotation Symbol
CG3642
Feature Type
FlyBase ID
FBgn0015621
Gene Model Status
Stock Availability
Gene Snapshot
Also Known As

CPSF30, c6a

Key Links
Genomic Location
Cytogenetic map
Sequence location
2L:819,964..821,209 [+]
Recombination map

2-1.5

RefSeq locus
NT_033779 REGION:819964..821209
Sequence
Other Genome Views
The following external sites may use different assemblies or annotations than FlyBase.
Function
GO Summary Ribbons
Gene Ontology (GO) Annotations (5 terms)
Molecular Function (3 terms)
Terms Based on Experimental Evidence (1 term)
CV Term
Evidence
References
Terms Based on Predictions or Assertions (2 terms)
CV Term
Evidence
References
inferred from electronic annotation with InterPro:IPR001878, InterPro:IPR036875
(assigned by InterPro )
inferred from electronic annotation with InterPro:IPR001878
(assigned by InterPro )
Biological Process (1 term)
Terms Based on Experimental Evidence (0 terms)
Terms Based on Predictions or Assertions (1 term)
CV Term
Evidence
References
inferred from biological aspect of ancestor with PANTHER:PTN000559780
(assigned by GO_Central )
inferred from sequence or structural similarity with UniProtKB:Q06102
Cellular Component (1 term)
Terms Based on Experimental Evidence (0 terms)
Terms Based on Predictions or Assertions (1 term)
CV Term
Evidence
References
Protein Family (UniProt)
-
Summaries
Gene Group (FlyBase)
HISTONE PRE-MRNA CLEAVAGE COMPLEX -
Histone cleavage complex includes endonucleases and polyadenylation factors involved in processing of histone pre-mRNA. (Adapted from FBrf0223323 and PMID:23071092).
POLYADENYLATION FACTORS -
Polyadenylation factors associate with snRNPs, cleavage complexes and other factors to mediate the 3' end processing of pre-mRNA. (Adapted from FBrf0223323).
OTHER ENDORIBONUCLEASES -
Other endoribonucleases are a collection of endonucleases that do not fall into any of the major endoribonucleases.
Protein Function (UniProtKB)
Component of the cleavage and polyadenylation specificity factor (CPSF) complex that plays a key role in pre-mRNA 3'-end formation, recognizing the AAUAAA signal sequence and interacting with poly(A) polymerase and other factors to bring about cleavage and poly(A) addition. Has endonuclease activity. Binds RNA polymers with a preference for G- and/or C-rich clusters. Binds single-stranded DNA non-specifically.
(UniProt, Q9VPT8)
Gene Model and Products
Number of Transcripts
1
Number of Unique Polypeptides
1

Please see the JBrowse view of Dmel\Clp for information on other features

To submit a correction to a gene model please use the Contact FlyBase form

Protein Domains (via Pfam)
Isoform displayed:
Pfam protein domains
InterPro name
classification
start
end
Protein Domains (via SMART)
Isoform displayed:
SMART protein domains
InterPro name
classification
start
end
Comments on Gene Model

Gene model reviewed during 5.46

Sequence Ontology: Class of Gene
Transcript Data
Annotated Transcripts
Name
FlyBase ID
RefSeq ID
Length (nt)
Assoc. CDS (aa)
FBtr0078011
1246
296
Additional Transcript Data and Comments
Reported size (kB)

1.3 (northern blot)

1.246 (longest cDNA)

Comments
External Data
Crossreferences
Polypeptide Data
Annotated Polypeptides
Name
FlyBase ID
Predicted MW (kDa)
Length (aa)
Theoretical pI
RefSeq ID
GenBank
FBpp0077676
33.5
296
8.20
Polypeptides with Identical Sequences

There is only one protein coding transcript and one polypeptide associated with this gene

Additional Polypeptide Data and Comments
Reported size (kDa)

33.5 (kD predicted)

Comments
External Data
Subunit Structure (UniProtKB)

Component of the cleavage and polyadenylation specificity factor (CPSF) complex, composed of at least Clp, Cpsf73, Cpsf100 and Cpsf160.

(UniProt, Q9VPT8)
Domain

The N-terminal region containing the five C3H1-type zinc fingers is essential for endonuclease activity.

The C-terminal region containing the two CCHC-type zinc fingers confers a binding preference for RNAs that contain G- and/or C-rich clusters.

(UniProt, Q9VPT8)
Crossreferences
Linkouts
Sequences Consistent with the Gene Model
Mapped Features

Click to get a list of regulatory features (enhancers, TFBS, etc.) and gene disruptions (point mutations, indels, etc.) within or overlapping Dmel\Clp using the Feature Mapper tool.

External Data
Crossreferences
Eukaryotic Promoter Database - A collection of databases of experimentally validated promoters for selected model organisms.
Linkouts
Expression Data
Expression Summary Ribbons
Colored tiles in ribbon indicate that expression data has been curated by FlyBase for that anatomical location. Colorless tiles indicate that there is no curated data for that location.
For complete stage-specific expression data, view the modENCODE Development RNA-Seq section under High-Throughput Expression below.
Transcript Expression
in situ
Stage
Tissue/Position (including subcellular localization)
Reference
organism

Comment: maternally deposited

northern blot
Stage
Tissue/Position (including subcellular localization)
Reference
Additional Descriptive Data

In Northern blots, Clp expression is detected in early embryos and in ovaries. In situ hybridization shows that embryonic expression, which is detected until the cellular blastoderm stage, is ubiquitous. In the ovary, expression is detected starting at stage S3, in both oocytes and nurse cells. Oocyte expression stops being detected at stage S8, but resumes at stage S12, with the transfer of nurse cell contents to the oocyte. RNase protection assays show that the Clp transcript is expressed in all developmental stages, and is most abundant in adults. Maximal adult female expression is in the ovary. Amount of embryonic Clp transcript decreases during embryogenesis. A second peak of expression in third instar larvae decays to barely detectable levels in late pupae.

Marker for
 
Subcellular Localization
CV Term
Polypeptide Expression
immunolocalization
Stage
Tissue/Position (including subcellular localization)
Reference
Additional Descriptive Data

The Clp protein is detected in nurse cell nuclei up to stage S12 of oogenesis, and in the oocyte nucleus throughout oogenesis. Follicle cell nuclei also stain. Clp protein is also detected in the nuclei of all larval tissues. Salivary gland expression is initially restricted to the anterior of the gland and subsequently spreads to the rest of the gland. No Clp protein was detected in the embryo.

Marker for
 
Subcellular Localization
CV Term
Evidence
References
Expression Deduced from Reporters
High-Throughput Expression Data
Associated Tools

GBrowse - Visual display of RNA-Seq signals

View Dmel\Clp in GBrowse 2
RNA-Seq by Region - Search RNA-Seq expression levels by exon or genomic region
Reference
See Gelbart and Emmert, 2013 for analysis details and data files for all genes.
Developmental Proteome: Life Cycle
Developmental Proteome: Embryogenesis
External Data and Images
Linkouts
BDGP expression data - Patterns of gene expression in Drosophila embryogenesis
FLIGHT - Cell culture data for RNAi and other high-throughput technologies
FlyAtlas - Adult expression by tissue, using Affymetrix Dros2 array
Flygut - An atlas of the Drosophila adult midgut
Images
Alleles, Insertions, and Transgenic Constructs
Classical and Insertion Alleles ( 0 )
For All Classical and Insertion Alleles Show
 
Other relevant insertions
Transgenic Constructs ( 9 )
For All Alleles Carried on Transgenic Constructs Show
Transgenic constructs containing/affecting coding region of Clp
Transgenic constructs containing regulatory region of Clp
Deletions and Duplications ( 2 )
Phenotypes
Orthologs
Human Orthologs (via DIOPT v8.0)
Homo sapiens (Human) (6)
Species\Gene Symbol
Score
Best Score
Best Reverse Score
Alignment
Complementation?
Transgene?
15 of 15
Yes
Yes
1  
6 of 15
No
Yes
1  
1 of 15
No
No
1 of 15
No
No
1 of 15
No
No
1 of 15
No
No
Model Organism Orthologs (via DIOPT v8.0)
Mus musculus (laboratory mouse) (6)
Species\Gene Symbol
Score
Best Score
Best Reverse Score
Alignment
Complementation?
Transgene?
13 of 15
Yes
Yes
6 of 15
No
Yes
1 of 15
No
No
1 of 15
No
No
1 of 15
No
No
1 of 15
No
No
Rattus norvegicus (Norway rat) (6)
13 of 13
Yes
Yes
2 of 13
No
Yes
2 of 13
No
Yes
1 of 13
No
No
1 of 13
No
No
Xenopus tropicalis (Western clawed frog) (4)
13 of 12
Yes
Yes
11 of 12
No
Yes
1 of 12
No
No
1 of 12
No
No
Danio rerio (Zebrafish) (3)
10 of 15
Yes
Yes
1 of 15
No
No
1 of 15
No
No
Caenorhabditis elegans (Nematode, roundworm) (3)
14 of 15
Yes
Yes
1 of 15
No
No
1 of 15
No
No
Arabidopsis thaliana (thale-cress) (2)
1 of 9
Yes
No
1 of 9
Yes
No
Saccharomyces cerevisiae (Brewer's yeast) (1)
10 of 15
Yes
Yes
Schizosaccharomyces pombe (Fission yeast) (2)
7 of 12
Yes
Yes
1 of 12
No
No
Ortholog(s) in Drosophila Species (via OrthoDB v9.1) ( EOG09190BQP )
Organism
Common Name
Gene
AAA Syntenic Ortholog
Multiple Dmel Genes in this Orthologous Group
Drosophila suzukii
Spotted wing Drosophila
Drosophila simulans
Drosophila erecta
Drosophila yakuba
Drosophila ananassae
Drosophila pseudoobscura pseudoobscura
Drosophila willistoni
Drosophila virilis
Drosophila mojavensis
Orthologs in non-Drosophila Dipterans (via OrthoDB v9.1) ( EOG0915090W )
Organism
Common Name
Gene
Multiple Dmel Genes in this Orthologous Group
Musca domestica
House fly
Lucilia cuprina
Australian sheep blowfly
Mayetiola destructor
Hessian fly
Aedes aegypti
Yellow fever mosquito
Anopheles darlingi
American malaria mosquito
Anopheles gambiae
Malaria mosquito
Culex quinquefasciatus
Southern house mosquito
Orthologs in non-Dipteran Insects (via OrthoDB v9.1) ( EOG090W09X1 )
Organism
Common Name
Gene
Multiple Dmel Genes in this Orthologous Group
Bombyx mori
Silkmoth
Danaus plexippus
Monarch butterfly
Heliconius melpomene
Postman butterfly
Apis florea
Little honeybee
Apis mellifera
Western honey bee
Bombus impatiens
Common eastern bumble bee
Bombus terrestris
Buff-tailed bumblebee
Linepithema humile
Argentine ant
Megachile rotundata
Alfalfa leafcutting bee
Nasonia vitripennis
Parasitic wasp
Dendroctonus ponderosae
Mountain pine beetle
Tribolium castaneum
Red flour beetle
Tribolium castaneum
Red flour beetle
Pediculus humanus
Human body louse
Cimex lectularius
Bed bug
Acyrthosiphon pisum
Pea aphid
Zootermopsis nevadensis
Nevada dampwood termite
Orthologs in non-Insect Arthropods (via OrthoDB v9.1) ( EOG090X09SV )
Organism
Common Name
Gene
Multiple Dmel Genes in this Orthologous Group
Strigamia maritima
European centipede
Strigamia maritima
European centipede
Strigamia maritima
European centipede
Ixodes scapularis
Black-legged tick
Stegodyphus mimosarum
African social velvet spider
Tetranychus urticae
Two-spotted spider mite
Tetranychus urticae
Two-spotted spider mite
Daphnia pulex
Water flea
Orthologs in non-Arthropod Metazoa (via OrthoDB v9.1) ( EOG091G0JUF )
Organism
Common Name
Gene
Multiple Dmel Genes in this Orthologous Group
Ciona intestinalis
Vase tunicate
Gallus gallus
Domestic chicken
Paralogs
Paralogs (via DIOPT v8.0)
Drosophila melanogaster (Fruit fly) (2)
2 of 10
1 of 10
Human Disease Associations
FlyBase Human Disease Model Reports
    Disease Model Summary Ribbon
    Disease Ontology (DO) Annotations
    Models Based on Experimental Evidence ( 0 )
    Allele
    Disease
    Evidence
    References
    Potential Models Based on Orthology ( 0 )
    Human Ortholog
    Disease
    Evidence
    References
    Modifiers Based on Experimental Evidence ( 0 )
    Allele
    Disease
    Interaction
    References
    Disease Associations of Human Orthologs (via DIOPT v8.0 and OMIM)
    Note that ortholog calls supported by only 1 or 2 algorithms (DIOPT score < 3) are not shown.
    Homo sapiens (Human)
    Gene name
    Score
    OMIM
    OMIM Phenotype
    DO term
    Complementation?
    Transgene?
    Functional Complementation Data
    Functional complementation data is computed by FlyBase using a combination of the orthology data obtained from DIOPT and OrthoDB and the allele-level genetic interaction data curated from the literature.
    Interactions
    Summary of Physical Interactions
    esyN Network Diagram
    Show neighbor-neighbor interactions:
    Select Layout:
    Legend:
    Protein
    RNA
    Selected Interactor(s)
    Interactions Browser

    Please see the Physical Interaction reports below for full details
    protein-protein
    Physical Interaction
    Assay
    References
    Summary of Genetic Interactions
    esyN Network Diagram
    Starting gene(s)
    Interaction type
    Interacting gene(s)
    Reference
    Starting gene(s)
    Interaction type
    Interacting gene(s)
    Reference
    External Data
    Subunit Structure (UniProtKB)
    Component of the cleavage and polyadenylation specificity factor (CPSF) complex, composed of at least Clp, Cpsf73, Cpsf100 and Cpsf160.
    (UniProt, Q9VPT8 )
    Linkouts
    BioGRID - A database of protein and genetic interactions.
    DroID - A comprehensive database of gene and protein interactions.
    InterologFinder - Protein-protein interactions (PPI) from both known and predicted PPI data sets.
    MIST (protein-protein) - An integrated Molecular Interaction Database
    Pathways
    Signaling Pathways (FlyBase)
    Metabolic Pathways
    External Data
    Linkouts
    KEGG Pathways - Wiring diagrams of molecular interactions, reactions and relations.
    Genomic Location and Detailed Mapping Data
    Chromosome (arm)
    2L
    Recombination map

    2-1.5

    Cytogenetic map
    Sequence location
    2L:819,964..821,209 [+]
    FlyBase Computed Cytological Location
    Cytogenetic map
    Evidence for location
    21E2-21E2
    Limits computationally determined from genome sequence between P{lacW}l(2)k09610k09610&P{lacW}Sk09538a and P{PZ}l(2)1068510685&P{lacW}Tango14k00619
    Experimentally Determined Cytological Location
    Cytogenetic map
    Notes
    References
    Experimentally Determined Recombination Data
    Location
    Left of (cM)
    Right of (cM)
    Notes
    Stocks and Reagents
    Stocks (9)
    Genomic Clones (11)
     

    Please Note FlyBase no longer curates genomic clone accessions so this list may not be complete

    cDNA Clones (8)
     

    Please Note This section lists cDNAs and ESTs that fall within the genomic extent of the gene model, which may include cDNAs and ESTs of genes within introns, or of overlapping genes. Please see GBrowse for alignment of the cDNAs and ESTs to the gene model.

    cDNA clones, fully sequenced
    BDGP DGC clones
    Other clones
    Drosophila Genomics Resource Center cDNA clones

    For each fully sequenced cDNA the DGRC maintains various forms of the cDNA (e.g tagged or untagged) in several different host vectors for subsequent cloning and expression in Drosophila and Drosophila cell lines.

    cDNA Clones, End Sequenced (ESTs)
    BDGP DGC clones
    RNAi and Array Information
    Linkouts
    DRSC - Results frm RNAi screens
    GenomeRNAi - A database for cell-based and in vivo RNAi phenotypes and reagents
    Antibody Information
    Laboratory Generated Antibodies
     
    Commercially Available Antibodies
     
    Other Information
    Relationship to Other Genes
    Source for database identify of

    Source for identity of: Clp CG3642

    Source for database merge of
    Additional comments
    Other Comments

    The C-terminus of the Clp protein, containing two CCHC zinc knuckles, confers a binding preference for RNAs containing G- and/or C-rich clusters.

    Preliminary work implicates Clp in the turnover of RNA.

    Identified in an expression cloning screen of an ovarian cDNA library for plaques that encode proteins that bind ssDNA (FBrf0074556). Clp is a developmentally regulated gene that encodes a bipartite zinc finger protein.

    Origin and Etymology
    Discoverer
    Etymology
    Identification
    External Crossreferences and Linkouts ( 34 )
    Sequence Crossreferences
    NCBI Gene - Gene integrates information from a wide range of species. A record may include nomenclature, Reference Sequences (RefSeqs), maps, pathways, variations, phenotypes, and links to genome-, phenotype-, and locus-specific resources worldwide.
    GenBank Nucleotide - A collection of sequences from several sources, including GenBank, RefSeq, TPA, and PDB.
    GenBank Protein - A collection of sequences from several sources, including translations from annotated coding regions in GenBank, RefSeq and TPA, as well as records from SwissProt, PIR, PRF, and PDB.
    RefSeq - A comprehensive, integrated, non-redundant, well-annotated set of reference sequences including genomic, transcript, and protein.
    UniProt/Swiss-Prot - Manually annotated and reviewed records of protein sequence and functional information
    Other crossreferences
    BDGP expression data - Patterns of gene expression in Drosophila embryogenesis
    Drosophila Genomics Resource Center - Drosophila Genomics Resource Center (DGRC) cDNA clones
    Eukaryotic Promoter Database - A collection of databases of experimentally validated promoters for selected model organisms.
    Flygut - An atlas of the Drosophila adult midgut
    GenomeRNAi - A database for cell-based and in vivo RNAi phenotypes and reagents
    KEGG Genes - Molecular building blocks of life in the genomic space.
    modMine - A data warehouse for the modENCODE project
    Linkouts
    BioGRID - A database of protein and genetic interactions.
    DPiM - Drosophila Protein interaction map
    DroID - A comprehensive database of gene and protein interactions.
    DRSC - Results frm RNAi screens
    FLIGHT - Cell culture data for RNAi and other high-throughput technologies
    FlyAtlas - Adult expression by tissue, using Affymetrix Dros2 array
    FlyCyc Genes - Genes from a BioCyc PGDB for Dmel
    FlyMine - An integrated database for Drosophila genomics
    InterologFinder - Protein-protein interactions (PPI) from both known and predicted PPI data sets.
    KEGG Pathways - Wiring diagrams of molecular interactions, reactions and relations.
    MIST (protein-protein) - An integrated Molecular Interaction Database
    Synonyms and Secondary IDs (10)
    Datasets (0)
    Study focus (0)
    Experimental Role
    Project
    Project Type
    Title
    References (54)