FB2026_01 , released March 12, 2026
FB2026_01 , released March 12, 2026
Gene: Dmel\PGRP-SA
Open Close
General Information
Symbol
Dmel\PGRP-SA
Species
D. melanogaster
Name
Peptidoglycan recognition protein SA
Annotation Symbol
CG11709
Feature Type
FlyBase ID
FBgn0030310
Gene Model Status
Stock Availability
Enzyme Name (EC)
muramoyltetrapeptide carboxypeptidase (3.4.17.13)
Gene Summary
Peptidoglycan recognition protein SA (PGRP-SA) encodes a secreted protein that mediates Toll pathway activation during bacterial infection. It binds to peptidoglycans and, together with the product of GNBP1, activates the serine protease encoded by modSP. [Date last reviewed: 2019-03-14] (FlyBase Gene Snapshot)
Also Known As

seml, semmelweis

Key Links
Genomic Location
Cytogenetic map
Sequence location
Recombination map
1-35
RefSeq locus
NC_004354 REGION:11561643..11562779
Sequence
Genomic Maps
Other Genome Views
The following external sites may use different assemblies or annotations than FlyBase.
Function
Gene Ontology (GO) Annotations (17 terms)
Molecular Function (8 terms)
Terms Based on Experimental Evidence (5 terms)
CV Term
Evidence
References
Terms Based on Predictions or Assertions (4 terms)
CV Term
Evidence
References
non-traceable author statement
inferred from electronic annotation with InterPro:IPR017331
traceable author statement
inferred from biological aspect of ancestor with PANTHER:PTN002475783
inferred from electronic annotation with InterPro:IPR006619
Biological Process (7 terms)
Terms Based on Experimental Evidence (4 terms)
CV Term
Evidence
References
inferred from mutant phenotype
inferred from genetic interaction with FLYBASE:SPE; FB:FBgn0039102
inferred from genetic interaction with FLYBASE:GNBP1; FB:FBgn0040323
inferred from direct assay
inferred from mutant phenotype
Terms Based on Predictions or Assertions (4 terms)
CV Term
Evidence
References
involved_in immune response
inferred from biological aspect of ancestor with PANTHER:PTN002475783
non-traceable author statement
inferred from electronic annotation with InterPro:IPR017331
Cellular Component (2 terms)
Terms Based on Experimental Evidence (1 term)
CV Term
Evidence
References
inferred from direct assay
Terms Based on Predictions or Assertions (1 term)
CV Term
Evidence
References
is_active_in extracellular space
inferred from biological aspect of ancestor with PANTHER:PTN002475783
Protein Family (UniProt)
Belongs to the N-acetylmuramoyl-L-alanine amidase 2 family. (Q9VYX7)
Catalytic Activity (EC/Rhea)
muramoyltetrapeptide carboxypeptidase activity
N-acetyl-D-glucosaminyl-N-acetylmuramoyl-L-alanyl-meso-2,6- diaminoheptanedioyl-D-alanine + H2O = N-acetyl-D-glucosaminyl-N- acetylmuramoyl-L-alanyl-meso-2,6-diaminoheptanedioate + D-alanine (3.4.17.13)
RHEA 48688:
Summaries
Gene Snapshot
Peptidoglycan recognition protein SA (PGRP-SA) encodes a secreted protein that mediates Toll pathway activation during bacterial infection. It binds to peptidoglycans and, together with the product of GNBP1, activates the serine protease encoded by modSP. [Date last reviewed: 2019-03-14]
Gene Group (FlyBase)
L,D-CARBOXYPEPTIDASES -
L,D-carboxypeptidases cleave L,D-configured peptide bonds present in the subunits of peptidoglycan components found in the cell walls of both Gram-positive and Gram-negative bacteria. The presence of this enzymatic activity in a eukaryotic protein may therefore mediate a response to bacterial pathogens. (Adapted from FBrf0180474.)
SHORT PEPTIDOGLYCAN RECOGNITION PROTEINS -
Peptidoglycan recognition proteins (PGRPs) are pattern recognition molecules of the innate immune response that bind peptidoglycans. Insect PGRPs can be divided into two groups based on transcript size: short PGRPs (PGRP-S) and long PGRPs (PGRP-L). PGRP-S have signal peptides and can be extracellular proteins. (Adapted from FBrf0223078).
UNCLASSIFIED PSEUDOENZYMES -
This group comprises pseudoenzymes that do not classify under other groups in FlyBase.
Pathway (FlyBase)
EXTRACELLULAR SPATZLE ACTIVATING PATHWAY CORE COMPONENTS -
In Drosophila, activation of the Toll pathway is controlled by the generation of a cleaved, active, Toll-binding form of spatzle ligand. Proteolytic activation of spatzle ligand lies downstream of several zymogen activation cascades that are initiated by different cues. In dorsal-ventral patterning, localized activation of spz results in ventral nuclear accumulation of dl. During viral and fungal and gram positive bacterial immune challenge, zymogen cascades are activated by extracellular pattern recognition receptors or virulence factor-mediated cleavage of the zymogen persephone (psh). (Adapted from FBrf0091014 and FBrf0223077).
Protein Function (UniProtKB)
Peptidoglycan-recognition protein that plays a key role in innate immunity by binding to peptidoglycans (PGN) of Gram-positive bacteria and activating the Toll pathway upstream of spz activating enzyme SPE (PubMed:11106397, PubMed:15448690, PubMed:16399077). Has no activity against Gram-negative bacteria and fungi (PubMed:11742401). Shows some partial redundancy with PRPGP-SD in Gram-positive bacteria recognition (PubMed:11742401, PubMed:15448690). May act by forming a complex with GNBP1 that activates the proteolytic cleavage of Spatzle and the subsequent activation of Toll pathway (PubMed:11742401, PubMed:14684822, PubMed:14722090). Binds to diaminopimelic acid-type tetrapeptide PGN (DAP-type PGN) and lysine-type PGN (Lys-type PGN) (PubMed:15361936). Has some L,D-carboxypeptidase activity for DAP-type PGN, which are specific to prokaryotes, but not for Lys-type PGN (PubMed:15361936).
(UniProt, Q9VYX7)
Gene Model and Products
Number of Transcripts
2
Number of Unique Polypeptides
1

Please see the JBrowse view of Dmel\PGRP-SA for information on other features

To submit a correction to a gene model please use the Contact FlyBase form

Protein Domains (via Pfam)
Isoform displayed:
Pfam protein domains
InterPro name
classification
start
end
Protein Domains (via SMART)
Isoform displayed:
SMART protein domains
InterPro name
classification
start
end
Structure
Protein 3D structure   (Predicted by AlphaFold)   (AlphaFold entry Q9VYX7)

If you don't see a structure in the viewer, refresh your browser.
Model Confidence:
  • Very high (pLDDT > 90)
  • Confident (90 > pLDDT > 70)
  • Low (70 > pLDDT > 50)
  • Very low (pLDDT < 50)

AlphaFold produces a per-residue confidence score (pLDDT) between 0 and 100. Some regions with low pLDDT may be unstructured in isolation.

Experimentally Determined Structures
Crossreferences
PDB - An information portal to biological macromolecular structures
Comments on Gene Model

Gene model reviewed during 5.49

Gene model reviewed during 5.50

Transcript Data
Annotated Transcripts
Name
FlyBase ID
RefSeq ID
Length (nt)
Assoc. CDS (aa)
FBtr0073509
1018
203
FBtr0340296
726
203
Additional Transcript Data and Comments
Reported size (kB)
Comments
External Data
Crossreferences
Polypeptide Data
Annotated Polypeptides
Name
FlyBase ID
Predicted MW (kDa)
Length (aa)
Theoretical pI
UniProt
RefSeq ID
GenBank
FBpp0073358
22.3
203
7.11
Polypeptides with Identical Sequences

The group(s) of polypeptides indicated below share identical sequence to each other.

203 aa isoforms: PGRP-SA-PA, PGRP-SA-PB
Additional Polypeptide Data and Comments
Reported size (kDa)
Comments
External Data
Linkouts
Sequences Consistent with the Gene Model
Mapped Features

Click to get a list of regulatory features (enhancers, TFBS, etc.) and gene disruptions (point mutations, indels, etc.) within or overlapping Dmel\PGRP-SA using the Feature Mapper tool.

External Data
Crossreferences
Eukaryotic Promoter Database - A collection of databases of experimentally validated promoters for selected model organisms.
Linkouts
Expression Data
Testis-specificity index

The testis specificity index was calculated from modENCODE tissue expression data by Vedelek et al., 2018 to indicate the degree of testis enrichment compared to other tissues. Scores range from -2.52 (underrepresented) to 5.2 (very high testis bias).

0.22

Transcript Expression
Additional Descriptive Data
Marker for
 
Subcellular Localization
CV Term
Polypeptide Expression
Additional Descriptive Data
Marker for
 
Subcellular Localization
CV Term
Evidence
References
inferred from direct assay
Expression Deduced from Reporters
High-Throughput Expression Data
Associated Tools

JBrowse - Visual display of RNA-Seq signals

View Dmel\PGRP-SA in JBrowse
RNA-Seq by Region - Search RNA-Seq expression levels by exon or genomic region
Reference
See Gelbart and Emmert, 2013 for analysis details and data files for all genes.
Developmental Proteome: Life Cycle
Developmental Proteome: Embryogenesis
External Data and Images
Linkouts
DRscDB - A single-cell RNA-seq resource for data mining and data comparison across species
EMBL-EBI Single Cell Expression Atlas - Single cell expression across species
FlyAtlas - Adult expression by tissue, using Affymetrix Dros2 array
FlyAtlas2 - A Drosophila melanogaster expression atlas with RNA-Seq, miRNA-Seq and sex-specific data
Fly-FISH - A database of Drosophila embryo and larvae mRNA localization patterns
Flygut - An atlas of the Drosophila adult midgut
Images
Alleles, Insertions, Transgenic Constructs, and Aberrations
Classical and Insertion Alleles ( 3 )
For All Classical and Insertion Alleles Show
 
Other relevant insertions
Transgenic Constructs ( 9 )
For All Alleles Carried on Transgenic Constructs Show
Transgenic constructs containing/affecting coding region of PGRP-SA
Transgenic constructs containing regulatory region of PGRP-SA
Aberrations (Deficiencies and Duplications) ( 2 )
Variants
Variant Molecular Consequences
Alleles Representing Disease-Implicated Variants
Phenotypes
Orthologs
Human Orthologs (via DIOPT v9.1)
Species\Gene Symbol
Score
Best Score
Best Reverse Score
Alignment
Complementation?
Transgene?
Homo sapiens (Human) (4)
10 of 14
Yes
No
1  
8 of 14
No
Yes
1  
7 of 14
No
Yes
6 of 14
No
No
Model Organism Orthologs (via DIOPT v9.1)
Species\Gene Symbol
Score
Best Score
Best Reverse Score
Alignment
Complementation?
Transgene?
Rattus norvegicus (Norway rat) (5)
10 of 14
Yes
No
9 of 14
No
Yes
6 of 14
No
Yes
5 of 14
No
No
1 of 14
No
Yes
Mus musculus (laboratory mouse) (4)
10 of 14
Yes
No
7 of 14
No
Yes
6 of 14
No
No
6 of 14
No
Yes
Xenopus tropicalis (Western clawed frog) (6)
5 of 13
Yes
No
4 of 13
No
No
3 of 13
No
No
1 of 13
No
No
1 of 13
No
No
1 of 13
No
No
Danio rerio (Zebrafish) (3)
8 of 14
Yes
No
7 of 14
No
Yes
7 of 14
No
Yes
Caenorhabditis elegans (Nematode, roundworm) (1)
1 of 14
Yes
No
Anopheles gambiae (African malaria mosquito) (8)
11 of 12
Yes
Yes
6 of 12
No
No
6 of 12
No
No
3 of 12
No
No
2 of 12
No
No
2 of 12
No
No
2 of 12
No
No
Arabidopsis thaliana (thale-cress) (0)
Saccharomyces cerevisiae (Brewer's yeast) (0)
Schizosaccharomyces pombe (Fission yeast) (0)
Escherichia coli (enterobacterium) (0)
Other Organism Orthologs (via OrthoDB)
Data provided directly from OrthoDB:PGRP-SA. Refer to their site for version information.
Paralogs
Paralogs (via DIOPT v9.1)
Drosophila melanogaster (Fruit fly) (12)
9 of 13
8 of 13
7 of 13
7 of 13
6 of 13
6 of 13
6 of 13
5 of 13
5 of 13
5 of 13
4 of 13
3 of 13
Human Disease Associations
FlyBase Human Disease Model Reports
    Disease Ontology (DO) Annotations
    Models Based on Experimental Evidence ( 0 )
    Allele
    Disease
    Evidence
    References
    Potential Models Based on Orthology ( 0 )
    Human Ortholog
    Disease
    Evidence
    References
    Modifiers Based on Experimental Evidence ( 0 )
    Allele
    Disease
    Interaction
    References
    Disease Associations of Human Orthologs (via DIOPT v9.1 and OMIM)
    Note that ortholog calls supported by only 1 or 2 algorithms (DIOPT score < 3) are not shown.
    Functional Complementation Data
    Functional complementation data is computed by FlyBase using a combination of the orthology data obtained from DIOPT and OrthoDB and the allele-level genetic interaction data curated from the literature.
    Interactions
    Summary of Physical Interactions
    Summary of Genetic Interactions
    Interaction Browsers
    Starting gene(s)
    Interaction type
    Interacting gene(s)
    Reference
    Starting gene(s)
    Interaction type
    Interacting gene(s)
    Reference
    External Data
    Linkouts
    DroID - A comprehensive database of gene and protein interactions.
    MIST (protein-protein) - An integrated Molecular Interaction Database
    Pathways
    Signaling Pathways (FlyBase)
    Metabolic Pathways
    FlyBase
    External Links
    External Data
    Linkouts
    KEGG Pathways - A collection of manually drawn pathway maps representing knowledge of molecular interaction, reaction and relation networks.
    Reactome - An open-source, open access, manually curated and peer-reviewed pathway database.
    Class of Gene
    Genomic Location and Detailed Mapping Data
    Chromosome (arm)
    X
    Recombination map
    1-35
    Cytogenetic map
    Sequence location
    FlyBase Computed Cytological Location
    Cytogenetic map
    Evidence for location
    10C6-10C6
    Limits computationally determined from genome sequence between P{EP}CG32666EP1452 and P{EP}Ptp10DEP1172
    Experimentally Determined Cytological Location
    Cytogenetic map
    Notes
    References
    Experimentally Determined Recombination Data
    Location
    Left of (cM)
    Right of (cM)
    Notes
    Stocks and Reagents
    Stocks (4)
    Genomic Clones (13)
     

    Please Note FlyBase no longer curates genomic clone accessions so this list may not be complete

    cDNA Clones (9)
     

    Please Note This section lists cDNAs and ESTs that fall within the genomic extent of the gene model, which may include cDNAs and ESTs of genes within introns, or of overlapping genes. Please see JBrowse for alignment of the cDNAs and ESTs to the gene model.

    cDNA clones, fully sequenced
    BDGP DGC clones
    Other clones
    Drosophila Genomics Resource Center cDNA clones

    For each fully sequenced cDNA the DGRC maintains various forms of the cDNA (e.g tagged or untagged) in several different host vectors for subsequent cloning and expression in Drosophila and Drosophila cell lines.

    cDNA Clones, End Sequenced (ESTs)
    BDGP DGC clones
      RNAi and Array Information
      Linkouts
      DRSC - Results frm RNAi screens
      Antibody Information
      Laboratory Generated Antibodies
       

      polyclonal

      Commercially Available Antibodies
       
      Cell Line Information
      Publicly Available Cell Lines
       
        Other Stable Cell Lines
         
          Other Comments

          PGRP-SA functions upstream of spe, sphinx2, sphinx1, spheroide, grass, and spirit in the spz-Tl pathway.

          PGRP-SA binds to specific peptidoglycans and shows elicitor specificity for bacteria with a peptidoglycan strucuture contain a Lys in the third position of the crosslinking tetrapeptide.

          PGRP-SA and GNBP1 act in a common pathway, upstream of spz, in raising an immune response to Gram-positive bacteria.

          Identified as a gene with significant level of mRNA cycling as assessed by expression analysis using high density oligonucleotide arrays with probe generated from adult heads harvested over six time points over the course of a day. Shows alteration in expression in a Clk mutant background.

          PGRP-SA has a role in activating Tl in Gram-positive bacterial infection.

          Relationship to Other Genes
          Source for database merge of

          Source for merge of: PGRP-SA BcDNA:AT30827

          Additional comments

          Source for merge of PGRP-SA BcDNA:AT30827 was TrEMBL update ( date:020807 ).

          Source for identity of PGRP-SA CG11709 was sequence comparison ( date:001020 ).

          Nomenclature History
          Source for database identify of

          Source for identity of: PGRP-SA seml

          Source for identity of: PGRP-SA CG11709

          Nomenclature comments
          Etymology
          Synonyms and Secondary IDs (17)
          Reported As
          Symbol Synonym
          BcDNA:AT30827
          PGRP-SA
          (Brutscher et al., 2025, Dillard et al., 2025, He et al., 2025, Kang et al., 2025, Qin et al., 2025, Rommelaere et al., 2025, Arias-Rojas et al., 2024, Davis et al., 2024, Fioriti et al., 2024, Huang et al., 2024, Li et al., 2024, Meyer et al., 2024, Nigg et al., 2024, Zhou et al., 2024, Gupta et al., 2023, Heigwer et al., 2023, Keith, 2023, Khan et al., 2023, Krejčová et al., 2023, Meng et al., 2023, Stączek et al., 2023, Wang et al., 2023, Xu et al., 2023, Bahuguna et al., 2022, Chen et al., 2022, Deshpande et al., 2022, Hultmark and Andó, 2022, Marshall and Dionne, 2022, Nayak and Mishra, 2022, Pavlidaki et al., 2022, Wang et al., 2022, Waring et al., 2022, Wu et al., 2022, Hall et al., 2021, Ozakman and Eleftherianos, 2021, Ågren et al., 2020, Alameh et al., 2020, Arora and Ligoxygakis, 2020, Benoit et al., 2020, Gilbert et al., 2020, Lu et al., 2020, Nishihara, 2020, Swanson et al., 2020, Younes et al., 2020, Zhou et al., 2020, Dudzic et al., 2019, Hall et al., 2019, Sanuki et al., 2019, Troha et al., 2019, Vaz et al., 2019, Nie et al., 2018, Troha et al., 2018, Wang et al., 2018, Patrnogic and Leclerc, 2017, Toggweiler et al., 2016, Katsuyama et al., 2015, Kavi et al., 2015, Yamamoto-Hino et al., 2015, Arefin et al., 2014, Atilano et al., 2014, Bonnay et al., 2014, Guo et al., 2014, Keebaugh and Schlenke, 2014, Kurata, 2014, Lindsay and Wasserman, 2014, Myllymäki et al., 2014, Panayidou et al., 2014, Péan and Dionne, 2014, Salazar-Jaramillo et al., 2014, Ferrandon, 2013, Gendrin et al., 2013, Kingsolver et al., 2013, Marianes and Spradling, 2013, Mbodj et al., 2013, Merkling and van Rij, 2013, Short and Lazzaro, 2013, Doroszuk et al., 2012, Eleftherianos and Castillo, 2012, Igboin et al., 2012, Kounatidis and Ligoxygakis, 2012, Petersen et al., 2012, Atilano et al., 2011, Bou Aoun et al., 2011, Chtarbanova and Imler, 2011, Fullaondo et al., 2011, Gerbasi et al., 2011, Marcu et al., 2011, Nehme et al., 2011, Schmidt et al., 2011, Valanne et al., 2011, Wang et al., 2011, Yano and Kurata, 2011, Aymeric et al., 2010, Hill-Burns and Clark, 2010, Kurata, 2010, Buchon et al., 2009, Buchon et al., 2009, Cronin et al., 2009, Geuking et al., 2009, Hill-Burns and Clark, 2009, Davis et al., 2008, El Chamy et al., 2008, Gilchrist et al., 2008, Lee and Edery, 2008, Leone et al., 2008, Sathyanarayanan et al., 2008, Straub et al., 2008, Tsai et al., 2008, Wagner et al., 2008, Wang and Ligoxygakis, 2008, Wang et al., 2008, Zhou et al., 2008, Beckstead et al., 2007, Bidla et al., 2007, Muse et al., 2007, Pal et al., 2007, Park et al., 2007, Pham et al., 2007, Quintin et al., 2007, Akira et al., 2006, Bischoff et al., 2006, Gottar et al., 2006, Jang et al., 2006, Kambris et al., 2006, Kambris et al., 2006, Pelte et al., 2006, Wang et al., 2006, Filipe et al., 2005, Johansson et al., 2005, Bischoff et al., 2004, Reiser et al., 2004, Roxstrom-Lindquist et al., 2004, Gobert et al., 2003, Jiggins and Hurst, 2003, Leulier et al., 2003, Mellroth et al., 2003, Christophides et al., 2002)
          Name Synonyms
          PG Recognition Protein-SA
          Peptidoglycan recognition protein SA
          Peptidoglycan recognition protein-SA
          Peptidoglycan-recognition protein-SA precursor
          Protein semmelweis
          peptidoglycan recognition protein SA
          semmelweiss
          Secondary FlyBase IDs
          • FBgn0061434
          • FBgn0064989
          Datasets (0)
          Study focus (0)
          Experimental Role
          Project
          Project Type
          Title
          Study result (0)
          Result
          Result Type
          Title
          External Crossreferences and Linkouts ( 68 )
          Sequence Crossreferences
          NCBI Gene - Gene integrates information from a wide range of species. A record may include nomenclature, Reference Sequences (RefSeqs), maps, pathways, variations, phenotypes, and links to genome-, phenotype-, and locus-specific resources worldwide.
          GenBank Nucleotide - A collection of sequences from several sources, including GenBank, RefSeq, TPA, and PDB.
          GenBank Protein - A collection of sequences from several sources, including translations from annotated coding regions in GenBank, RefSeq and TPA, as well as records from SwissProt, PIR, PRF, and PDB.
          RefSeq - A comprehensive, integrated, non-redundant, well-annotated set of reference sequences including genomic, transcript, and protein.
          UniProt/GCRP - The gene-centric reference proteome (GCRP) provides a 1:1 mapping between genes and UniProt accessions in which a single 'canonical' isoform represents the product(s) of each protein-coding gene.
          UniProt/Swiss-Prot - Manually annotated and reviewed records of protein sequence and functional information
          UniProt/TrEMBL - Automatically annotated and unreviewed records of protein sequence and functional information
          Other crossreferences
          AlphaFold DB - AlphaFold provides open access to protein structure predictions for the human proteome and other key proteins of interest, to accelerate scientific research.
          DRscDB - A single-cell RNA-seq resource for data mining and data comparison across species
          EMBL-EBI Single Cell Expression Atlas - Single cell expression across species
          FlyAtlas2 - A Drosophila melanogaster expression atlas with RNA-Seq, miRNA-Seq and sex-specific data
          FlyMine - An integrated database for Drosophila genomics
          KEGG Genes - Molecular building blocks of life in the genomic space.
          MARRVEL_MODEL - MARRVEL (model organism gene)
          PDB - An information portal to biological macromolecular structures
          Linkouts
          Drosophila Genomics Resource Center - Drosophila Genomics Resource Center (DGRC) cDNA clones
          DroID - A comprehensive database of gene and protein interactions.
          DRSC - Results frm RNAi screens
          Eukaryotic Promoter Database - A collection of databases of experimentally validated promoters for selected model organisms.
          FlyAtlas - Adult expression by tissue, using Affymetrix Dros2 array
          FlyCyc Genes - Genes from a BioCyc PGDB for Dmel
          Fly-FISH - A database of Drosophila embryo and larvae mRNA localization patterns
          Flygut - An atlas of the Drosophila adult midgut
          FlyMet - A comprehensive tissue-specific metabolomics resource for Drosophila.
          iBeetle-Base - RNAi phenotypes in the red flour beetle (Tribolium castaneum)
          KEGG Pathways - A collection of manually drawn pathway maps representing knowledge of molecular interaction, reaction and relation networks.
          MIST (protein-protein) - An integrated Molecular Interaction Database
          Reactome - An open-source, open access, manually curated and peer-reviewed pathway database.
          References (219)