A Database of Drosophila Genes & Genomes

FB2013_03, released May 7th, 2013
 

Dataset mE1_TFBS_HSA

General Information
Name mE1_TFBS_HSA Species D. melanogaster
Dataset type genomic sequence feature FlyBase ID FBlc0000258
Source & Content
Consists of
Genomic sequences identified by integrative analysis of several ChIP data sets.
Created by
Available from
Strain
Stage & tissue
Stage
Tissue/Position (including subcellular localization)
Reference
Comment:TF binding profiles used in this analysis were assayed at early embryo stages.
Cell Line
hide Recent Updates
Description
What does this section display?
This section contains items that were added to this record for each release. It currently only tracks new links between this FlyBase report and other FlyBase data classes (e.g. genes, references, stocks) or controlled vocabulary terms (e.g. GO, anatomy terms).
What does this section not display?
This section does not currently display links that were removed or gene model changes.
Update Feed
Click the icon below to subscribe to this FlyBase record and receive updates automatically through your feed reader.
FB2013_03
FB2013_02
All updates Click here to see a list of all updates to this record from FB2010_08 and on.
hide Description & Members
Description
Genomic sequences identified as unique regions of transcription factor (TF) binding using HOT spot analysis (HSA); one or many TFs may bind in a given region. A synthesis of ChIP data sets for 41 different transcription factors.
Parent collections
Component collection(s)
Number in collection
Comment on number in collection
Members
hide Experimental protocol
Vector
Sample preparation
See component collections for details.
Collection preparation
See component collections for details.
Mode of assay
Assay platform
See component collections for details.
Data analysis
To investigate the co-localization of transcription factors, modENCODE ChIP-chip data sets for 25 factors were integrated with ChIP-chip data sets for an 16 additional factors produced by the Berkeley Drosophila Transcription Network Project (BDTNP, FBrf0192397, FBrf0205197, FBrf0205197). Data sets generated for the same factor were merged and the union was used for further analysis. Highly occupied target (HOT) regions were identified using a Gaussian kernel density estimation across the genome with a bandwidth of 300 bp, using the centers of each of the TF binding peaks as points. The density was then scanned for peaks, and each peak was denoted a HOT region. To determine the complexity of the HOT region, the sum of the Gaussian kernalized distance from the peak to each transcription factor that contributed at least 0.1 to peak's strength was calculated. The reported window around each HOT peak was derived by finding the maximum distance (in bp) from the HOT peak to a contributing TF, and then adding 150 bp (one half of the bandwidth). Each window is centered on the HOT peak. In those analyses in this paper which required a complexity cutoff, those binding sites with TFBS complexity more than 8 were defined as HOT regions.
hide Additional data
Associated files
Additional sites
hide Comments
HOT regions are primarily associated with open chromatin but they do not always demarcate cis-regulatory elements.
Binding profiles of 41 TFs in early embryo development were used to assign a TF complexity score to each of 38,562 distinct TF binding sites, corresponding to the number of distinct TFs bound (from 1 to ~21). Of these distinct TF binding sites, a subset of 1,962 hot regions (hotspots) had a TF complexity of eight or greater, corresponding to ~10 overlapping factors bound.
HOT spots of increasing TF complexity were strongly correlated to regions of decreased nucleosome density and increased nucleosome turnover.
This data set is available at http://www.modencode.org/publications/integrative_fly_2010/.
hide Synonyms & Secondary IDs
Reported As
Symbol Synonym
combined TFBS
HOT regions analysis
HOT spot analysis
mE1_TFBS_HSA
 
mE_Transcription_Factor_Binding_Site_Complexity
 
unique binding sites
Secondary FlyBase IDs
    hide References ( 7 )
    Research paper
    Nègre et al., 2011, Nature 471(7339): 527--531
    A cis-regulatory map of the Drosophila genome. [FBrf0213303]
    modENCODE Consortium et al., 2010, Science 330(6012): 1787--1797
    Identification of functional elements and regulatory circuits by Drosophila modENCODE. [FBrf0212741]
    Supplementary material
    Negre et al., 2011, Nature 471(7339):
    Supplementary Information. [FBrf0213507]
    The modENCODE Consortium, 2010, Science 330(6012):
    Supporting Online Material for Identification of Functional Elements and Regulatory Circuits by Drosophila modENCODE. [FBrf0213506]
    The modENCODE Consortium, 2010, Science 330(6012):
    DataS8: HOT regions. [FBrf0213505]
    The modENCODE Consortium, 2010, Science 330(6012):
    DataS7: Predicted TFBS. [FBrf0213603]
    Personal communication to FlyBase
    Ma, 2011.6.8, HOT spots analysis (Data Set S8) and 41 TFBS GFF files (Data Set S7).
    HOT spots analysis (Data Set S8) and 41 TFBS GFF files (Data Set S7). [FBrf0213928]