Aerts et al., 2007, PLoS ONE 2(11): e1115

From FlyBase Wiki
Jump to: navigation, search
Aerts et al., 2007, PLoS ONE 2(11): e1115
FlyBase Identifier FBrf0210256
FlyBase URL http://flybase.org/reports/FBrf0210256.html
Publication Type paper
Publication Year 2007
PubMed ID 17973026
PubMed URL http://www.ncbi.nlm.nih.gov/pubmed/17973026

Title

Fine-tuning enhancer models to predict transcriptional targets across multiple genomes.

Abstract

Networks of regulatory relations between transcription factors (TF) and their target genes (TG)- implemented through TF binding sites (TFBS)- are key features of biology. An idealized approach to solving such networks consists of starting from a consensus TFBS or a position weight matrix (PWM) to generate a high accuracy list of candidate TGs for biological validation. Developing and evaluating such approaches remains a formidable challenge in regulatory bioinformatics. We perform a benchmark study on 34 Drosophila TFs to assess existing TFBS and cis-regulatory module (CRM) detection methods, with a strong focus on the use of multiple genomes. Particularly, for CRM-modelling we investigate the addition of orthologous sites to a known PWM to construct phyloPWMs and we assess the added value of phylogenentic footprinting to predict contextual motifs around known TFBSs. For CRM-prediction, we compare motif conservation with network-level conservation approaches across multiple genomes. Choosing the optimal training and scoring strategies strongly enhances the performance of TG prediction for more than half of the tested TFs. Finally, we analyse a 35(th) TF, namely Eyeless, and find a significant overlap between predicted TGs and candidate TGs identified by microarray expression studies. In summary we identify several ways to optimize TF-specific TG predictions, some of which can be applied to all TFs, and others that can be applied only to particular TFs. The ability to model known TF-TG relations, together with the use of multiple genomes, results in a significant step forward in solving the architecture of gene regulatory networks.

Genes from Reference

Gene(s) Dmel\Antp
Personal tools
Namespaces

Variants
Actions
Navigation
Toolbox