FlyBase Reference Report: Dresch et al., 2016, Gene Regul. Syst. Bio. 10: 21--33

Reference

Citation

Dresch, J.M., Zellers, R.G., Bork, D.K., Drewell, R.A. (2016). Nucleotide Interdependency in Transcription Factor Binding Sites in the Drosophila Genome. Gene Regul. Syst. Bio. 10(): 21--33.

FlyBase ID

FBrf0232709

Publication Type

Research paper

Abstract

A long-standing objective in modern biology is to characterize the molecular components that drive the development of an organism. At the heart of eukaryotic development lies gene regulation. On the molecular level, much of the research in this field has focused on the binding of transcription factors (TFs) to regulatory regions in the genome known as cis-regulatory modules (CRMs). However, relatively little is known about the sequence-specific binding preferences of many TFs, especially with respect to the possible interdependencies between the nucleotides that make up binding sites. A particular limitation of many existing algorithms that aim to predict binding site sequences is that they do not allow for dependencies between nonadjacent nucleotides. In this study, we use a recently developed computational algorithm, MARZ, to compare binding site sequences using 32 distinct models in a systematic and unbiased approach to explore nucleotide dependencies within binding sites for 15 distinct TFs known to be critical to Drosophila development. Our results indicate that many of these proteins have varying levels of nucleotide interdependencies within their DNA recognition sequences, and that, in some cases, models that account for these dependencies greatly outperform traditional models that are used to predict binding sites. We also directly compare the ability of different models to identify the known KRUPPEL TF binding sites in CRMs and demonstrate that a more complex model that accounts for nucleotide interdependencies performs better when compared with simple models. This ability to identify TFs with critical nucleotide interdependencies in their binding sites will lead to a deeper understanding of how these molecular characteristics contribute to the architecture of CRMs and the precise regulation of transcription during organismal development.

PubMed ID

27330274

PubMed Central ID

PMC4907338 (PMC) (EuropePMC)

DOI

10.4137/GRSB.S38462

Associated Information

Comments

Associated Files

Other Information

Secondary IDs

Language of Publication

English

Additional Languages of Abstract

Parent Publication

Publication Type

Journal

Abbreviation

Gene Regul. Syst. Bio.

Title

Gene regulation and systems biology

ISBN/ISSN

1177-6250

Data From Reference

Genes (15)

bcd
cad
D
dl
ftz
gt
hb
hkb
hry
kni
Kr
prd
run
shn
slp1

Report Sections

Open Close