Bookmark

De novo motif identification improves the accuracy of predicting transcription factor binding sites in ChIP-Seq data analysis

Boeva, Valentina; Surdez, Didier; Guillon, Noëlle; Tirode, Franck; Fejes, Anthony P.; Delattre, Olivier; Barillot, Emmanuel
Nucleic Acids Research , Volume 38 (11): e126 Oxford University PressJun 1, 2010

Preview Only

De novo motif identification improves the accuracy of predicting transcription factor binding sites in ChIP-Seq data analysis

Abstract

Abstract Dramatic progress in the development of next-generation sequencing technologies has enabled accurate genome-wide characterization of the binding sites of DNA-associated proteins. This technique, baptized as ChIP-Seq, uses a combination of chromatin immunoprecipitation and massively parallel DNA sequencing. Other published tools that predict binding sites from ChIP-Seq data use only positional information of mapped reads. In contrast, our algorithm MICSA (Motif Identification for ChIP-Seq Analysis) combines this source of positional information with information on motif occurrences to better predict binding sites of transcription factors (TFs). We proved the greater accuracy of MICSA with respect to several other tools by running them on datasets for the TFs NRSF, GABP, STAT1 and CTCF. We also applied MICSA on a dataset for the oncogenic TF EWS-FLI1. We discovered >2000 binding sites and two functionally different binding motifs. We observed that EWS-FLI1 can activate gene transcription when (i) its binding site is located in close proximity to the gene transcription start site (up to ∼150 kb), and (ii) it contains a microsatellite sequence. Furthermore, we observed that sites without microsatellites can also induce regulation of gene expression—positively as often as negatively—and at much larger distances (up to ∼1 Mb). © The Author(s) 2010. Published by Oxford University Press. This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/2.5), which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited.
Loading next page...
1 Page

Preview Only. This article cannot be rented because we do not currently have permission from the publisher.

 
/lp/oxford-university-press/de-novo-motif-identification-improves-the-accuracy-of-predicting-i0kc00GmO1
Title
De novo motif identification improves the accuracy of predicting transcription factor binding sites in ChIP-Seq data analysis
Author(s)
Boeva, Valentina; Surdez, Didier; Guillon, Noëlle; Tirode, Franck; Fejes, Anthony P.; Delattre, Olivier; Barillot, Emmanuel
Journal
Nucleic Acids Research , Volume 38 (11): e126 Oxford University Press – Jun 1, 2010
Publisher
Oxford University Press
Copyright
Copyright © 2010 Oxford University Press
ISSN
0305-1048
eISSN
1362-4962
D.O.I.
10.1093/nar/gkq217
Publisher site
Get PDF