Statistical Identification of Co-regulatory Gene Modules using Multiple ChIP-Seq Experiments

Xi Chen, Xu Shi, Ayesha N. Shajahan-Haq, Leena Hilakivi-Clarke, Robert Clarke, Jianhua Xuan

2014

Abstract

ChIP-Seq experiments provide accurate measurements of the regulatory roles of transcription factors (TFs) under specific condition. Downstream target genes can be detected by analyzing the enriched TF binding sites (TFBSs) in genes’ promoter regions. The location and statistical information of TFBSs make it possible to evaluate the relative importance of each binding. Based on the assumption that the TFBSs of one ChIP-Seq experiment follow the same specific location distribution, a statistical model is first proposed using both location and significance information of peaks to weigh target genes. With genes’ binding scores from different TFs, we merge them into a weighted binding matrix. A Markov Chain Monte Carlo (MCMC) based approach is then applied to the binding matrix for co-regulatory module identification. We demonstrate the efficiency of our statistical model on an ER-α ChIP-Seq dataset and further identify co-regulatory modules by using eleven breast cancer related TFs from ENCODE ChIP-Seq datasets. The results show that the TFs in individual module regulate common high score target genes; the association of TFs is biologically meaningful, and the functional roles of TFs and target genes are consistent.

Download


Paper Citation


in Harvard Style

Chen X., Shi X., N. Shajahan-Haq A., Hilakivi-Clarke L., Clarke R. and Xuan J. (2014). Statistical Identification of Co-regulatory Gene Modules using Multiple ChIP-Seq Experiments . In Proceedings of the International Conference on Bioinformatics Models, Methods and Algorithms - Volume 1: BIOINFORMATICS, (BIOSTEC 2014) ISBN 978-989-758-012-3, pages 109-116. DOI: 10.5220/0004736801090116

in Bibtex Style

@conference{bioinformatics14,
author={Xi Chen and Xu Shi and Ayesha N. Shajahan-Haq and Leena Hilakivi-Clarke and Robert Clarke and Jianhua Xuan},
title={Statistical Identification of Co-regulatory Gene Modules using Multiple ChIP-Seq Experiments},
booktitle={Proceedings of the International Conference on Bioinformatics Models, Methods and Algorithms - Volume 1: BIOINFORMATICS, (BIOSTEC 2014)},
year={2014},
pages={109-116},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0004736801090116},
isbn={978-989-758-012-3},
}


in EndNote Style

TY - CONF
JO - Proceedings of the International Conference on Bioinformatics Models, Methods and Algorithms - Volume 1: BIOINFORMATICS, (BIOSTEC 2014)
TI - Statistical Identification of Co-regulatory Gene Modules using Multiple ChIP-Seq Experiments
SN - 978-989-758-012-3
AU - Chen X.
AU - Shi X.
AU - N. Shajahan-Haq A.
AU - Hilakivi-Clarke L.
AU - Clarke R.
AU - Xuan J.
PY - 2014
SP - 109
EP - 116
DO - 10.5220/0004736801090116