Multi-scale chromatin state annotation using a hierarchical hidden Markov model.

Nat Commun
Authors
Abstract

Chromatin-state analysis is widely applied in the studies of development and diseases. However, existing methods operate at a single length scale, and therefore cannot distinguish large domains from isolated elements of the same type. To overcome this limitation, we present a hierarchical hidden Markov model, diHMM, to systematically annotate chromatin states at multiple length scales. We apply diHMM to analyse a public ChIP-seq data set. diHMM not only accurately captures nucleosome-level information, but identifies domain-level states that vary in nucleosome-level state composition, spatial distribution and functionality. The domain-level states recapitulate known patterns such as super-enhancers, bivalent promoters and Polycomb repressed regions, and identify additional patterns whose biological functions are not yet characterized. By integrating chromatin-state information with gene expression and Hi-C data, we identify context-dependent functions of nucleosome-level states. Thus, diHMM provides a powerful tool for investigating the role of higher-order chromatin structure in gene regulation.

Year of Publication
2017
Journal
Nat Commun
Volume
8
Pages
15011
Date Published
2017 Apr 07
ISSN
2041-1723
DOI
10.1038/ncomms15011
PubMed ID
28387224
PubMed Central ID
PMC5385569
Links
Grant list
K25 HL133599 / HL / NHLBI NIH HHS / United States
R01 HL119099 / HL / NHLBI NIH HHS / United States
R21 HG006778 / HG / NHGRI NIH HHS / United States