Prediction and functional interpretation of inter-chromosomal genome architecture from DNA sequence with TwinC.

Nature communications
Authors
Abstract

Three-dimensional nuclear DNA architecture comprises well-studied intra-chromosomal (cis) folding and less characterized inter-chromosomal (trans) interfaces. Current predictive models of 3D genome folding overlook trans-genome organization. We present TwinC, an interpretable convolutional neural network model that reliably predicts trans contacts measurable through proximity ligation-dependent (in situ and intact Hi-C) and independent (DNA SPRITE) genome-wide chromatin conformation assays. TwinC achieves high predictive accuracy (AUROC=0.80) on a cross-chromosomal test set from in situ and intact Hi-C experiments in heart tissue. Furthermore, we train TwinC using in situ Hi-C data from the widely used GM12878 cell line and validate its performance with orthogonal DNA SPRITE assay in the same cell type. Mechanistically, the neural network learns the importance of compartments, chromatin accessibility, clustered transcription factor binding, and G-quadruplexes in forming trans contacts. In summary, TwinC models and interprets trans genome architecture, illuminating this poorly understood aspect of gene regulation.

Year of Publication
2026
Journal
Nature communications
Date Published
04/2026
ISSN
2041-1723
DOI
10.1038/s41467-026-72031-5
PubMed ID
42009674
Links