Proceeding talk – Theme: Genome.
Abstract
In this work we develop a robust algorithmic framework that takes two measurement-derived datasets, raw Hi-C and partial short-range haplotypes, and constructs the full-genome haplotype as well as phased diploid Hi-C maps. By analyzing both datasets together, we bridge important gaps in both technologies: from short to long haplotypes and from unphased to phased Hi-C. We demonstrate that our method can recover ground-truth haplotypes with high accuracy, using measured biological data as well as simulated data. We analyze the impact of noise, Hi-C sequencing depth, and measured haplotype lengths on performance. Finally, we use the inferred 3D structure of a human genome to point to nuclear co-localization of transcription factor targets.
Authors
Shay Ben-Elazar, Tel Aviv University, Israel
Benny Chor, Tel Aviv University, Israel
Zohar Yakhini, Technion – Israel Institute of Technology, Israel
