Proceedings Presentation: Leveraging Transcription Factor Physical Proximity for Enhancing Gene Regulation Inference
Confirmed Presenter: Yijie Wang, School of Informatics and Computing, Indiana University, Bloomington, IN 47408, United States
Room: 02F
Format: In person
Moderator(s): Anthony Mathelier
Authors List: Show
- Xiaoqing Huang, Department of Biostatistics and Health Data Science School of Medicine, Indiana University, United States
- Aamir Raza Muneer Ahemad Hullur, School of Informatics and Computing, Indiana University, Bloomington, IN 47408, United States
- Elham Jafari, INDIANA UNIVERSITY, United States
- Kaushik Shridhar, School of Informatics and Computing, Indiana University, Bloomington, IN 47408, United States
- Mu Zhou, Rutgers University, United States
- Kenneth Mackie, Indiana University Bloomington, United States
- Kun Huang, Indiana University School of Medicine, United States
- Yijie Wang, School of Informatics and Computing, Indiana University, Bloomington, IN 47408, United States
Presentation Overview: Show
Motivation: Gene regulation inference, a key challenge in systems biology, is crucial for understanding cell function, as it governs processes such as differentiation, cell state maintenance, signal transduction, and stress response. Leading methods utilize gene expression, chromatin accessibility, Transcription Factor (TF) DNA binding motifs, and prior knowledge. However, they overlook the fact that TFs must be in physical proximity to facilitate transcriptional gene regulation.
Results: To fill the gap, we develop GRIP – Gene Regulation Inference by considering TF Proximity – a gene regulation inference method that directly considers the physical proximity between regulating TFs. Specifically, we use the distance in a protein-protein interaction (PPI) network to estimate the physical proximity between TFs. We design a novel Boolean convex program, which can identify TFs that not only can explain the gene expression of target genes (TGs) but also stay close in the PPI network. We propose an efficient algorithm to solve the Boolean relaxation of the proposed model with a theoretical tightness guarantee. We compare our GRIP with state-of-the-art methods (SCENIC+, DirectNet, Pando, and CellOracle) on inferring cell-type-specific (CD4, CD8, and CD 14) gene regulation using the PBMC 3k scMultiome-seq data and demonstrate its out-performance in terms of the predictive power of the inferred TFs, the physical distance between the inferred TFs, and the agreement between the inferred gene regulation and PCHiC ground-truth data.