FinaleToolkit: Accelerating Cell-Free DNA Fragmentation Analysis with a High-Speed Computational Toolkit
Confirmed Presenter: James W. Li, Department of Computer Science, Wake Forest University, Winston-Salem, NC, USA, United States
Room: 520a
Format: In person
Authors List: Show
- James W. Li, Department of Computer Science, Wake Forest University, Winston-Salem, NC, USA, United States
- Ravi Bandaru, Dept. of Biochemistry and Molecular Genetics, Feinberg School of Medicine, Northwestern University, Chicago, IL, USA, United States
- Yaping Liu, Dept. of Biochemistry and Molecular Genetics, Feinberg School of Medicine, Northwestern University, Chicago, IL, USA, United States
Presentation Overview: Show
The fragmentation pattern of cell-free DNA (cfDNA) represents a promising non-invasive biomarker for disease diagnosis and prognosis. Numerous fragmentation features, such as end motif and window protection score (WPS), have been characterized in cfDNA genomic sequencing. However, the analytical tools developed in these studies are often not released to the liquid biopsy community or are inefficient for processing large datasets. To address this gap, we have developed FinaleToolkit, a fast and memory-efficient Python package designed to generate comprehensive fragmentation features from large cfDNA genomic sequencing data. For instance, FinaleToolkit can generate genome-wide WPS features from a ~100X cfDNA whole-genome sequencing dataset in 74 minutes using 16 CPU cores and 49 GB of memory, offering up to a ~30-fold increase in processing speed compared to original implementations. We have benchmarked FinaleToolkit against existing studies or implementations where possible, confirming its efficacy. Furthermore, FinaleToolkit is open source and thoroughly documented with both command line interface and Python application programming interface (API) to facilitate its widespread adoption and use within the research community: https://github.com/epifluidlab/FinaleToolkit
Development and Application of the MultiSEp R Package to Identify Multiple Myeloma Achilles' Heels for Drug Discovery
Confirmed Presenter: Adeline McKie, Patrick G Johnston Centre for Cancer Research, Queen’s University Belfast; Belfast, Northern Ireland, United Kingdom
Room: 520a
Format: In person
Authors List: Show
- Adeline McKie, Patrick G Johnston Centre for Cancer Research, Queen’s University Belfast; Belfast, Northern Ireland, United Kingdom
- Mark Wappett, Almac Discovery; Belfast, Northern Ireland, United Kingdom
- Benayu Priyanto, Patrick G Johnston Centre for Cancer Research, Queen’s University Belfast; Belfast, Northern Ireland, United Kingdom
- Hans Vandierendonck, School of Electronics, Electrical Engineering and Computer Science, Queen’s University Belfast; Belfast, United Kingdom
- Ian Overton, Patrick G Johnston Centre for Cancer Research, Queen’s University Belfast; Belfast, Northern Ireland, United Kingdom
Presentation Overview: Show
Almost all Multiple Myeloma (MM) patients relapse and ultimately succumb to therapy-resistant disease; there is an urgent need for more effective treatment. Achilles' heel relationships arise when the status of one gene exposes a cell's vulnerability to perturbation of a second gene, such as chemical inhibition, providing opportunities for precision oncology.
We developed MultiSEp for integrative discovery of candidate gene dependency relationships in multiomics data. We predicted MM GDRs at genome-scale (27,232 genes, 370,777,296 candidate interactions) using transcriptomic data from the Multiple Myeloma Research Foundation (MMRF) CoMMpass study (n=928 patients). Filtering steps to derive a high-confidence synthetic lethal network (SynLethNet) included predicting characteristic mutual exclusive loss patterns (q<0.05). We predicted the population coverage achieved by drugging SynLethNet genes, and the impact of deleterious mutations. Our analysis only utilised deleterious mutations predicted by the variant effect prediction tools, SNPeff and SNPsift (annotated ‘high-impact’ mutations).
We characterised GDRs in the CoMMpass cohort and derived a high confidence predicted synthetic lethal network (1,466 genes, 5,245 edges; SynLethNet). Functional annotation of SynLethNet revealed many genes involved in the ubiquitin-proteasome system, which is dysregulated in MM and a target of current front-line therapy. Predictions were validated with the Cancer Dependency Map and the Cancer Therapeutics Response database.
We present the MultiSEp R package, demonstrated with a case study in Multiple Myeloma where we predict candidate drug targets and provide mechanistic insights to advance precision oncology.
Bridging Education and Research: Data Hunters Workshop Empowering Bioinformatics Education via Microbiome Studies
Room: 520a
Format: In person
Authors List: Show
- Sara Fumagalli, University of Milano-Bicocca, Italy
- Giulia Ghisleni, University of Milano-Bicocca, Italy
- Alice Armanni, University of Milano-Bicocca, Italy
- Luca Corneo, University of Milano-Bicocca, Italy
- Maurizio Casiraghi, University of Milano-Bicocca, Italy
- Antonia Bruno, University of Milano-Bicocca, Italy
Presentation Overview: Show
Ensuring access to the bioinformatics field shall be on the agenda of life sciences degrees. This especially applies to biology-related degrees, where future biologists often ignore its existence due to the scarcity of dedicated courses.
We present our contribution to bioinformatics education, using the Data Hunters Workshop as a case study. Kicked off on February 28th, Data Hunters constitutes an ongoing student-science activity for students of the University of Milano-Bicocca, particularly those in the Biotechnology and Biosciences Department. Combining educational engagement with scientific research, this initiative enables students to tackle the key issue of metadata standardization’s lack in metagenomics, while supporting their learning. We provided a 6-hour lecture and an autonomous hands-on phase, structured as a learn-and-play activity with in-house built online educational and command-line resources. Thus, 29 students gained the fundamentals of metagenomics and Python language. Thanks to these tools, they have now stepped into the role of bioinformaticians, actively curating metadata from 379 amplicon-based and shotgun sequencing projects of the human skin microbiome to collaboratively create a curated metadata collection. Upon the workshop’s conclusion, we will assess the efficacy of our activity via surveys and standardized evaluation scales.
Our workshop represents the effort to bridge educational and research aims, including students as bioinformaticians of the future. As a reflection of the synergy of our objectives, the workshop's outcomes will include significant educational impacts that will lead to the development of a collaborative curated collection of human skin microbiome metadata, alongside the advancement of bioinformatics dissemination.
Seven Domain Topics in Bioinformatics Education - Refining the ISCB Core Competencies to Access Diversity in Training
Confirmed Presenter: Nilson Da Rocha Coimbra, Laboratorio de Bacteriologia, Faculdade de Medicina, Universidade de São Paulo, São Paulo, Brazil, Brazil
Room: 520a
Format: In person
Authors List: Show
- Nilson Da Rocha Coimbra, Laboratorio de Bacteriologia, Faculdade de Medicina, Universidade de São Paulo, São Paulo, Brazil, Brazil
- Bernardo Velozo, Programa de Pós Graduação em Bioquímica, UFRJ, Rio de Janeiro, Brazil, Brazil
- Clara Carvalho, Programa Interunidades de Pós-graduação em Bioinformática, Universidade de São Paulo, São Paulo, Brazil, Brazil
- Lucas Aleixo Leal Pedroza, Programa de Pós-graduação em Biologia Aplicada à Saúde, Instituto Keizo Asami, UFPE, Recife, Brazil, Brazil
- Emerson Danzer, Programa Interunidades de Pós-graduação em Bioinformática, UFMG, Belo Horizonte, Minas Gerais, Brazil, Brazil
- Sandy Ingrid Aguiar Alves, Programa de Pós-Graduação em Biologia de Agentes Infecciosos e Parasitários, UFPA, Belém, Pará, Brazil, Brazil
- Rayssa Feitosa, The Hospital for Sick Children, Genetics and Genome Biology Department, Toronto, Canada, Canada
- Maira Neves, Programa Interunidades de Pós-graduação em Bioinformática, Universidade de São Paulo, São Paulo, Brazil, Brazil
- Bibiana Fam, Hospital de Clínicas de Porto Alegre, Porto Alegre, Rio Grande do Sul, Brazil, Brazil
Presentation Overview: Show
The ISCB Regional Student Group of Brazil (RSG-Brazil) is at the forefront of promoting bioinformatics and computational biology education in Brazil. In 2019, RSG-Brazil launched its Educational Committee (EduComm) with a dual mission: to develop educational materials in Portuguese and to assess the efficacy of bioinformatics training in the country. Leveraging the ISCB core competency framework 3.0, the EduComm devised a novel training model to evaluate confidence in various technical aspects of bioinformatics. This model encompasses seven domain topics crucial for bioinformatics education: Biology, Statistics, Computer Science, Ethics, Bioinformatics Applications, Communication, and Professional Development. To gauge the educational needs of the Brazilian bioinformatics community, a survey was conducted in November 2023, collected 375 responses from across 21 states, predominantly from academia. Notably, the majority of respondents identified themselves as Bioinformatics Users, with a significant representation from Undergraduate and Graduate students. The survey revealed regional and profile-specific interests in Bioinformatics Topics, providing valuable insights for curriculum development. Through the efforts of EduComm, RSG-Brazil aims to tailor educational courses to meet the diverse needs of Brazil's computational biology student community. This work presents the findings from the survey conducted by RSG-Brazil's EduComm and highlights the importance of localized educational initiatives in advancing bioinformatics education on a global scale.