11:00-11:20

Gemma: Curation, re-analysis and dissemination of 18,000 gene expression studies

Confirmed Presenter: Paul Pavlidis, University of British Columbia, Canada

Room: 524ab

Format: In Person

Moderator(s): Hervé Ménager

Authors List: Show

Presentation Overview: Show

11:20-11:40

EASTR: Identifying and eliminating systematic alignment errors in multi-exon genes

Confirmed Presenter: Ida Shinder, Johns Hopkins School of Medicine, United States

Room: 524ab

Format: In Person

Moderator(s): Hervé Ménager

Authors List: Show

Presentation Overview: Show

ROC Picker: propagating statistical and systematic uncertainties in biological analyses

Confirmed Presenter: Jeffrey Roskes, Johns Hopkins University, United States

Room: 524ab

Format: In Person

Moderator(s): Hervé Ménager

Authors List: Show

Presentation Overview: Show

Djerba: Sharing and Updating a Modular System for Clinical Report Generation

Confirmed Presenter: Iain Bancarz, Ontario Institute for Cancer Research, Canada

Room: 524ab

Format: In Person

Moderator(s): Hervé Ménager

Authors List: Show

Presentation Overview: Show

Q&A For Flash Talks

Room: 524ab

Format: In person

Moderator(s): Hervé Ménager

Authors List: Show

11:40-12:00

Antimicrobial resistance prediction of nontuberculous mycobacteria from whole genome sequence data

Confirmed Presenter: Idowu Olawoye, Department of Microbiology & Immunology, University of Western Ontario, London, Ontario, Canada, Canada

Room: 524ab

Format: In Person

Moderator(s): Hervé Ménager

Authors List: Show

Presentation Overview: Show

12:00-12:20

Open2C: Advancing 3D and functional genomics research

Confirmed Presenter: Vedat Yilmaz, UMass Chan Medical School, United States

Room: 524ab

Format: In Person

Moderator(s): Hervé Ménager

Authors List: Show

Presentation Overview: Show

A Framework for DNA Binding Motifs Prediction for Nontraditional Model Organism Transcription Factors

Confirmed Presenter: Stephanie Hao, Boston University, United States

Room: 524ab

Format: In Person

Moderator(s): Hervé Ménager

Authors List: Show

Presentation Overview: Show

Bioinformatics tools for comparative genomics analysis of highly similar duplicate genes in eukaryotic genomes

Confirmed Presenter: Xi Zhang, Dalhousie University, Canada

Room: 524ab

Format: Live Stream

Moderator(s): Hervé Ménager

Authors List: Show

Presentation Overview: Show

Q&A For Flash Talks

Room: 524ab

Format: In person

Moderator(s): Hervé Ménager

Authors List: Show

14:20-15:20

Invited Presentation: The Data Shows We Need Better Data

Confirmed Presenter: Mélanie Courtot, Ontario Institute for Cancer Research and University of Toronto, Canada

Room: 524ab

Format: In Person

Moderator(s): Hervé Ménager

Authors List: Show

Presentation Overview: Show

Big data, AI, LLMs… do they live up to the hype? In a bright and hopeful future, AI accelerates progress, revolutionizes healthcare, alerts us to health risks, and creates fresh career paths. Yet, in a bleaker outlook, it obliterates jobs, fosters rampant misinformation and increases inequity.

At the root of AI is the data it relies on. In this talk we will discuss how to steer the course by improving the data AI leverages. We will explore the vast ecosystem formed by data, projects and infrastructure. We will travel along different axes to think about the data we are generating and using every day. We will consider data governance – where does it come from, who owns it, and how can we access it? We will investigate open data – how can we leverage health care knowledge for research? Finally, we will share a few thoughts about data quality and data sharing to increase reproducibility and reuse.

Dr Mélanie Courtot is the Director of Genome Informatics at the Ontario Institute for Cancer Research in Toronto, and an Assistant Professor in the Department of Medical Biophysics at University of Toronto. Dr Courtot is passionate about translational informatics – building intelligent systems to gain new insights and impact human health. Her lab aims to build a globally shared knowledge ecosystem to advance science and improve health for all. Her team develops the Overture open source software suite, which supports many active large-scale cancer genomics projects including ICGC and ICGC-ARGO, VirusSeq, and the upcoming Pan-Canadian Genome Library. It also drives the African Pathogen Data Sharing and Archive Platform.

Dr Courtot obtained her PhD in Bioinformatics from the University of British Columbia in 2014, followed by a postdoctoral fellowship in Public Health. Dr Courtot co-leads the Clinical and Phenotypic workstream and Data Use and Cohort representation groups for the Global Alliance for Genomics and Health (GA4GH) as well as cohort harmonization efforts for the International HundredK+ Cohorts Consortium. She is an advisory board member for the Public Health Alliance for Genomic Epidemiology coalition, European Open Science Cloud for Cancer project and the eLwazi open data science platform.

15:20-15:40

Creating an open-source data platform.

Confirmed Presenter: Mitchell Shiell, Ontario Institute for Cancer Research (OICR), Canada

Room: 524ab

Format: In Person

Moderator(s): Monica Munoz-Torres

Authors List: Show

Presentation Overview: Show

15:40-16:00

Going Viral: The Development of the VirusSeq Data Portal

Confirmed Presenter: Justin Richardsson, Ontario Institute for Cancer Research (OICR), Canada

Room: 524ab

Format: In Person

Moderator(s): Monica Munoz-Torres

Authors List: Show

Presentation Overview: Show

intermine.bio2rdf.org : A QLever SPARQL endpoint for InterMine databases

Confirmed Presenter: François Belleau, Arnaud Droit Computational Laboratory, Canada

Room: 524ab

Format: In Person

Moderator(s): Monica Munoz-Torres

Authors List: Show

Presentation Overview: Show

Organizing community curation to create an Open database on Thermodynamics of Enzyme-Catalyzed Reactions (openTECR)

Confirmed Presenter: Robert T. Giessmann, Institute for Globally Distributed Open Research and Education, igdore.org, Germany

Room: 524ab

Format: Live Stream

Moderator(s): Monica Munoz-Torres

Authors List: Show

Presentation Overview: Show

Q&A For Flash Talks

Room: 524ab

Format: In person

Moderator(s): Monica Munoz-Torres

Authors List: Show

16:40-17:00

Connecting Integrated Genome Browser to a huge genome database using its open API solves one problem and creates another

Confirmed Presenter: Ann Loraine, University of North Carolina Charlotte, United States

Room: 524ab

Format: In Person

Moderator(s): Karsten Hokamp

Authors List: Show

Presentation Overview: Show

17:00-17:20

Collaborating our way to optimal integration between Tripal 4 and JBrowse 2

Confirmed Presenter: Carolyn T. Caron, University of Saskatchewan, Canada

Room: 524ab

Format: In Person

Moderator(s): Karsten Hokamp

Authors List: Show

Presentation Overview: Show

An integrated environment for browsing 3-D protein structures and multiple sequence alignments in JBrowse 2

Confirmed Presenter: Colin Diesh, University of California, Berkeley, United States

Room: 524ab

Format: In Person

Moderator(s): Karsten Hokamp

Authors List: Show

Presentation Overview: Show

iCn3D, a Platform to Integrate Structures with Functions and Genomics

Confirmed Presenter: Jiyao Wang, NIH/NLM/NCBI, United States

Room: 524ab

Format: In Person

Moderator(s): Karsten Hokamp

Authors List: Show

Presentation Overview: Show

Q&A For Flash Talks

Room: 524ab

Format: In person

Moderator(s): Karsten Hokamp

Authors List: Show

17:20-17:40

Codefair: Make Biomedical Research Software FAIR Without Breaking a Sweat

Confirmed Presenter: Bhavesh Patel, FAIR Data Innovations Hub, California Medical Innovations Institute, United States

Room: 524ab

Format: In Person

Moderator(s): Swapnil Savant

Authors List: Show

Presentation Overview: Show

An Open-source Ecosystem For Scalable And Computationally Efficient Nanopore Data Processing

Confirmed Presenter: Hasindu Gamaarachchi, University of New South Wales, Australia

Room: 524ab

Format: In Person

Moderator(s): Swapnil Savant

Authors List: Show

Presentation Overview: Show

GenomeKit, a Python library for fast and easy access to genomic resources

Confirmed Presenter: Avishai Weissberg, Deep Genomics, Canada

Room: 524ab

Format: In Person

Moderator(s): Swapnil Savant

Authors List: Show

Presentation Overview: Show

Q&A For Flash Talks

Room: 524ab

Format: In person

Moderator(s): Swapnil Savant

Authors List: Show

17:40-18:00

Tataki: Enhancing the robustness of bioinformatics workflows with simple, tolerant file format detection

Confirmed Presenter: Masaki Fukui, Sator, Inc., Japan

Room: 524ab

Format: In Person

Moderator(s): Swapnil Savant

Authors List: Show

Presentation Overview: Show

Arvados Project Update

Confirmed Presenter: Peter Amstutz, Curii Corporation, United States

Room: 524ab

Format: In Person

Moderator(s): Swapnil Savant

Authors List: Show

Presentation Overview: Show

BiocPy: Facilitate Bioconductor Workflows in Python

Confirmed Presenter: Jayaram Kancherla, Genentech, United States

Room: 524ab

Format: Live Stream

Moderator(s): Swapnil Savant

Authors List: Show

Presentation Overview: Show

Q&A For Flash Talks

Room: 524ab

Format: In person

Moderator(s): Swapnil Savant

Authors List: Show

8:40-9:00

Enhancing Reproducibility in Immunogenetics: Leveraging Containerization Technology for Bioinformatics Workflows

Confirmed Presenter: Rayo Suseno, UCSF, United States

Room: 524ab

Format: Live Stream

Moderator(s): Jason Williams

Authors List: Show

Presentation Overview: Show

9:00-9:20

Breaking the silo: composable bioinformatics through cross-disciplinary open standards

Confirmed Presenter: Nezar Abdennur, UMass Chan Medical School, United States

Room: 524ab

Format: In Person

Moderator(s): Jason Williams

Authors List: Show

Presentation Overview: Show

9:20-9:40

For long-term sustainable software in bioinformatics: a manifesto

Confirmed Presenter: Luis Pedro Coelho, Queensland University of Technology, Australia

Room: 524ab

Format: In Person

Moderator(s): Jason Williams

Authors List: Show

Presentation Overview: Show

BioCompute: A Descriptive Standard for Computable Metadata

Confirmed Presenter: Jonathon Keeney, The George Washington university, United States

Room: 524ab

Format: In Person

Moderator(s): Jason Williams

Authors List: Show

Presentation Overview: Show

Breaking Down Research Silos and Fostering Radical Collaboration through Collective Intelligence

Confirmed Presenter: Alberto Pepe, Sage Bionetworks, United States

Room: 524ab

Format: In Person

Moderator(s): Jason Williams

Authors List: Show

Presentation Overview: Show

Q&A For Flash Talks

Room: 524ab

Format: In person

Moderator(s): Jason Williams

Authors List: Show

9:40-10:00

Tripal: a community-driven framework supporting open science, sustainable data web portals

Confirmed Presenter: Lacey-Anne Sanderson, University of Saskatchewan, Canada

Room: 524ab

Format: In Person

Moderator(s): Jason Williams

Authors List: Show

Presentation Overview: Show

10:40-11:40

Invited Presentation: Open Data, Knowledge Graphs, and Large Language Models

Confirmed Presenter: Andrew Su

Room: 524ab

Format: In Person

Moderator(s): Nomi Harris

Authors List: Show

Presentation Overview: Show

Bioinformatics is the science of collecting, storing, analyzing, and disseminating biological data and information. As in most domains of data science, bioinformaticians have long focused on structured data – information that is represented using ontologies and controlled vocabularies in well-defined data formats and often stored in databases with predefined schemas. This focus on structured data over the last 30 years has been the most efficient way to convert information into testable hypotheses and new scientific insights.

Recent developments in artificial intelligence, particularly the advent of large language models (LLMs), have started to challenge this traditional focus on structured data. By utilizing massive training sets of unstructured text, LLMs have shown exceptional capabilities not only in tasks like question answering and text generation but also in summarization, translation, and code generation. In this presentation, we will examine how LLMs are changing and will continue to change the practice of bioinformatics, particularly at the interface between structured and unstructured data.

Andrew Su, Ph.D., is the Elden and Verna Strahm Professor at the Scripps Research Institute in the Department of Integrative Structural and Computational Biology (ISCB). Dr. Su earned his PhD in chemistry at Scripps Research in 2002, and was the Associate Director of Bioinformatics at The Genomics Institute of the Novartis Research Foundation (GNF) before returning to Scripps Research as a faculty member in 2011.

The Su lab focuses on building and applying bioinformatics infrastructure for biomedical discovery. Dr. Su has had a long-standing interest in leveraging crowdsourcing to organize and integrate knowledge though projects like the Gene Wiki and Wikidata. In partnership with Chunlei Wu’s lab, he has also worked extensively on creating biomedical APIs and enabling API interoperability through the BioThings project. Most recently, his lab has a particular emphasis on constructing and mining knowledge graphs for drug repurposing. In all this work, the Su lab has embraced the principles of open science, open data, and open source software.

11:40-12:00

Gene Set Summarization Using Large Language Models

Confirmed Presenter: Marcin Joachimiak, Lawrence Berkeley National Laboratory, United States

Room: 524ab

Format: In Person

Moderator(s): Jessica Maia

Authors List: Show

Presentation Overview: Show

12:00-12:20

FAIR, modular and reproducible image-based ML workflows for biologists: a template and case study from imageomics

Confirmed Presenter: Hilmar Lapp, Duke University, United States

Room: 524ab

Format: In Person

Moderator(s): Jessica Maia

Authors List: Show

Presentation Overview: Show

14:20-14:40

Trust and Transparency in Reporting Machine Learning: The DOME-GigaScience Press Trial

Confirmed Presenter: Chris Armit, GigaScience Press, Hong Kong

Room: 524ab

Format: In Person

Moderator(s): Jessica Maia

Authors List: Show

Presentation Overview: Show