11:21-12:20

Invited Presentation: Open Knowledge Bases in the Age of Generative AI

Confirmed Presenter: Chris Mungall

Room: 03A

Format: In person

Moderator(s): Karsten Hokamp

Authors List: Show

Presentation Overview: Show

12:20-12:40

textToKnowledgeGraph: Generation of Molecular Interaction Knowledge Graphs Using Large Language Models for Exploration in Cytoscape

Confirmed Presenter: Favour James, Obafemi Awolowo University, Nigeria

Room: 03A

Format: In person

Moderator(s): Karsten Hokamp

Authors List: Show

Presentation Overview: Show

12:40-13:00

BMFM-RNA: An Open Framework for Building and Evaluating Transcriptomic Foundation Models built on Biomed-Multi-Omic

Confirmed Presenter: Bharath Dandala, IBM, United States

Room: 03A

Format: In person

Moderator(s): Karsten Hokamp

Authors List: Show

Presentation Overview: Show

DOME Registry - Supporting ML transparency and reproducibility in the life sciences

Confirmed Presenter: Gavin Farrell, Uni Padova, Italy

Room: 03A

Format: In person

Moderator(s): Karsten Hokamp

Authors List: Show

Presentation Overview: Show

AutoPeptideML 2: An open source library for democratizing machine learning for peptide bioactivity prediction

Confirmed Presenter: Raúl Fernández-Díaz, IBM Research | UCD Conway Institute, Ireland

Room: 03A

Format: In person

Moderator(s): Karsten Hokamp

Authors List: Show

Presentation Overview: Show

Peptides are a rapidly growing drug modality with diverse bioactivities and accessible synthesis, particularly for canonical peptides composed of the 20 standard amino acids. However, enhancing their pharmacological properties often requires chemical modifications, increasing synthesis cost and complexity. Consequently, most existing data and predictive models focus on canonical peptides. To accelerate the development of peptide drugs, there is a need for models that generalize from canonical to non-canonical peptides.

We present AutoPeptideML, an open-source, user-friendly machine learning platform designed to bridge this gap. It empowers experimental scientists to build custom predictive models without specialized computational knowledge, enabling active learning workflows that optimize experimental design and reduce sample requirements. AutoPeptideML introduces key innovations: (1) preprocessing pipelines for harmonizing diverse peptide formats (e.g., sequences, SMILES); (2) automated sampling of negative peptides with matched physicochemical properties; (3) robust test set selection with multiple similarity functions (via the Hestia-GOOD framework); (4) flexible model building with multiple representation and algorithm choices; (5) thorough model evaluation for unseen data at multiple similarity levels; and (6) FAIR-compliant, interpretable outputs to support reuse and sharing. A webserver with GUI enhances accessibility and interoperability.

We validated AutoPeptideML on 18 peptide bioactivity datasets and found that automated negative sampling and rigorous evaluation reduce overestimation of model performance, promoting user trust. A follow-up investigation also highlighted the current limitations in extrapolating from canonical to non-canonical peptides using existing representation methods.

AutoPeptideML is a powerful platform for democratizing machine learning in peptide research, facilitating integration with experimental workflows across academia and industry.

14:00-14:20

BioPortal: a rejuvenated resource for biomedical ontologies

Confirmed Presenter: J. Harry Caufield, Lawrence Berkeley National Laboratory, United States

Room: 03A

Format: In person

Moderator(s): Karsten Hokamp

Authors List: Show

Presentation Overview: Show

14:20-14:40

Formal Validation of Variant Classification Rules Using Domain-Specific Language and Meta-Predicates

Confirmed Presenter: Michael Bouzinier, Forome Association, Harvard University, IDEXX Laboratories, United States

Room: 03A

Format: In person

Moderator(s): Karsten Hokamp

Authors List: Show

Presentation Overview: Show

14:40-15:00

BioChatter: An open-source framework integrating knowledge graphs and large language models for Accessible Biomedical AI

Confirmed Presenter: Sebastian Lobentanzer, Helmholtz Munich, Germany

Room: 03A

Format: In person

Moderator(s): Karsten Hokamp

Authors List: Show

Presentation Overview: Show

15:00-15:20

Applications of Bioschemas in FAIR, AI and knowledge representation

Confirmed Presenter: Nick Juty, The University of Manchester, United Kingdom

Room: 03A

Format: In person

Moderator(s): Karsten Hokamp

Authors List: Show

Presentation Overview: Show

15:20-15:40

RO-Crate: Capturing FAIR research outputs in bioinformatics and beyond

Confirmed Presenter: Phil Reed, The University of Manchester, United Kingdom

Room: 03A

Format: In person

Moderator(s): Karsten Hokamp

Authors List: Show

Presentation Overview: Show

PheBee: A Graph-Based System for Scalable, Traceable, and Semantically Aware Phenotyping

Confirmed Presenter: David Gordon, Office of Data Sciences at Nationwide Children's Hospital, United States

Room: 03A

Format: In person

Moderator(s): Karsten Hokamp

Authors List: Show

Presentation Overview: Show

The role of the Ontology Development Kit in supporting ontology compliance in adverse legal landscapes

Confirmed Presenter: Damien Goutte-Gattat, University of Cambridge, United Kingdom

Room: 03A

Format: In person

Moderator(s): Karsten Hokamp

Authors List: Show

Presentation Overview: Show

15:40-16:00

10 years of the AberOWL ontology repository: moving towards federated reasoning and natural language access

Confirmed Presenter: Robert Hoehndorf, King Abdullah University of Science and Technology, Saudi Arabia

Room: 03A

Format: In person

Moderator(s): Karsten Hokamp

Authors List: Show

Presentation Overview: Show

16:40-16:50

The global biodata infrastructure: how, where, who, and what?

Confirmed Presenter: Guy Cochrane, Global Biodata Coalition, United Kingdom

Room: 03A

Format: In person

Moderator(s): Karsten Hokamp

Authors List: Show

Presentation Overview: Show

16:50-17:50

Panel: Data Sustainability

Room: 03A

Format: In person

Moderator(s): Monica Munoz-Torres

Authors List: Show

Presentation Overview: Show

17:50-18:00

Closing Remarks

Room: 03A

Format: In person

Moderator(s): Monica Munoz-Torres

Authors List: Show

Wednesday, July 23^rd

11:20-11:40

Knowledge-Graph-driven and LLM-enhanced Microbial Growth Predictions

Confirmed Presenter: Marcin Joachimiak, Lawrence Berkeley National Laboratory, United States

Room: 03A

Format: In person

Moderator(s): Tiffany Callahan

Authors List: Show

Presentation Overview: Show

11:40-12:00

ProDiGenIDB – a unified resource of disease-associated genes, their protein products, and intrinsic disorder annotations

Confirmed Presenter: Jovana Kovacevic, Faculty of Mathematics, Belgrade University, Belgrade, Serbia, Serbia

Room: 03A

Format: In person

Moderator(s): Tiffany Callahan

Authors List: Show

Presentation Overview: Show

12:00-12:20

Causal knowledge graph analysis identifies adverse drug effects

Confirmed Presenter: Sumyyah Toonsi, King Abdullah Unversity of Science and Technology, Saudi Arabia

Room: 03A

Format: In person

Moderator(s): Tiffany Callahan

Authors List: Show

Presentation Overview: Show

12:20-12:40

CROssBARv2: A Unified Biomedical Knowledge Graph for Heterogeneous Data Representation and LLM-Driven Exploration

Confirmed Presenter: Erva Ulusoy, Hacettepe University, Turkey

Room: 03A

Format: In person

Moderator(s): Tiffany Callahan

Authors List: Show

Presentation Overview: Show

12:40-12:45

Benchmarking Data Leakage on Link Prediction in Biomedical Knowledge Graph Embeddings

Confirmed Presenter: Galadriel Brière, Aix Marseille Univ, INSERM, MMG, Marseille, France, France

Room: 03A

Format: In person

Moderator(s): Tiffany Callahan

Authors List: Show

Presentation Overview: Show

12:45-12:50

A machine learning framework for extracting and structuring biological pathway knowledge from scientific literature

Confirmed Presenter: Mun Su Kwon, Korea Advanced Institute of Science and Technology (KAIST), South Korea

Room: 03A

Format: In person

Moderator(s): Tiffany Callahan

Authors List: Show

Presentation Overview: Show

12:50-13:00

Invited Presentation: Poster Madness

Room: 03A

Format: In person

Moderator(s): Tiffany Callahan

Authors List: Show

Presentation Overview: Show

14:00-14:20

Proceedings Presentation: ScGOclust: leveraging gene ontology to find functionally analogous cell types between distant species

Confirmed Presenter: Yuyao Song, European Bioinformatics Institute, United Kingdom

Room: 03A

Format: In person

Moderator(s): Robert Hoehndorf

Authors List: Show

Presentation Overview: Show

14:20-14:40

Integrating autoantibody-related knowledge in an ontology populated using a curated dataset from literature

Confirmed Presenter: Fabien Maury, Inserm, France

Room: 03A

Format: In person

Moderator(s): Robert Hoehndorf

Authors List: Show

Presentation Overview: Show

14:40-15:00

Ontology pre-training improves machine learning predictions of aqueous solubility and other metabolite properties

Confirmed Presenter: Charlotte Tumescheit, University of Zurich, Swiss Institute of Bioinformatics, Switzerland

Room: 03A

Format: In person

Moderator(s): Robert Hoehndorf

Authors List: Show

Presentation Overview: Show

15:00-15:20

Building the Aging Biomarkers Ontology and Its Applications in Aging Research

Confirmed Presenter: Hande McGinty, Kansas State University, Manhattan KS, United States

Room: 03A

Format: In person

Moderator(s): Robert Hoehndorf

Authors List: Show

Presentation Overview: Show

15:20-15:40

Discovering cellular contributions to disease pathogenesis in the NLM Cell Knowledge Network

Confirmed Presenter: Richard Scheuermann, Division of Intramural Research, National Library of Medicine, United States

Room: 03A

Format: In person

Moderator(s): Robert Hoehndorf

Authors List: Show

Presentation Overview: Show

15:40-16:00

Cat-VRS for Genomic Knowledge Curation: A Hyperintensional Representation Framework for FAIR Categorical Variation

Confirmed Presenter: Daniel Puthawala, Nationwide Children's Hospital, United States

Room: 03A

Format: In person

Moderator(s): Robert Hoehndorf

Authors List: Show

Presentation Overview: Show

16:40-17:40

Invited Presentation: Knowledge Graphs: Theory, Applications and Challenges

Confirmed Presenter: Ian Horrocks

Room: 03A

Format: In person

Moderator(s): Augustin Luna

Authors List: Show

Presentation Overview: Show

17:40-17:45

Bridging Language Barriers in Bio-Curation: An LLM-Enhanced Workflow for Ontology Translation into Japanese

Confirmed Presenter: Mark Streer, SciBite (Elsevier Ltd.), United Kingdom

Room: 03A

Format: In person

Moderator(s): Augustin Luna

Authors List: Show

Presentation Overview: Show

17:45-17:50

Enabling FAIR Single-Cell RNAseq Data Management with COPO

Confirmed Presenter: Felix Shaw, Earlham Institute, United Kingdom

Room: 03A

Format: In person

Moderator(s): Augustin Luna

Authors List: Show

Presentation Overview: Show

17:50-17:55

Cancer Complexity Knowledge Portal: A centralized web portal for finding cancer related data, software tools, and other resources

Confirmed Presenter: Susheel Varma, Sage Bionetworks, United States

Room: 03A

Format: In person

Moderator(s): Augustin Luna

Authors List: Show

Presentation Overview: Show