Return to ISMB/ECCB 2025 Homepage Click here for the abridged agenda

Schedule for TechTrack

NOTE: Browser resolution may limit the width of the agenda and you may need to scroll the iframe to see additional columns.
Click the buttons below to download your current table in that format

Date	Start Time	End Time	Room	Track	Title	Confrimed Presenter	Format	Authors	Abstract
2025-07-21	11:20:00	11:40:00	11BC	Tech Track	UniProt: Evolving Tools and Data for Protein Science	Daniel Rice	In person	Daniel Rice	The Universal Protein Resource (UniProt) is a cornerstone of molecular biology and bioinformatics, delivering high-quality, freely accessible protein sequence and functional information for over 20 years. This session presents a guided tour of UniProt’s latest features, datasets, and tools, reflecting its continued evolution to meet the needs of the scientific community. We will highlight data integrations—including AlphaFold structural predictions, AlphaMissense variant effect predictions, RNA editing, post-translational modifications (PTMs), and Human Proteome Project (HPP) datasets—and demonstrate embedded visualizations developed by UniProt and third-party contributors. Attendees will learn about improved tools for browsing, analyzing, and exporting data, along with recent enhancements to UniProt’s API and new Swagger documentation that streamline programmatic access and data integration. Whether you're a student or a seasoned researcher, this session will help you better leverage UniProt in your work. We will emphasize practical applications and encourage engagement with UniProt’s expanding capabilities. Attendees will leave with a deeper understanding of how to integrate UniProt resources into their workflows—and how to contribute feedback to guide its future development.
2025-07-21	11:40:00	12:20:00	11BC	Tech Track	Genomics 2 Proteins portal: A resource and discovery platform for linking genetic screening outputs to protein sequences and structures			Sumaiya Iqbal, Jordan Safer	Recent advances in AI-based methods have revolutionized the field of structural biology. Concomitantly, high-throughput sequencing and functional genomics have generated genetic variants at an unprecedented scale. However, efficient tools and resources are needed to link disparate data types – to “map” variants onto protein structures, to better understand how the variation causes disease, and thereby design therapeutics. Here we present the Genomics 2 Proteins Portal (G2P; g2p.broadinstitute.org): a human proteome-wide resource that maps 49,500,857 genetic variants onto 42,481 protein sequences and 84,318 structures (according to Dec 2024 release), with a comprehensive set of structural and functional features. Additionally, the G2P portal allows users to interactively upload protein residue-wise annotations (variants, scores, etc.) as well as the protein structure beyond databases to establish the connection between genomics to proteins. The portal serves as an easy-to-use discovery tool for researchers and scientists to hypothesize the structure-function relationship between natural or synthetic variations and their molecular phenotypes.
2025-07-21	12:20:00	12:40:00	11BC	Tech Track	ModCRE: modelling protein–DNA interactions and transcription-factor co-operativity in cis-regulatory elements	Patrick Gohl	In person	Patrick Gohl, Patricia Bota	ModCRE is a web server using a structural approach to predict Transcription Factor binding preferences and auto-mate modelling of higher order regulatory complexes with DNA.
2025-07-21	14:00:00	14:20:00	11A	Tech Track	Orchestrating Microbiome Analysis with Bioconductor	Tuomas Borman	In person	Tuomas Borman, Leo Lahti	Microbes play a crucial role in human health, disease, and the environment. Despite their significant impact, the mechanisms underlying microbiome interactions remain largely unknown due to their complexity. While microbiome research has heavily relied on sequencing data, understanding these interactions requires multi-omics approaches and computational methods. R/Bioconductor is a well-established platform for biological data analysis, providing high-quality, open-source software. It is driven by a global community of researchers who collaborate through software development, shared standards, and active forums. The software is built on standardized data containers, with SummarizedExperiment being the most widely used, adopted across a wide range of biological fields. Shared data containers enable interoperability and facilitate advanced data integration. This session will showcase Bioconductor tools for microbiome data science, with a particular focus on the mia (Microbiome Analysis) framework through a practical case study. By the end of the session, participants will gain insights into the latest advances in microbiome research within Bioconductor, including TreeSummarizedExperiment data container along with essential methods. They will also be prepared to further explore the data science ecosystem, supported by the online book Orchestrating Microbiome Analysis with Bioconductor.
2025-07-21	14:20:00	14:40:00	11A	Tech Track	Smart Turkana Beads: A Culturally Embedded IoT Innovation for Health Monitoring and Drug Adherence			Meya Brian	Access to quality healthcare remains a persistent challenge in marginalized regions such as Turkana County in northern Kenya, where traditional beliefs, nomadic lifestyles, and weak infrastructure significantly hinder health service delivery and uptake, particularly in chronic disease management and drug adherence. This proposal introduces an innovative solution: culturally embedded Internet of Things (IoT) technology in the form of Smart Turkana Beads—a wearable health-monitoring device seamlessly integrated into the community’s traditional attire to enhance health surveillance and promote drug adherence. Leveraging Indigenous Knowledge Systems (IKS), the project seeks to reduce the cultural dissonance typically encountered by conventional biomedical technologies, while addressing key barriers to healthcare access in a way that aligns with local values and practices (Mwakalinga et al., 2021)
2025-07-21	14:40:00	15:00:00	11A	Tech Track	Integrating Long-Read Sequencing and Multiomics for Precision Cell Line Engineering			Daniel Fabian	Optimizing the biomanufacturing of therapeutic molecules, such as monoclonal antibodies, is critical for delivering efficient and scalable patient treatment. A key early step in this production pipeline is the development of Chinese Hamster Ovary (CHO) cell lines that produce these biologic molecules. To address the rising demand for therapeutic biologics, it is essential to enhance cell line expression to achieve consistently high and stable product titers. Lonza has recently integrated nanopore long-read sequencing into its multiomics pipelines, providing unprecedented molecular insights into the genetic and epigenetic landscapes of CHO cell lines, advancing both genetic engineering and biomarker discovery. This presentation will highlight recent progress in improving omics data accuracy through de novo genome assembly, and the integration of nanopore whole genome sequencing and DNA methylation analysis with other next-generation sequencing technologies.
2025-07-21	15:00:00	15:20:00	11A	Tech Track	Lonza Sponsor Talk	Harry Allsopp		, Harry Allsopp
2025-07-21	15:20:00	15:40:00	11A	Tech Track	Revving up insights in single cell and spatial exploration	Stephen Williams		, Stephen Williams	10x Genomics will present a comprehensive overview of our cutting-edge technologies: Visium for spatial transcriptomics, Chromium for single-cell analysis, and Xenium for in situ analysis. The talk will delve into the foundational principles of each platform, highlighting their unique capabilities in advancing biological discovery. Attendees will gain insight into how these technologies enable researchers to explore biological systems at unprecedented resolution, from whole tissue sections to individual cells and subcellular localization. The presentation will also cover the robust software ecosystem surrounding 10x Genomics assays, including both 10x developed software and widely used third-party tools for data analysis and visualization. Furthermore, the talk will address current challenges in analyzing complex data, alongside exciting opportunities for novel software and algorithm development with 10x datasets. We hope to inspire attendees to unlock deeper biological insights from the vast amounts of data generated by all 10x Genomics platforms.
2025-07-21	15:40:00	16:00:00	11A	Tech Track	The Next Generation of scverse - Tools for Scalable Single-Cell Analysis	Severin Dicks		Severin Dicks	Scverse is a modular, open-source ecosystem for scalable single-cell omics analysis. It provides standardized data structures through AnnData, MuData, and SpatialData, enabling seamless handling of multi-modal and spatial datasets, and offers integration with core analysis tools such as scanpy and scvi-tools. To address evolving community needs, four new core packages were added: pertpy, decoupler, snapatac2 and rapids-singlecell.
2025-07-22	11:20:00	11:40:00	11A	Tech Track	Scale with Seqera: Accelerate, Expand, and Collaborate	Adam Talbot	In person	Adam Talbot, Geraldine Van der Auwera	Turning a promising research project into a robust, real-world solution requires tools that support both early experimentation and enterprise-scale deployment. When reproducibility and reliability are non-negotiable, you need a platform that's flexible during ideation and powerful enough to meet the demands of mega-scale computation and collaborative research. Too often, scaling up means switching tools, rewriting pipelines, or reprovisioning infrastructure — an expensive, frustrating process that can introduce errors and undermine scientific reproducibility. In this talk, we will explore how Seqera's integrated suite of products empowers you to scale and accelerate your scientific research.
2025-07-22	11:40:00	12:00:00	11A	Tech Track	SimpleVM - Effortless Cloud Computing for Research	Viktor Rudko	In person	Viktor Rudko, Peter Belmann, Nils Hoffmann, David Weinholz, Alexander Sczyrba	SimpleVM empowers life science researchers to harness cloud resources, regardless of their expertise in cloud computing. As a multi-cloud application, SimpleVM is optimized for seamless integration with multiple OpenStack® installations. From an OpenStack administrator's perspective, all that is required is an OpenStack project without the need for additional admin privileges. SimpleVM features enhanced security components that scan connection attempts to virtual machines and automatically block suspicious access attempts.  By combining KeyCloak with a Django-based service layer, SimpleVM provides comprehensive user management and customizable role-based access control. This facilitates the integration of Authentication and Authorisation Infrastructure (AAI) for seamless use of various Identity Providers (IDPs), including LifeScience AAI or a local university IDP.   In addition to the straightforward launch of virtual machines, the emphasis is placed on advanced features that are intended to streamline and enhance the user experience when working with cloud resources. For example, Virtual Research Environments (VREs) can be deployed with just a few clicks, providing access to powerful applications such as Visual Studio Code(R) or RStudio directly from the browser.  The SimpleVM workshop mode fosters the realization of high-attendance teaching sessions. Workshop instructors can easily pre-configure, launch and assign machines to attendees in no time. Finally, SimpleVM improves the use of cloud resources with features such as the auto-scalable SLURM clusters.
2025-07-22	12:00:00	12:20:00	11A	Tech Track	Overture Prelude: A toolkit for small teams with big data problems	Mitchell Shiell	In person	Mitchell Shiell, Melanie Courtot, Brandon Chan, Jon Eubank, Robin Haw, Justin Richardsson, Leonardo Rivera, Lincoln Stein, Overture Team	Overture is used to build platforms that enable researchers to organize and share their data quickly, flexibly and at multiple scales. While Overture successfully powers major international platforms like ICGC-ARGO (100,000+ participants) and VirusSeq (500,000+ genomes), smaller teams generating massive data face prohibitive technical requirements during implementation. How then can we enable teams to build their data platform efficiently and with fewer resources? Prelude addresses this challenge by breaking down platform development into incremental phases, reducing the technical overhead during development and allowing teams to systematically verify requirements through hands-on testing, gaining insights into workflows, data needs, and platform fit. Prelude guides teams through three progressive phases of data platform development each building upon the previous one's foundation: - Phase one focuses on data exploration and theming, enabling teams to visualize and search their data through a customizable UI; - Phase two expands capabilities to enable tabular data management and validation with persistent storage; - Phase three adds file management and object storage. These phases are supported by comprehensive documentation, deployment automations, and utilities that generate key configuration files, reducing unnecessary time spent on tedious manual configurations. Prelude represents a practical step toward making data platform development accessible to all teams of all sizes. By providing a widely accessible platform we hope to encourage community requests and feedback such that we can improve and iterate on the platform making it the best it can be for advancing data sharing and reuse across the scientific community.
2025-07-22	12:20:00	13:00:00	11A	Tech Track	AI and Quantum in Healthcare and Life Sciences	Filippo Utro		Filippo Utro	The advent of foundation models (FM) and quantum computing (QC) has ushered in a new paradigm for tackling complex problems, igniting significant interest across diverse sectors, particularly within healthcare and life sciences. This talk will provide an exploration of the latest efforts at IBM Research dedicated to leveraging FM and QC for accelerating discovery in healthcare and life sciences. The discussion will span a range of applications in omics data, clinical trials, and drug discovery. Finally, in this presentation, I will discuss technical challenges, envisioning the new era of FM and QC in healthcare and life sciences.
2025-07-22	14:00:00	14:20:00	11A	Tech Track	GPCRVS - AI-driven decision support system for GPCR virtual screening	Dorota Latek	In person	Dorota Latek	GPCRVS represents an efficient, simple, easily accessible, and open-source web service that, as a decision support system, aims to facilitate the preclinical testing of drug candidates targeting peptides and small protein-binding G protein-coupled receptors. There are three major areas of drug discovery that GPCRVS could facilitate: prediction of drug selectivity, prediction of drug efficacy approximated by Autodock Vina docking scores, or by activity class assigned by the TensorFlow multiclass classifier, or by pChEMBL predictions using the LightGBM regressor, and finally prediction of the drug binding mode, showing the most crucial amino acids involved in the drug-receptor interactions. A comparison with precomputed results for known active compounds enables the prioritization of drug candidates, thereby significantly reducing the cost and length of experimental screening. In addition, a novel approach to using peptide ligand data sets as SMILES-based fingerprints in conjunction with small-molecule ligand data sets in the training of DNN and GBM models was proposed. This makes possible to benefit from all GPCR-like ligand data sets deposited in ChEMBL, and to design new drugs that could include both peptide and non-peptide scaffolds of increased, unified activity and selectivity. Currently, two groups of peptide/small protein-binding GPCR receptors are included in GPCRVS, allowing it to make comparative predictions for class A and B receptors at the same time. The evaluation of GPCRVS performed using the patent compound data set from Google Patents showed that LightGBM provides the most accurate results among the three classifiers implemented in GPCRVS.
2025-07-22	14:20:00	14:40:00	11A	Tech Track	Self-supervised generative AI enables conversion of two non-overlapping cohorts	Supratim Das	In person	Supratim Das, Mahdie Rafiei, Andreas Maier, Linda Baumbach, Jan Baumbach	Prognostic models in healthcare often rely on big data, which is typically distributed across multiple medical cohorts. Even if collected for similar purposes (e.g., capturing symptoms, treatments, and outcomes in osteoarthritis), they frequently differ in acquisition methods, structures, and variable definitions used. These discrepancies impede their integration into a unified, multi-cohort database for joint prognostic model training and pose significant challenges to model transferability, meaning a model trained on one cohort needs to be applied to data of a similar cohort with an incompatible data structure. Current cohort conversion approaches rely on AIs trained on linked, overlapping samples, which many healthcare cohorts lack. Here, we present DB-converter, a self-supervised deep learning architecture leveraging category theory and designed to convert data from different cohorts with different data structures into each other. We demonstrate the power and robustness of the DB-converter using synthetic and real health survey data. Our approach opens new avenues for multi-cohort analyses operating under the assumption that all cohorts to be integrated have been acquired for at least a similar real-world purpose.
2025-07-22	14:40:00	15:00:00	11A	Tech Track	The Bioverse - Biomolecule data processing for AI made easy	Tim Kucera	In person	Tim Kucera, Karsten Borgwardt	We introduce the bioverse, a free and open-source Python package that streamlines biological data preparation for machine learning. Focused on structural biology, it standardizes diverse biomolecular formats for flexible, high-performance workflows. Our demonstration will showcase key features, code examples, and how to launch your own ML projects in minutes.
2025-07-22	15:00:00	15:20:00	11A	Tech Track	Data, We Need to Chat	Alberto Pepe	In person	Susheel Varma, Jineta Banerjee, Robert Allaway, John Hill, Jay Hodgson, Alberto Pepe, Christine Suver, Luca Foschini	"The exponential growth of biomedical datasets presents unprecedented opportunities for scientific discovery, yet researchers struggle to find and explore relevant data. Traditional search methods fall short when navigating complex, highly regulated biomedical data repositories. This paper examines these limitations and proposes AI-powered conversational interfaces as a solution. Key obstacles to effective data discovery include repository fragmentation, inconsistent metadata, vocabulary mismatches, complex search requirements, and inadequate interface design. These challenges are intensified in biomedical research by regulatory restrictions on accessing sensitive data. Conversational AI systems offer a promising alternative by enabling natural language dialogue with data repositories. Unlike keyword searches, these interfaces understand user intent, ask clarifying questions, and guide researchers to relevant datasets. Synapse.org's experimental chatbot implementation demonstrates how AI-assisted discovery processes complex queries (e.g., ""datasets related to people over 60 with Alzheimer's disease and Type 2 diabetes"") without requiring database expertise. This approach leverages Retrieval-Augmented Generation (RAG) while respecting authorization levels and regulatory compliance. Such systems facilitate ""metadata spelunking,"" allowing researchers to explore dataset composition, methodology, and potential utility without needing to access sensitive raw data. The paper addresses ethical considerations related to privacy, bias, and trust, while outlining future possibilities for interdisciplinary data discovery. By bridging the gap between vast biomedical data repositories and researchers, conversational AI interfaces promise to democratize data access, accelerate discovery, and ultimately improve human health."
2025-07-22	15:20:00	15:40:00	11A	Tech Track	BioInfore: A No-Code Genome Data Management System Based On AI Agents	Zheng Chen	In person	Zheng Chen, Ziwei Yang, Xihao Piao, Peng Gao, Yasushi Sakurai, Yasuko Matsubara	In many genomic projects, selecting and preparing assemblies requires complex database queries, manual metadata curation, and bespoke code scripting. We introduce a no-code AI agent workflow that replaces all of these steps with a single plain natural language request. Behind the scenes, five specialized AI agents handle retrieval, quality filtering, ranking, and format conversion. Users receive analysis-ready genome datasets in minutes, freeing them from programming barriers and manual errors so they can focus on biological discovery.
2025-07-22	15:40:00	16:00:00	11A	Tech Track	Omi: Bridging the Informatics to Bio Gap with a Natural Language Co-pilot			Prashant Bharadwaj Kalvapalle, Eddie Kim, Marko Tanevski, Sahil Joshi, Benjamin Mao, Anshumali Shrivastava, Todd Treangen	Omi facilitates bioinformatics analysis by replacing complex command-line processes with a natural language bioinformatics co-pilot. We codify bioinformatics best practices into the LLM to select appropriate pipelines, provide explanations before running them, and return results after pipeline execution. Further democratization through coding bespoke statistical and data visualizations is underway.
2025-07-22	16:40:00	17:00:00	11A	Tech Track	Introducing the Longitude Prize on ALS			Amalia Papanikolaou	The Longitude Prize on ALS is a £7.5m [~$10m USD] international programme that seeks to incentivise the use of AI-based approaches to transform therapeutic discovery for the treatment of amyotrophic lateral sclerosis (ALS), the most common form of motor neurone disease (MND). The five-year Prize will see global multidisciplinary teams compete in a three-phased programme to identify, prioritise and validate high-potential drug targets for ALS. To drive discovery, the Prize has brought together a range of ALS datasets, working with data holders worldwide, and participants will be provided with access to these data, funding, compute, technical expertise and a range of additional support and partnership opportunities. Come join our talk and learn more about this funding opportunity and how you can join us in harnessing the power of AI to tackle ALS!

- top -