Leading Professional Society for Computational Biology and Bioinformatics
Connecting, Training, Empowering, Worldwide

ISCB News and Announcements

Links within this page: Invitation Letters | Electronic Travel Authorizations | European Travel Information and Authorization System (ETIAS) Requirement



In order to travel to England, there are certain entry requirements you must meet depending on the country you live in.

You can check if you need a visa and if you meet basic eligibility requirements HERE.

If you require a visa, you can apply for a visa up to 3 months before the day you are due to travel to the UK. Once you’ve applied online, proved your identity and provided your documents, you’ll usually get a decision on your visa within 3 weeks if you’re outside the UK, or 8 weeks if you’re inside the UK. If deemed eligible, you may also pay an additional fee for a faster decision using ‘priority service’.

Please refer to the UK Visas and Immigrations Department webpages for additional links and information.

You can also visit the IATA TravelCentre website, input information about your trip, and receive details about what immigration and health requirements must be met.

- top -


Invitation Letters

If you require an invitation letter to support your visa application, please complete our Visa Letter Form. This must be completed after you register for the conference, and we recommend filling it out no later than Sunday, April 27th, 2025.

Using this form results in an auto-generated visa letter. If you experience any issues generating your visa letter, please reach out to This email address is being protected from spambots. You need JavaScript enabled to view it..

Visa Letter Form

- top -


Electronic Travel Authorizations (ETA)

Non-Europeans can apply in advance for an ETA, and will need one to enter the UK beginning January 8, 2025. An ETA is not a visa; it is a digital permission to travel to the UK. Applying for an ETA costs £16.

For more details on the ETA application steps, what you can/cannot do with an ETA, and materials needed to apply, please read this page

If you are unsure whether you need an ETA, you can check the "Who does not need an ETA" section on the page linked above.

- top -


European Travel Information and Authorization System (ETIAS) Requirement

The ETIAS travel authorization is a new program that was set to launch in 2025, but is now launching in the last quarter of 2026. No action from travelers is required at this time.

If you would like to review the ETIAS requirements in advance of the program's launch, please visit the links below.

It is a requirement for entry for visa-exempt nationals traveling to any of a list of 30 European countries. The travel authorization is linked to your passport, and with it you’re able to enter the territory of the listed European countries as often as you want for short-term stays in any 180-day period. Note, however, that the ETIAS authorization does not guarantee entry to these countries. You will still be required to show your passport and other documentation in order to verify that you meet each country’s entry requirements.

For detailed information about the ETIAS travel authorization, click HERE.

- top -

Links within this page: Funding Information | Application Overview | Key Dates | Maximum Award Amounts | Application Process | Eligibility Requirements | Eligible Expenses | Award Notification | Contact | Contributors



Funding Information

ISCB is pleased to offer conference fellowships, including registration waivers for virtual participants, for students and postdoctoral fellows to present a talk or poster at the conference. Funding sources for Conference Fellowships are very limited and we regret that we are not able to fund all applicants. The conference organizers are committed to providing support to as many eligible applicants as possible. Conference Fellowship consideration is based on membership and accepted work.

Conference Fellowship Application Invitations are sent directly to eligible individuals after acceptance of scientific submissions to Proceedings, Abstracts, and/or Posters.

- top -


Application Overview

  1. The submitting author will be sent the invitation and is responsible for getting the invitation to the presenting author if the work is not being presented by the submitting author;
  2. Applicant must be a current member of ISCB prior to submitting an application. The membership must be valid through the calendar year;
  3. Applicant must be listed as an author and be the presenter of an accepted Oral or Poster presentation (Late Poster submissions are ineligible for a fellowship) in order to be eligible to apply for conference fellowship funds
  4. All applicants must attend all conference days and secure additional funding from other sources in order to be able to cover the full costs of attending the conference;
  5. The deadline to submit a fellowship application is May 20, 2025. No exceptions will be made.

- top -


Key Dates

May 14, 2025 Conference Fellowship invitations sent for accepted talks and posters
May 20, 2025 Conference Fellowship Application Deadline
May 31, 2025 Conference Fellowship Acceptance Notification
June 13, 2025 Conference Applicant Registration Deadline

- top -


Maximum Award Amounts

The maximum fellowship award is determined based on the geographical location of the applicant and upon submission of appropriate receipts. Please note that funded applicants will only be able to cover approximately 50% of the expense of travel and registration fees with these fellowship amounts. Thus all applicants must seek and secure additional funding sources (e.g., from your home institution/university, or grant funding). Maximum awards are as follows:

Africa $1500 USD
Asia-Pacific $1500 USD
Canada 1500 USD
East Coast USA $1000 USD
Europe $675 USD
Mexico $1500 USD
Central USA $1000 USD
South America $1500 USD
United Kingdom $500 USD
West Coast USA $1250 USD
West Coast Canada $1250 USD

- top -


Application Process

Application is by invitation-only, sent automatically via email to the submitting author of an accepted Proceeding, Abstract Talk, and/or Poster submission (excluding accepted Late Posters). This invitation email will arrive after notification of acceptance of one of these submission types as a separate email. IF YOU HAVE AN ACCEPTED PRESENTATION AND HAVE NOT RECEIVED AN INVITATION BY END OF DAY, ANYWHERE ON EARTH, MAY 14 – PLEASE CHECK YOUR SPAM FOLDER AND THEN CONTACT US IF AN INVITATION IS NOT THERE: This email address is being protected from spambots. You need JavaScript enabled to view it.

Each invitation will include a travel fellowship application URL to link to an application. The application URL must be submitted by the presenting author only, if the qualifying requirements are met. If the submitting author is not the presenting author, it is the responsibility of the submitting author to forward the invitation to the presenting author if the eligibility requirements are met. Each application URL can only be used one time and no application will be accepted after the deadline of May 20, 2025.

- top -


Eligibility Requirements

  1. Applicant must be a current ISCB member whose membership does not expire prior to December 31, 2025. Applications will not be accepted from non-members; pending memberships do not qualify and must be paid in full prior to submission of an application.
  2. Applicant must be listed as an author or co-author on the original submission of an accepted Proceedings paper, Abstract, or Poster (excluding accepted Late Posters), and, per the requirements of the funding agencies, the funded applicant must be the presenting author of the work. (Submitters to the "Call for Late Posters" are not eligible for fellowship funding.)
  3. Applicant must be registered in a degree program (undergraduate or graduate) or as a *post-doctoral research fellow at an accredited educational institution at the time of the conference; early career researcher (low - Upper-Middle Economic countries); post docs and employees of any US federal agency are ineligible for funding using US federal funds - currently we have only US federal funds for this travel fellowship program. (*The period of eligibility for a Post-Doc is five (5) years from the time of their PhD completion date).
  4. Applicant must be registered by June 13, 2025, and attend all four conference days. If attendance at the conference is dependent on receipt of fellowship funds, please do not register until after the notification of travel fellowship funding. Any funded applicant failing to register for the conference by June 13, 2025 will automatically forfeit the funds so that another applicant can be awarded from among the original pool of applicants.
  5. Applicant must be able to pay all expenses of attending the conference up front, including conference registration fee (as noted in above), travel, accommodations, and meals. Travel Fellowship funding will be provided via the ISCB payment system (bill.com) via secure electronic funds transfer (wire or ACH) approximately 6-8 weeks after the conference. 

- top -


Eligible expenses

Eligible expenses toward fellowship funds include registration for the conference, Student Council Symposium or Tutorials, transportation (air or land transportation from home region to conference city), hotel accommodations (booked within the ISCB official block), and a maximum of $250.00 in meal expenses. In order to receive the full-awarded amount, receipts for registration, transportation, and hotel accommodations that equal or exceed the awarded amount are required.

- top -


Notification

Applicants will be notified no later than May 31, 2025 of the funding status. In some cases applicants may be notified they are on a waitlist for funding, which means that ISCB is fully expecting but still awaiting the formal confirmation of our grant award from one or more granting agency, and that awarding of those funds will not be possible until the grant needed to fund the travel fellowship is confirmed. Any waitlisted applicant that is eventually awarded funds will be offered the opportunity to register at the early registration rate, therefore, please do not register for the conference if your attendance is fully dependent on being awarded a travel fellowship as any cancellation of an applicant's registration will be subject to the full regular registration cancellation policy.

Funded applicants will be required to present evidence of their eligibility status (such as student identification card) when signing in with the Conference Fellowships Desk to record their attendance. In all cases, funds will be mailed to funded applicants after the conference per the details noted in Eligibility Requirements above.

- top -


Contacts

Questions regarding fellowships should be addressed to: This email address is being protected from spambots. You need JavaScript enabled to view it.

The information on this page is subject to change without notice, and all changed information will be considered final for the purposes of awarding and funding of Conference Fellowships.

- top -


Contributors

The Conference Fellowships are made possible by generous donations from:

- top -

ISCB recognizes that many of our members have personal care responsibilities that can present a challenge to attending scientific conferences, and hence interfere with career advancement. We also recognize that you know best how to care for your own dependents and what their needs are.

Accordingly, ISCB offers Dependent Care grants in the amount of $600 USD to as many eligible candidates as allowed by available funding to be used towards expenses that will facilitate the member’s attendance.

Candidate eligibility

  • Be an ISCB member in good standing (active membership at the time of the application and during the conference).
  • Have a submission accepted for presentation in which the candidate is the presenting author. Any presentation is eligible, but priority will be given to members presenting talks.
  • Be a primary caregiver (the one that cares more than 50% of the time) of a child or dependent adult at the time of the conference
    • Alternatively, the award can be used if the member needs caregiving assistance during the conference
  • Priority will be given to applicants who have a demonstrated need and/or are in early stages of their academic careers (e.g., graduate students, postdocs), as well as ISCB members working in non-high-income countries.

Eligible expenses include (but are not limited to) providing airfare for your child or for a caregiver to accompany you and attend functions with you if needed, costs for childcare at home while you are away (be it professionally provided or costs related to having a relative fly to provide care), or extra help paying for on-site care. Awardees can spend this money as they think best.

Applicants will be asked to provide:

  • Career stage (undergraduate student/graduate student/postdoctoral researcher/non-tenured faculty/tenured faculty/unemployed).
  • Age of dependents, if applicable.
  • Description of role as caregiver for a dependent adult, child, or relative; or needs for personal assistance.
  • A brief justification of the need for this award and how the award will be used (300 words max).

Payments will be sent promptly following the completion of the meeting after the awardee submits proof of attendance.

Submission deadline: May 30, 2025
Notifications will be sent by June 6, 2025


- top -

We are seeking event staff applications. Event Staff must be ISCB members with memberships expiring on or after the last day of the conference.

Event staff are expected to assist as scheduled for approximately 20 - 24 hours during the conference (generally for a shift of 5 - 6 hours each day).
Event staff are asked to participate in a training session in the afternoon of Saturday, July 19. The session will last approximately 90 minutes.
Event staff should be available for scheduled shifts on all dates beginning the first day of the conference, through the end of the conference. A schedule of shift allocations will be provided prior to the conference start date.

In return for working as event staff, those selected are provided with a complimentary conference registration, time-based pay, and a conference T-shirt. Regarding registration, we ask that you DO NOT register in advance, we will send a code for your registration after decisions have been made. If you are not selected, a discount code will be sent to you to allow you to register at the early bird rate.

Some event staff roles:

  • Technical Moderator (TM) - responsible for assisting the speakers in loading their presentations on the laptop, managing the audio and visual recording of presentations from the room in the virtual platform, moderate the online chatroom and answer attendee questions as needed. Assist the Scientific Session Moderator as required.
  • Room Monitor (RM) - monitor the doors once the session starts to ensure late arriving delegates enter quietly. Track and record the number of delegates for each session and submit them to ISCB staff. Provide back up to the TM as needed.
  • Information Desk (ID) - assist delegates with general questions related to the venue and the conference to include room directions, programme timeline and details, and social activities.
  • Registration Desk (RD) - assist by handing out delegate badges and additional items as needed, ensuring the orderly and smooth operation of the registration area and flow of delegate item collection.
  • Poster Hall (PH) - set up poster numbers, provide pins, directions, and other questions about posters as required by attendees and ISCB staff.

In addition to above specific roles, all event staff are asked to assist with general conference items such as directions and other duties as required.

Application deadline is Friday, May 9, 2025.
Notifications will be sent on Friday, May 23, 2025.

Click here to submit an application

Links within this page: Bioinformatics in the UK | Come Together: A Night of Networking & Fun | ColaborationFest | Edit-A-ThonISCB-China Workshop | LGBTQI+ Meet-Up | Bioinfo4Women Meet-Up | Serene Stretch Symposium



Bioinformatics in the UK

Date: Monday, July 21, 2025
Time: 18:00
Location: Exhibition Centre Terrace
Cost: £10 Students, £10 Post-Docs, £20 Professionals

Join us for a dedicated poster and networking reception at the conclusion of the Bioinformatics in the UK track sessions. Attendance is open, but space is limited and registration is required.  You can register while registering for ISMB/ECCB 2025.

- top -


Come Together: A Night of Networking & Fun

Date: Wednesday, July 23, 2025
Time: 19:00 – 21:00
Location: Punch Tarmey's
Website: https://www.punchtarmeys.co.uk/
Cost: £25 Students, £25 Post-Docs, £50 Professionals
Registration for this off-site event closes EoD July 7th!

Join us for an unforgettable evening of fun and networking at Punch Tarmey's on Wednesday, July 23, 2025! This exclusive add-on event for ISMB/ECCB 2025 combines great food, good company, and endless entertainment. Your ticket includes a delicious dinner, dessert, two drink tickets, and unlimited access to Arcains, the incredible three-story arcade next door. Whether you're connecting with colleagues, challenging new friends to a game, or just unwinding, this is your chance to kick back and level up your conference experience!

- top -


CollaborationFest

Date: Wednesday, July 23, 2025 and Thursday, July 24, 2025
Time: TBD
Location: TBD

Join us at ISMB CollaborationFest 2025!

CollaborationFest will be a collaborative work event at which participants work together to contribute code, documentation, training materials, and challenging analysis problems and use cases. Bring your own project ideas or come ready to collaborate with others on their projects!

BOSC has held CollaborationFests (aka CoFests) every year before or after ISMB since 2010. (See, for example, https://www.open-bio.org/events/bosc-2024/obf-bosc-collaborationfest-2024/) This year, we decided to hold the CollaborationFest as part of ISMB/ECCB and open it to all registered ISMB/ECCB participants. It will take place during the last two days of ISMB/ECCB.

ISMB CollaborationFest 2025 will be co-organized by volunteers from four COSIs:

  • Bioinformatics Open Source Conference (BOSC)
  • Bio-Ontologies and Knowledge Representation (BOKR)
  • Function COSI
  • 3DSig

Due to space constraints, participation in CollaborationFest is limited. To indicate your interest in participating, register for ISMB/ECCB 2025 by June 20 and sign up for CollaborationFest as an “add-on” during the registration process. We will contact you by June 23 to request a few pieces of information to help us organize the event. Last-minute walk-ins may be possible if we are not at capacity.

Limited remote participation will be possible but how much you can interact will be highly dependent on the people and projects active on the day. We expect to have a video feed from the room available; joining will require you to have registered as a (virtual or in-person) ISMB attendee.

- top -


Edit-A-Thon

Date: TBD
Time: TBD
Location: TBD
Capacity: 50 in-person, 100 virtual

The ISCB Wikipedia Committee is organizing an Editathon at the Intelligent Systems for Molecular Biology (ISMB) Conference in Liverpool, July 2025. This event aims to improve and expand Wikipedia content related to bioinformatics and computational biology, making high-quality scientific knowledge accessible to the global community.

Join forces with researchers, students (from advanced undergraduates to PhDs), and veteran Wikipedia editors as we:

  • Upgrade existing articles with up-to-date bioinformatics insights
  • Create new entries on critical topics that deserve recognition
  • Fortify citations with high-impact, peer-reviewed sources

The editathon will also include an introductory session for newcomers, providing guidance on Wikipedia editing policies and best practices.
By contributing to Wikipedia, attendees will help bridge the gap between scientific advancements and public knowledge, ensuring that key bioinformatics topics are well-documented and freely available. No prior Wikipedia editing experience is required—just a passion for science communication and open knowledge!

Want to contribute? Register here and help shape the future of open science!

Frequently asked Questions here

- top -


ISCB-China Workshop

Coming soon.

- top -


LGBTQI+ Meet-Up at ISMB/ECCB

Date: Tuesday, July 22, 2025
Time: 13:00 – 14:00
Location: TBD

Join us for an LGBTQI+ lunch meet-up, an opportunity to connect, share experiences, and foster community in a welcoming space. This gathering continues the momentum of the first ISCB LGBTQI+ Symposium and the informal meet-up at ISMB 2024. Whether you're looking to network, find support, or simply enjoy meaningful conversations, we welcome you to be part of this inclusive event!

All LGBTQI+ attendees and allies are invited! Bring your lunch and join us for an hour of connection and camaraderie. 🌈

Want to learn more about the LGBTQI+ Symposium, its impact, and why these spaces are important? Check out our editorial article: Queer voices in computational biology: the first ISCB LGBTQI+ Symposium in Bioinformatics Advances: https://doi.org/10.1093/bioadv/vbae204

- top -


Bioinfo4Women Meet-Up: Supporting Women in Bioinformatics

Date: Monday, July 21, 2025
Time: 13:00 – 14:00
Location: TBD
Organized by: Bioinfo4Women

Women are still underrepresented in science, particularly at senior career levels and leadership. The "leaky pipeline" persists in bioinformatics, as it does in other STEM areas, due to biases in hiring, tenure, visibility, and recognition. Addressing these disparities is essential for fostering diversity, inclusion, and innovation in academia. Gender equality is not merely an ethical imperative but also a key promoter of productivity and innovation among scientific teams.

This Birds of a Feather session is a continuation of desk research and takeaways from a workshop that will be conducted during the ELIXIR All Hands in June 2025, providing an open forum to share and exchange initiatives that enable and foster women in science. The Bioinfo4Women initiative will share its successful activities, including mentoring and fellowship schemes, visibility actions, establishment of research lines, and best practices for advancing women to leadership positions.

The session will end with an open debate, where participants can exchange experiences and discuss pragmatic strategies for enhancing gender equality in bioinformatics and beyond.

Speakers:
Davide Cirillo
María Chavero Díez 
María Flores Rodero
María Morales Martínez

Format: Open discussion and presentation of successful initiatives.  All are welcome to participate and share experiences.

- top -



Serene Stretch Symposium

Date: Monday, July 21 to Thursday, July 24
Time: 7:30 – 8:00
Location: TBD

Join us for the Serene Stretch Symposium, a refreshing way to start your conference day!

This gentle, 30-minute morning yoga session is open to all and designed to help you feel energized and focused for the scientific programme ahead. Registration isn't required, but is appreciated so we can best plan how to accommodate all those interested. Simply add it to your registration and make sure to bring your own mat or towel!

Whether you're a beginner or a seasoned yogi, take this time to reset and recharge before diving into the day!

There will be a series of in-person and virtual tutorials prior to the start of the conference. Tutorial registration fees are shown at: https://www.iscb.org/ismbeccb2025/register#tutorials

In-person Tutorials (All times GMT)

Virtual Tutorials: (All times GMT) Presented through the conference virtual platform

Tutorial IP1: Machine Learning for Omics: Best practices and Real-Life Insights with TidyModels SOLD OUT

Room: TBD
Date: July 20, 2025
Start Time: 09:00
End Time: 18:00

Organizer:
Jamie Soul

Speakers:
Jamie Soul, University of Liverpool
Eva Caamano Gutierrez, University of Liverpool
Anthony Evans, University of Liverpool

Max Participants: 30

Description
Omics data analysis presents unique challenges due to its high dimensionality and complexity. Supervised machine learning (ML) offers powerful tools for gaining insights from these data but currently faces a crisis of reproducibility due to poor adherence to best practices when undertaking feature selection, model evaluation, and needs for further interpretability.

This full-day tutorial introduces participants to the common pitfalls and best practices of applying ML to omics research. It exemplifies good practice through example using the Tidymodels framework for ML workflows in R, tailored to omics applications. The course will feature a mixture of lectures, quizzes, real-life coding tutorials and hands-on practicals with 1-1 support. Example applications will illustrate regression analysis with methylation clocks, gene prioritisation and classification with cancer biomarker discovery.

Special attention will be paid to challenges in working with highly multivariate data and integrating various data types as well as providing tips to extract meaningful insights from complex data. Beginner-level R skills are required, and attendees will leave with practical skills to apply Tidymodels to their own datasets.

Learning Objectives

  • Understand the challenges and pitfalls of using supervised machine learning on omics data, including reproducibility, overfitting, and feature selection.
  • Ability to critically appraise published examples as well as gain familiarity with reporting practices such as DOME.
  • Gain hands-on experience using Tidymodels to implement machine learning workflows in R.
  • Learn techniques for feature selection, dimensionality reduction, and inclusion of network-based information.
  • Explore interpretable machine learning approaches to improve biological insights.
  • Develop skills to apply Tidymodels for methylation clock modeling, gene prioritisation and mechanistic biomarker discovery.

Intended Audience and Level
This course is designed for researchers with at least a basic understanding of R programming and an interest in applying machine learning to omics data analysis. Prior exposure to omics data is beneficial but not mandatory. The course will provide a gentle introduction to relevant concepts while focusing on practical skills development. No prior experience with Tidymodels or advanced machine learning techniques is required. The tutorial is tailored to beginners and those looking to expand their skills in machine learning data analysis using R.

Schedule

09:00-09:45 Lecture: Introduction to Machine Learning in Omics
  • Introductions and overview of the course structure and learning objectives
  • Introductory lecture on ML principles, applications and common pitfalls in application to omics data, including “crisis of reproducibility”, high dimensionality, and interpretability challenges
  • Introduction to reporting recommendations (e.g. DOME, FAIR adherence) and quiz to reinforce key concepts
Eva Caamano Gutierrez
09:45-10:00 Hand-on: Critical appraisal of published examples
Identifying good practice and areas of development. Expand familiarity with best reporting practices.
10:00-10:45 Tidymodels Framework: A Practical Introduction
A follow along tutorial covering:
  • Introduction to Tidymodels: Overview of the Tidymodels ecosystem and its core packages and how tidymodels streamlines the end-to-end machine learning workflow
  • Understanding and Avoiding Data Leakage: Explanation of data leakage and why it can compromise model validity and examples of best practices
  • Specifying Preprocessing Recipes: How to define reusable preprocessing pipelines using the recipes package
  • Model Fitting with Parsnip: Introduction to different model types and how Tidymodels supports multiple ML algorithms
  • Model Evaluation with Yardstick: Best practices for evaluating model performance using appropriate metrics and hands on demonstration
Dr Jamie Soul
10:45-11:00 Coffee Break
11:00-12:30 Hands-on: Explore Tidymodels basics with a provided Quarto notebook
Participants will work through a quarto notebook at their own pace implementing a simple classification model using the tidymodels framework based on the previous demonstration.
12:30-13:00 Lecture: Strategies for Managing Large Omics Datasets
This lecture will provide strategies for handling the challenges posed by large omics datasets. Topics will include class imbalance, techniques for reducing dimensionality, and feature selection methods. Participants will learn how these methods can enhance model performance and reveal key biological insights. The session will conclude with a quiz to test understanding and reinforce key concepts discussed during the lecture.
Jamie Soul
13:00-14:00 Lunch Break
14:00-15:15 Hands-on: Using Tidymodels to build a methylation clock
In this interactive session, participants will learn how to use the Tidymodels framework to build a cross-validated regression model. The focus will be on predicting the age of an organism using methylation array data, providing a practical introduction to model building, validation, and analysis in the context of biological data.
15:15-15:30 Lecture: A short snapshot of networks and pathways
Introduction to networks and considerations for inclusion for ML
Anthony Evans
15:30-16:00 Hands on: Enhancing Machine Learning with Biological Context: Integrating Networks and Pathways
This practical session will explore strategies for incorporating additional biological information, such as networks and pathway data, into machine learning models. Participants will examine how these methods can enhance model performance and biological relevance when applied to the task of gene prioritsation.
16:00-16:15 Coffee Break
16:15-16:30 Lecture: a short introduction to gaining model interpretability
Introduction to interpreting models and tradeoffs between performance and interpretability.
Anthony Evans
16:30-17:30 Hands On: Identifying biological mechanisms with ML
With examples applied to cancer biomarker discovery, participants will explore how interpretable machine learning (ML) techniques can be used to gain insights into biological mechanisms. The session will provide a practical introduction to key interpretability tools, such as SHAP (SHapley Additive exPlanations) values which help understand the relationship between input features and model predictions.
17:30-18:00 Wrap-Up and Discussion Review of key concepts, Q&A, and discussion of applications to their own research.

- top -

Tutorial IP2: Massively parallel reporter assays in functional regulatory genomics and as part of the IGVF data resource

Room: TBD
Date: July 20, 2025
Start Time: 09:00
End Time: 18:00

Organizer:
Martin Kircher

Speakers:
Michael Love, University of North Carolina
Martin Kircher, University of Lübeck and Kiel University
Max Schubach, Berlin Institute of Health at Charité
Jonathan Rosen, University of North Carolina
Pia Keukeleire, University of Lübeck and Kiel University
Kilian Salomon, Berlin Institute of Health at Charité

Max Participants: 100

Description
This tutorial is designed to empower bioinformatics researchers with the knowledge and skills to effectively utilize Massively Parallel Reporter Assays (MPRAs) data in their work. MPRAs are gaining wider applications across the functional genomics community and are used as part of the Impact of Genomic Variation on Function (IGVF) Consortium. IGVF is a collaborative research initiative funded by the NHGRI that aims to systematically study how genomic variations affect genome function and, consequently, phenotypes. By integrating experimental and computational approaches, IGVF seeks to map and predict the functional impacts of genetic variants, providing a comprehensive catalog of these effects.

This tutorial provides a thorough introduction in MPRAs and IGVF data resources, practical training on MPRA data, and insights into advanced analysis methods for such data. Participants will gain an understanding of MPRA experiments, including their various experimental designs and the rationale for using them in functional genomics. This will involve learning the process of associating tags/barcodes with sequences incorporated in the reporter constructs from raw sequencing reads and counting barcodes from DNA sequencing and RNA expression. The tutorial will guide participants through data processing using MPRAsnakeflow, a streamlined snakemake workflow developed with IGVF for efficient MPRA data handling and QC reporting. Statistical analysis for sequence-level and variant-level effect testing of MPRA count data will be introduced using BCalm, a barcode-level MPRA analysis package developed as part of our IGVF efforts.

Further, the tutorial will provide a starting point for training (deep learning) sequence models on MPRA data and related functional genomics datasets. Participants will learn how to extract meaningful insights from their datasets by investigating the sequence activity relationship and extracting important sequence motifs. By integrating these topics and methods, participants will leave the tutorial equipped with both theoretical knowledge and practical skills necessary for analyzing and using MPRA data effectively.

Learning Objectives

  • Introduction to IGVF and how to access its data.
  • MPRA Experiments: Explanation of experimental design, including the association of barcode/tag sequences to tested regulatory sequences and variants.
  • Step-by-step data processing using MPRAsnakeflow, including construction of count tables and interpretation of quality control metrics.
  • Statistical analysis of DNA and RNA barcode counts using BCalm.
  • Applications of MPRA data like identifying active regulatory regions and significant variant effects, optimising cell-type-specific expression through synthetic sequences.
  • Sequence-based modeling of regulatory sequence activity, i.e. using deep learning models to predict open chromatin and MPRA activity, investigate transcription factor motif importance and composition.

Intended Audience and Level
Beginner

Schedule

09:00-09:15 Welcome and introduction to the schedule
09:15-09:30 Tutorial materials access and setup of the Google CoLab environment
Participants that fail with setting up their access und CoLab will be brought outside of the seminar room and helped by other speakers while the theoretical introductions commence
09:30-09:50 Lecture: IGVF and its data access
Introduction to the IGVF mission and how to access its data
09:50-10:15 Lecture: Fundamentals of MPRA Experiments
Explanation of experimental design, including the association of barcode/tag sequences to tested regulatory sequences and variants.
10:15-10:45 Coffee break
10:45-11:05 Hands-on: Association of Barcode/Tag Sequences
Participants will learn how to associate barcode/tag sequences with designed oligos using the association step of the MPRAsnakeflow pipeline on a subsetted/small dataset.
11:05-11:30 Hands-on: Count Sequencing Analysis
We will demonstrate how to count barcodes in both DNA and RNA sequencing on a subsetted/small dataset.
11:30-11:50 Discussion of QC metrics
We will discuss the QC parameters and plots returned by MPRAsnakeflow for the full data set, emphasizing the importance of accurate quantification in MPRA experiments.
11:50-12:30 Hands-on: Data Analysis Steps (Regions and Variants)
We will walk through potential analytical steps for utilizing count tables and perform statistical analysis using BCalm to identify regions with high activity compared to other sequences in the assay. Additionally, we will highlight methods for identifying variants with significant differences between reference and alternative sequences within designed oligos.
12:30-13:30 Lunch break
13:30-13:50 Lercture: Modeling of regulatory activity with sequence models
We will provide a brief introduction and review of modeling efforts for sequence activity using gapped kmers, convolutional neural networks and language model approaches.
13:50-14:30 Hands-on: Training a sequence based model
We will walk through the steps of training a model based on the DeepSTARR CNN architecture.
14:30-14:50 Discussion of pre-training and fine-tuning approaches
We discuss the opportunity of pre-training convolutional layers on open chromatin data from multiple cell-types and fine-tuning these convolutions on MPRA data vs. using a language model like NT to train MPRA activity models.
14:50-15:30 Hands-on: Interpreting models with in-silico mutagenesis
Supported by tools like TF-MoDISco for motif discovery, participants will investigate important transcription factor binding motifs using models trained on sequence data that predict activity. We will assess whether these motifs exert activating or repressing effects by comparing the activity of sequences with and without these identified motifs in the cell-type of interest.
15:30-16:00 Coffee break
16:00-16:40 Hands-on: Linking motifs to variant effects
We will identify transcription factor binding sites (TFBS) overlapping significant variants using motifs derived from model interpretation or available position weight matrices (PWMs) to support the interpretation of variant effects.
16:40-16:50 Discussion of data limitations
We discuss the limitations of variant effects in various sequence contexts. We consider the effects of studying only few or related cell-types and the sensitivity of the available modeling approaches for cell-type effects.
16:50-17:20 Participant questions and feedback
17:20-17:30 Concluding remarks by the speakers

- top -

Tutorial IP3: Genomic Variant Interpretation & prioritisation for clinical research

Room: TBD
Date: July 20, 2025
Start Time: 09:00
End Time: 18:00

Organizer:
Aleena Mushtaq

Speakers:
Mallory Freeberg, EMBL's European Bioinformatics Institute (EMBL-EBI)
Aleena Mushtaq, EMBL's European Bioinformatics Institute (EMBL-EBI)
Diana Lemos, EMBL's European Bioinformatics Institute (EMBL-EBI)
James Stephenson, EMBL's European Bioinformatics Institute (EMBL-EBI)
Sarah Hunt, EMBL's European Bioinformatics Institute (EMBL-EBI)
Irene Lopez Santiago, Open Targets Bioinformatician (EMBL - EBI)
Genevieve Evans, EMBL's European Bioinformatics Institute (EMBL-EBI)

Max Participants: 40

Description
The interpretation of genetic variation is important for understanding human health and disease. Increased knowledge leads to societal benefits including faster disease diagnosis, a better understanding of disease progression, more efficient identification and prioritisation of drug targets for testing, resulting in overall better health outcomes for a population. Whilst the speed and cost of sequencing has reduced, the complexity of variant interpretation remains a bottleneck for understanding. This tutorial will explore the variety of annotations and techniques available to assess human variation and the implications of variant effects on human health and disease.

Learning Objectives

By the end of the tutorial, we expect participants will be able to:

  1. Explore human variation types in commonly used bioinformatics file formats
  2. Compare experimental methods for variant effect analysis and understand the specific strengths of each.
  3. Use bioinformatics knowledge bases to explore information about genetic variants and to contrast different approaches to see how they can be combined for thorough interpretation.
  4. Evaluate evidence from multiple sources supporting variant effect and impact in the context of research and study design
  5. Explore the impact of variant interpretation on clinical diagnostics and drug tractability.

Intended Audience and Level
Attendees should have an undergraduate degree / diploma level understanding of molecular biology, genetics, or biochemistry. It is a tutorial offered at the beginner level and no prior background in variant interpretation is required. Participants must be comfortable using web browsers. Laptop computers with an Internet connection will be required for the tutorial sessions.

Schedule

09:15-09:30 Introduction - The challenge of variant interpretation
Variation in context of human health
09:30-10:00 Lecture: - Genomic Annotation for variation datasets
Public annotation datasets, Variation sources, Transcript based annotations, Non-coding variation, Structural variation
10:00-10:45 Hands-on: Annotating and predicting molecular variant effect
Variant prioritisation and scoring methods
10:45-11:00 Break
11:00-11:15 Lecture: Understanding Variant Effects Using Protein Structure
Protein position, interaction and complexes for variant interpretation, Alphafold3 and predicted structures
11:15-11:45 Lecture: Understanding Variant Effects Using Protein Function
Combining functional, structural and population annotations to contextualise variant effects in proteins
11:45-12:15 Hands-on: Using protein databases to investigate variant impact
Using structural information to interpret variant effect
12:15-12:45 Lecture: Deep Mutational Scanning
Genome Editing for Variant Analysis
12:45-14:00 Lunch Break
14:00-14:30 Lecture: Resources for clinical variant classification
Utilising clinical data in variant prioritisation and classification, Applications fordisease research and genomic diagnosis, GWAS, DECIPHER and G2P
14:30-15:00 Lecture: Target tractability and drug associations
Target prioritisation for drug discovery, Case studies
15:00-16:00 Hands-on: Group Projects
Hands on activity using bioinformatics resources for variant interpretation
16:00-16:15 Break
16:15-17:00 Hands-on: Group Projects
Hands on activity using bioinformatics resources for variant interpretation
17:00-17:45 Presentations from groups
Present to peers to discuss ideas and future work
17:45-18:00 Closing Remarks

- top -

Tutorial IP4: Quantum Machine Learning for multi-omics analysis

Room: TBD
Date: July 20, 2025
Start Time: 09:00
End Time: 18:00

Organizer:
Aritra Bose

Speakers:
Aritra Bose, IBM Research
Hakan Doga, IBM Research
Akhil Mohan, Cleveland Clinic
Laxmi Parida, ISCB Fellow
Bryan Raubenolt, Cleveland Clinic
Filippo Utro, IBM Research

Max Participants: 50

Description
Single-cell and population-level multi-omics analyses have greatly enhanced our understanding of biological complexity. By integrating various types of biological data—such as genomics, proteomics, and transcriptomics, collectively known as multi-omics—these approaches have provided deep insights into the molecular mechanisms underlying complex diseases, both at the cellular level and across patient populations. As the size and complexity of multi-omics data continues to grow, the need to leverage emerging technologies such as artificial intelligence (AI) and quantum computing (QC) also grows. Recently, advances in QC have shown promise in solving real-world problems in machine learning and optimization in biomedicine, drug discovery, biomarker discovery, clinical trials, among other healthcare and life sciences objectives [1,2,3,4,5].

In this tutorial, participants will learn the fundamental concepts of QC, engage in hands-on experiments that apply classical machine learning (ML) techniques. They will also learn best practices for pre-processing multi-omics data in preparation for quantum machine learning (QML) tasks. Through a systematic evaluation of various data complexity measures and their impact on the performance of different ML and QML models, participants will gain insights into when to effectively utilize QML models. Additionally, they will explore quantum-classical hybrid workflows for ML, with a focus in biomedical data analysis.

Learning Objectives

  • Understanding the fundamentals of quantum computing, including learning how to
    implement algorithms in a quantum computer with quantum gates and circuits using
    Qiskit.
  • Practical experience of pre-processing multi-omics data and preparing it for a quantum
    hardware experiment.
  • Analyze machine learning methods on multi-omics data, understand their shortcomings
    and review the impact of data complexity measures on ML models.
  • How to apply QML models on multi-omics data.
  • Learn design of experiments for biomedical data using quantum computers by gaining
    an in-depth knowledge of quantum-classical hybrid workflows.
  • Understand when to apply QML models and benchmark it with classical ML models.

Intended Audience and Level
This tutorial is aimed at computational biologists, bioinformaticians, clinicians, practitioners, data analysts, including early-career to senior researchers in the fields of healthcare and life sciences enthusiastic to learn about new frontiers of computational biology. There are very few prerequisites for the tutorial, listed as follows:

  • Create an IBM Quantum account in IBM Quantum Learning website, click on “Create an IBMid” and follow the instructions.
  • Watch the Qiskit Global Summer School videos – QML 2021 (optional).
  • Entry-level knowledge of multi-omics data analyses and machine learning concepts.
  • Review https://github.com/IBM/QMLOmics for ISMB 2024’s “Quantum-enabled multiomics analysis” tutorial content (optional).

Schedule

09:00-09:15 Introduction

09:15-10:15 Hands-on: Quantum computing fundamentals with Qiskit
10:15-10:45 Hands-on: Classical Machine learning applications in multi-omics data
10:45-11:00 Coffee Break
11:00-12:00 Hands-on: Quantum Machine Learning in multi-omics data
12:00-13:00 Hands-On: Data Complexity measures and learning algorithms
13:00-14:00 Lunch
14:00-14:30 Lecture: Quantum-classical hybrid framework for machine learning applications in multiomics
14:30-16:00 Hands-on: Implement your own quantumclassical machine learning hybrid framework
16:00-16:15 Coffee Break
16:15-17:15 Hands-onm: Reviewing results from the practical session
17:15-17:45 Interactive Q&A
17:45-18:00 Concluding Remarks

- top -

Tutorial IP5: Introduction to Causal Analysis using Mendelian Randomisation

Room: TBD
Date: July 20, 2025
Start Time: 09:00
End Time: 13:00

Organizer:
Handan Melike Dönertaş

Speakers:
Handan Melike Dönertaş, Fritz Lipmann Institute on Aging
Mark Olenik, Fritz Lipmann Institute on Aging
Tayyaba Alvi, Fritz Lipmann Institute on Aging

Max Participants: 20

Description
Mendelian randomisation (MR) is a method that uses genetic variation associated with an exposure (e.g., behaviours, biomarkers) to infer its causal effect on an outcome (e.g. health status). In statistical terms, it functions as an "instrumental variable" approach.

By mimicking the design of a randomised controlled trial through genetic inheritance, MR provides a framework for addressing confounding and reverse causation, making it a valuable tool in epidemiological and biomedical research.

This workshop offers a beginner-friendly introduction to the key concepts and assumptions underlying MR, such as the use of genome-wide association study (GWAS) data and the three key assumptions for valid instrumental variables: relevance, independence, and exclusion restriction. Participants will explore common challenges in MR analysis, including pleiotropy, population stratification, and measurement error while learning strategies to overcome these using advanced methods. The workshop also includes a two-hour hands-on session in which attendees will work with real-world data to conduct MR analyses using R. By the end of the session, participants will have a clear understanding of MR principles, the ability to critically evaluate MR studies, and practical skills to apply MR methods in their own research.

Learning Objectives

  1. Understand the key principles and assumptions underlying Mendelian Randomization.
  2. Identify potential challenges in MR studies, including biases and violations of assumptions.
  3. Familiarize with various MR methods and their applications (e.g., two-sample MR, IVW, MREgger, weighted median).
  4. Learn how to access and prepare genetic datasets for MR analysis.
  5. Conduct MR analyses using R, visualize and interpret results.

Intended Audience and Level
This workshop is designed for researchers and students new to MR or looking to incorporate it into their studies. No prior experience is required, although familiarity with R is required.

Schedule

09:00-09:45 Introduction to Mendelian Randomization
  • Overview of MR and its applications
  • Key assumptions: relevance, independence, and exclusion restriction
  • Types of MR studies (one-sample, two-sample, bi-directional)
09:45-10:45 Challenges in Mendelian Randomization
  • Horizontal pleiotropy and methods to address it
  • Sample overlap and measurement error
  • Population stratification and genetic heterogeneity
10:45-11:00 Coffee Break
11:00-13:00 Hands-On Tutorial in R
  • Preparing exposure and outcome GWAS data
  • Conducting basic MR analysis (TwoSampleMR R package)
  • Robust methods: MR-Egger, weighted median, and leave-one-out analysis
  • Interpreting and visualizing results

- top -

Tutorial IP6: Hello Nextflow: Getting started with workflows for bioinformatics

Room: TBD
Date: July 20, 2025
Start Time: 09:00
End Time: 13:00

Organizer:
Geraldine Van der Auwera

Speakers:
Geraldine Van der Auwera

Max Participants: 30

Description
Nextflow is a powerful and flexible open-source workflow management system that simplifies the  development, execution, and scalability of data-driven computational pipelines. It is widely used in bioinformatics and other scientific fields to automate complex analyses, making it easier to manage and reproduce large-scale data analysis workflows.

This training workshop is intended as a “getting started” course for students and early-career researchers who are completely new to Nextflow. It aims to equip participants with foundational knowledge and skills in three key areas: (1) understanding the logic of how data analysis workflows are constructed, (2) Nextflow language proficiency and (3) command-line interface (CLI) execution.

Participants will be guided through hands-on, goal-oriented exercises that will allow them to practice the following skills:

  • Use core components of the Nextflow language to construct simple multi-step workflows effectively.
  • Launch Nextflow workflows locally, navigate output directories to access results, interpret log outputs for insights into workflow execution, and troubleshoot basic issues that may arise during workflow execution.

By the end of the workshop, participants will be well-prepared for tackling the next steps in their journey to develop and apply reproducible workflows for their scientific computing needs. Additional study-at-home materials will be provided for them to continue learning and developing their skills further.

Learning Objectives
We have delivered versions of this tutorial session at both online and in-person events previously (total of ~300 participants since May 2024), and have typically had to turn away would-be participants due to higher demand than we could accommodate. Post-event satisfaction surveys have typically shown the material was very well received. We are continuing to further expand and refine the material based on participant feedback.

Intended Audience and Level

Schedule

09:00-10:00 Hello World
Basic components and principles involved in assembling and running a Nextflow workflow.
  1. Learn the basics of the Nextflow syntax, how a workflow script is structured and how it can be modified
  2. Run a workflow for the first time, parse messages and logs produced during a run, and find outputs
  3. Use variables and key command-line parameters to control inputs, outputs and execution behavior
  4. Chain multiple steps together and handling transfer of data between steps
  5. Handle input and output files
10:00-10:20 Hello Containers
Using containers as a mechanism for managing software dependencies in the context of reproducible bioinformatics workflows.
10:20-10:50 Hello Config
Setting up and managing a pipeline’s configuration to customize its behavior, adapt it to different environments, and optimize resource usage.
10:50-11:05 Hello Modules
Using code modules to make pipeline development and maintenance more efficient and sustainable.
11:05-11:20 Next steps
Overview of educational resources that participants can use to continue developing their Nextflow skills. Includes several domain-specific tutorials (currently Genomics, RNAseq in development) that provide a practical application of the concepts learned in this workshop to relevant use cases, and an orientation to the nf-core ecosystem of tools and pipelines that can be used out-of-the-box or as building blocks for customized solutions.

- top -

Tutorial IP7: AI large cellular models and in-silico perturbation

Room: TBD
Date: July 20, 2025
Start Time: 14:00
End Time: 18:00

Organizer:
Xuegong Zhang

Speakers:
Xuegong Zhang, Tsinghua University
Chen Li, Tsinghua University
Erpai Luo, Tsinghua University
Mo Chen, Tsinghua University

Max Participants: 50

Description
Transformer-based large language models (LLMs) are changing the world. The capabilities they illustrated in sophisticated natural language, vision and multi-modal tasks have inspired the development of large cellular models (LCMs) for single-cell transcriptomic data, such as scBERT, Geneformer, scGPT, scFoundation, GeneCompass, scMulan, etc. After pretraining on massive amount of single-cell RNA-seq data agnostic to any downstream task, these transformer-based models have demonstrated exceptional performance in various tasks such as cell type annotation, data integration, gene network inference, and the prediction of drug sensitivity or perturbation responses. Such advancements, albeit still in their early stage, suggested promising revolutionary approaches for leveraging AI to understand the complex system of cells from extensive datasets beyond human analytical capacity. Especially, such models have made it possible to conduct in-silico perturbation on cells of various types to predict their responses to gene perturbations without doing experiments on the cells. These models provided prototypes of digital virtual cells that can be used to reconstruct and simulate live cells, which will revolutionize many aspects of future biomedical studies.
Although the community is high enthusiastic to these exciting progresses, the structures and algorithms of LCMs and other similar-scale AI models are mysterious to many people who were not equipped with relevant backgrounds. This tutorial will try to fill this gap. In the tutorial, we will begin from an introduction of basic principles of deep neural networks, and explain the basic structure and algorithm of the original Transformer for natural language tasks. We’ll show to the attendees how to build such models based on current machine learning platforms. Then we’ll introduce several successful ways to build large cellular models based on the basic Transformer model, and overview how such models are pretrained on single-cell RNA-seq data. We’ll show and let the attendees to practice how to use LCMs for basic tasks such as cell type annotation, and look into the specific application of LCMs on in-silico perturbation tasks. Attendees will engage in hands-on activities such as building basic transformer models and executing downstream single-cell tasks, including cell type annotation and in-silico perturbation. These activities will remove the mystery of LCMs for the attendees and help them better understand and feel how LCMs can be built and applied.

Learning Objectives
Attendees will build up their basic understanding of deep neural networks, Transformers, LLMs and LCMs. They will gain hands-on experiences on building such models and using existing models for downstream tasks. The particular knowledge and skill they’ll learn from this tutorial include:

  • Understanding the basic principles of transformer models and large language models
  • Knowing of the basic structure and components of transformer-based LCMs
  • Experiencing building their own Transformer models
  • Using pretrained LCMs cellular analyses on tasks such as cell label transfer and in-silico perturbation
  • Forming a more complete picture of the current and future development of the field.

Intended Audience and Level
This tutorial is designed to be friendly for everyone who is interested in current AI but does not know much about how it works, including computational biologists, biologists, clinicians, or other professionals. For those who know something about AI and who had some pervious experiences on it, the tutorial will provide deeper understandings and hands-on experiences on advanced virtual cell applications of LCMs. Attendees are expected to have some basic knowledge of Python programming for completing the hands-on activities.

Schedule

14:00-14:30 Introduction to artificial neural networks and deep learning
14:35-15:10 Introduction to language models and Transformers
15:10-15:25 Coffee break
15:25-15:55 Basic structure and components of transformer-based LCMs
15:55-16:15 Hands-on: Build transformer model with Python
  • Building a toy transformer model with PyTorch
  • Using pretrained LCMs for cell type annotation
16:15-16:45 LLMs for in-silico perturbation
16:45-17:00 Coffee break
17:00-17:20 Hands-on: in-silico gene perturbation
  • Model construction and data preparation
  • In-silico gene perturbation with LCMs
  • Evaluating the results
17:20-17:40 Discussions
17:40-18:00 Summary

- top -

Tutorial IP8: Representation Learning and Feature Engineering for Genomic Sequences Analysis SOLD OUT

Room: TBD
Date: July 20, 2025
Start Time: 14:00
End Time: 18:00

Organizer:
Fabiana Goes

Speakers:
Fabiana Goes, The Rosalind Franklin Institute
Aparajita Karmakar, The University of Edinburgh and The Rosalind Franklin Institute, UK

Max Participants: 15

Description
Machine learning (ML) has been successfully applied in different omics problems, such as sequence classification in the field of genomics. The effectiveness of ML methods relies greatly on the selection of the data representation, or features, that extract meaningful information from sequences. Genomic sequences can be viewed as one-dimensional strings of successive letters representing nucleotides. However, to make these sequences compatible with ML methods, they must first be transformed into structured numerical representations, such as vectors or matrices. Traditional methods for sequence classification often rely on manually crafted or pre-defined features, which require domain expertise and may not fully capture the complexity of the underlying biological information. Recently, representation learning has emerged as a powerful alternative, enabling the automatic extraction of latent patterns directly from raw data and reducing the dependence on manually crafted features. In genomics, representation learning methods have been introduced to characterize DNA and RNA sequences. In genomics, techniques like Word2Vec, Convolutional Neural Networks (CNNs) and Large Language Models (LLMs) have demonstrated the ability to learn optimal sequence representations that effectively capture both local and global patterns in DNA and RNA sequences.

This tutorial provides a comprehensive introduction to feature engineering and representation learning for genomic sequences (DNA/RNA). Participants will explore traditional techniques for extracting features from genomic sequences, building a foundation in classical approaches. Furthermore, the tutorial will cover representation learning, introducing concepts such as embeddings and their applications. Topics include methods such as Word2vec and LLMs to obtain meaningful representations from genomic sequences. Through hands-on exercises and comparative analyses, attendees will learn to combine traditional feature engineering with representation learning approaches, developing practical skills and insights that are adaptable to diverse genomic research challenges. The goal is to offer participants the knowledge and tools to enhance genomic sequence analysis using different techniques for sequence representation.

Learning Objectives

  • Participants will learn to analyze and extract numerical representations from genomic sequences (DNA/RNA) using both traditional and advanced techniques. Following the tutorial, they will be able to:
  • Obtain an overview of the advancement of methods, from traditional feature extraction methods to representation learning techniques, and understand their impact on genomic data analysis.
  • Understand the differences between genomic sequence embeddings and handcrafted features, exploring their roles in capturing complex patterns and insights within genomic sequences.
  • Learn how to extract and engineer features from genomic sequences, focusing on key aspects like k-mers, sequence composition, and numerical encoding, and apply bioinformatics tools and scripts for feature engineering.
  • Comprehend the fundamentals of representation learning and its applications in genomics, focusing on techniques such as word2vec, DNABERT2, and Nucleotide Transformer
  • Apply machine learning pipelines for genomic sequence classification, utilizing both feature engineering and representation learning techniques, and gain hands-on experience with relevant tools and libraries.
  • Evaluate and compare feature extraction and representation learning approaches in terms of their effectiveness for genomic sequence classification task.

Intended Audience and Level
This tutorial is designed for participants at different academic levels, from undergraduate students to early career researchers, as well as scientists from both academic and industry backgrounds, who have an interest in genomic sequence analysis and Machine Learning. We encourage participants to follow along with the hands-on exercises, whether they choose to actively participate or simply observe. For those who wish to engage in hands-on training, we recommend having a basic understanding of a few prerequisites: familiarity with running commands in a terminal, foundational knowledge of computer programming (preferably in Python), and a basic understanding of Machine Learning concepts.

The slides and materials for the hands-on exercises (scripts and datasets) will be shared online prior to the tutorial. All resources will be freely accessible to participants through a dedicated GitHub repository.

Schedule

14:00-14:45 Lecture: Exploring Traditional Feature Engineering in Genomics
  • Overview: Introduction to machine learning for sequence classification and methods to extract numerical information from biological sequences
  • Introduction to traditional feature extraction:
    • Sequence composition: GC content, nucleotide composition, basic k-mer and accumulated nucleotide frequency
    • Numerical mapping: Z-curve and one-hot encoding
  • Example of public software packages: Seq2Feature, iFeature and iLearn
14:45-15:45 Hands-on: Feature Extraction in Practice — Crafting Descriptors for Genomic Analysis
  • Practical exercises for extracting traditional genomic features
  • Application of basic ML pipeline for sequence classification
  • Comparative analysis of the effectiveness of different feature extraction approaches
15:45-16:00 Coffee Break
16:00-16:45 Lecture: Decoding the Genomic Language — Embeddings and Representations for Genomic Sequences
  • Introduction to representation learning for genomic sequences, highlighting their role in advancing genomic analysis
  • Introduction to embeddings: Overview of word embeddings, with a focus on Word2Vec and its application to genomic sequences
  • Foundation models in genomics: Overview of LLMs, focusing on DNABERT2 and Nucleotide Transformer for sequence representation learning
16:45-18:00 Hands-on: Embedding Genomic Data — From Word Embeddings to Large Language Models
  • Practical exercises applying Word2vec and LLMs (DNABERT2 and Nucleotide Transformer) to extract features from genomic sequences
  • Evaluation and comparison of the features obtained from traditional methods and embedding techniques

- top -

Tutorial VT1: Visualising and interpreting your -omics results using ggplot2 and R

Room: Virtual
Date: July 14, 2025
Start Time: 09:00
End Time: 18:00

Organizer:
Emily Johnson

Speakers:
Emily Johnson, University of Liverpool
Euan McDonnell, University of Liverpool
Lauren Mee, University of Liverpool

Max Participants: 30

Description
This full-day tutorial introduces participants to the principles of impactful data visualisation and equips them with the skills to create publication-ready visualisations for -omics data. Designed for beginners with basic knowledge of the R programming language, the tutorial will guide attendees through creating and interpreting key visualisations such as volcano plots, box plots, heatmaps, dot plots, and network diagrams.

Participants will work with real biological datasets in Quarto notebooks and learn how to use tools like ggplot2, ComplexHeatmap, igraph, ggraph, and ClusterProfiler. Through hands-on coding exercises and interactive lectures, attendees will develop an intuition for ggplot2’s grammar of graphics, best practices in data visualisation, and the application of functional enrichment methods to contextualise results.

Whether you are a student or researcher wanting to improve your visualisation skills, or a computational biologist looking to enhance the presentation of your results, this tutorial will provide the tools and knowledge to produce professional-quality figures.

Learning Objectives

  • To understand the principles of good data visualisation, including best practices and common pitfalls to avoid.
  • To learn about the most common types of visualisations for -omics data, including volcano plots, box plots, heatmaps, dot plots, and networks:
    • Create each type of visualisation using appropriate R packages.
    • Effectively interpret outputs generated.
  • To develop a working understanding of ggplot2 syntax, enabling participants to apply learned concepts to their own datasets.
  • To learn how to produce and interpret biological networks.
  • To contextualise -omics results using functional enrichment techniques such as Over-Representation Analysis (ORA) and Gene Set Enrichment Analysis (GSEA).

Intended Audience and Level
Beginner. This tutorial is suitable for students and researchers want to learn more about common omics visualisations and how to produce them, and computational biologists that might want to enhance their visualisations. Basic familiarity with R is recommended to get the most out of the tutorial. Prior exposure to omics data is beneficial but not mandatory for this tutorial.

Schedule

09:00-09:45 Lecture: Introduction to Data Visualization
  • Welcome and introductions
  • Overview of most common types of data visualisation in omics papers
  • What makes for a good or bad data visualisation
  • Interactive quiz – participants will be given the opportunity to apply knowledge gained and suggest how they’d improve various data visualisations
Emily Johnson
09:45-10:30 Lecture: introduction to ggplot2
  • Introduction to the tidyverse and why ggplot2 is ideal for reproducible visualisation
  • Overview of the “grammar of graphics” and ggplot2 syntax
  • Examples of advanced visualisations produced with ggplot2
  • Live coding demo to introduce core concepts
Lauren Mee
10:30-10:45 10:30 - 10:45: Coffee Break
10:45-12:00 Hands-on: plotting omics data with ggplot2
A follow along tutorial covering volcano plots and box plots to explore a differential expression analysis. For each type of visualisation, we will start from the most basic way to produce the plot and iteratively build up the plot to create an intuition for ggplot2 syntax. Themes and colour palettes will also be introduced during this session. The differential expression analysis results will be pre-processed.
12:00-13:00 12:00 - 13:00: Lunch Break
13:00-13:45 Hands-on: creating heatmaps with ggplot2 and ComplexHeatMap
A follow along tutorial that builds on from the morning practical. Participants will first create heatmaps using ggplot2, then they will be introduced to a more advanced heatmap-specific package: ComplexHeatMap. The merits of both approaches will be compared. The participants will also be introduced to the idea of clustering their data for visualisation purposes.
13:45-14:15 Lecture: networks and how to interpret them
  • Introduction to networks in biological data visualisation and where they’re appropriate
  • Understanding network-specific terminology
  • Examples of networks (e.g., protein-protein interactions, co-expression networks)
  • Brief overview of inputs for and challenges of network visualisation
Euan McDonnell
14:15-15:30 Hands-on: creating network visualisations with igraph and ggraph
Hands-on coding session where attendees will use igraph and ggraph to create network diagrams. Attendees will learn about different network layouts, how to identify and emphasise certain network structures and how to use igraph outputs as an input for ggraph (a package which is built on ggplot2 and compatible with tidyverse workflows) – for high quality and high-resolution network visualisations.
15:30-15:45 Coffee break
15:45-16:15 Lecture: Contextualising Results with Functional Enrichment
  • What is functional enrichment and how does it help us make sense of our results?
  • Common databases for enrichment analysis (e.g., GO, KEGG)
  • Differences between Over-Representation Analysis (ORA) and Gene Set Enrichment Analysis (GSEA), including best practices for both (use of appropriate background for ORA and choice of ranking metrics for GSEA)
  • Visualising enrichment results: dot plots, ridge plots, and network diagrams.
Emily Johnson
16:15-17:30 Hands-on: ORA and GSEA practical
Hands-on coding session using the ClusterProfiler package to perform ORA and GSEA on the results of a differential expression analysis. Participants will produce dot plots and ridge plots and compare the outputs of the two approaches. We will cover best practices for ORA and GSEA. There will be additional supplementary content including how to carry out functional enrichment for non-model organisms and advanced analysis such as Gene set variation analysis (GSVA).
17:30-18:00 Wrap-Up and Discussion
  • Review of key concepts covered in the tutorial.
  • Q&A session and discussion on applying concepts to attendees’ own research projects.

- top -

Tutorial VT2: OmicsViz: Interactive Visualization and ML for Omics Data

Room: Virtual
Date: July 15, 2025
Start Time: 09:00
End Time: 18:00

Organizer:
Ragothaman M. Yennamalli

Speakers:
Ragothaman M. Yennamalli, SASTRA Deemed to be University
Shashank Ravichandran, Incedo Inc
Megha Hegde, Kingston University London
Jean-Christophe Nebel, Kingston University London
Farzana Rahman, Kingston University London

Max Participants: 30

Description
Data Science and Machine Learning are intricately connected, particularly in computational biology. In a time when biological data is being produced on an unprecedented scale — encompassing genomic sequences, protein interactions, and metabolic pathways- meeting the demand has never been more crucial.

Data visualization plays a crucial role in biological data sciences since it allows the transformation of complex, often incomprehensible raw data into visual formats that are easier to understand and interpret. This allows biologists to recognize patterns, anomalies, and correlations that would otherwise be lost in the sheer volume of data. In addition, machine learning (ML) has brought about a revolution in the analysis of biological data. Exploiting extensive datasets, ML provides tools to model complex systems and generate predictions. Indeed, ML algorithms excel at uncovering subtle patterns in data, contributing to tasks like predicting protein structures, comprehending genetic variations and their implications for diseases, and even facilitating drug discovery by predicting molecular interactions.

The integration of data visualization and machine learning is particularly powerful. In particular, visualization may aid in interpreting machine learning models, allowing biologists to understand and trust their predictions. It could also help fine-tune these models by identifying outliers or anomalies in the data.

Due to its remarkable capability, there has been a surge in the development and application of tools that combine data visualization and machine learning in biology. Platforms that integrate these technologies enable biologists to conduct comprehensive analyses without needing deep expertise in computer science. Assuredly, this democratization of data science and ML has empowered more and more biologists to engage in sophisticated, data-driven research.

Learning Objectives
This tutorial is divided into two parts. In the first part of the tutorial, the participants will learn how to install
and use tools for data visualization using Python.

The second part will focus on installing and using ML tools for feature selection, model training, and model optimization using Python.

By the end of this tutorial, the participants will be able to:

  1. Explain the role and significance of data visualization in the context of scientific research. (Revised Bloom’s Taxonomy [RBT] Assessment: Understand)
  2. Apply fundamental principles of data visualization to create clear and informative visual representations of data. (RBT Assessment: Apply)
  3. Create a variety of data visualizations using Python libraries, i.e., Matplotlib, Seaborn, and Plotly. (RBT Assessment: Understand)
  4. Understand the basics of colour theory and its implications for creating accessible and aesthetically pleasing visualizations. (RBT Assessment: Understand)
  5. Design data visualizations that are accessible to a diverse audience, including those with colour vision deficiencies. (RBT Assessment: Create)
  6. Gain practical skills in preprocessing data and selecting appropriate features for machine learning models. (RBT Assessment: Apply)
  7. Build, train, and evaluate machine learning models using Python libraries like Scikit-learn and TensorFlow/Keras. (RBT Assessment: Analyze)
  8. Implement machine learning algorithms on real-world biological datasets, demonstrating an understanding of the application of these techniques in biology. (RBT Assessment: Apply)
  9. Create integrated visualizations of machine learning results using tools like Yellowbrick, Bokeh, and TensorBoard. (RBT Assessment: Create)
  10. Critically evaluate and discuss the applications, challenges, and implications of data visualization and machine learning in scientific research, particularly in biology. (RBT Assessment: Evaluate)

Intended Audience and Level
The tutorial is aimed towards participants at the level of Graduate students, researchers, and scientists in both
academia and industry who are interested in Data Visualization and ML. Prerequisites: Basic to intermediate
knowledge in Python and basic knowledge in machine learning.

Schedule

09:00-10:00 Lecture: Introduction to Data visualization: Importance and Basic principles of data visualization in scientific research
Jean-Christophe Nebel
10:00-10:45 Hands-on: Python Libraries for Visualization: Matplotlib, Seaborn, Plotly and others
Farzana Rahman, Ragothaman Yennamalli, Shashank Ravichandran, and Megha Hegde
10:45-11:00 10:45AM - Coffee/Tea Break
11:00-12:00  Lecture: Colour theory in Visualization: Colour palettes, Accessible and Inclusive visualizations
Ragothaman Yennamalli
12:00-13:00 Hands-on: Creating various types of charts, plots for clarity and aesthetics
Case studies with real world datasets
Farzana Rahman, Ragothaman Yennamalli, Shashank Ravichandran, and Megha Hegde
13:00-14:00 Lunch Break
14:00-15:00 Lecture: Fundamentals of Machine Learning: Types of ML, Data preprocessing and feature
selection, model selection and training
Ragothaman Yennamalli and Farzana Rahman
15:00-16:00 Hands on: Python libraries for Machine Learning: Scikit-learn, Pandas, NumPy, TensorFlow/Keras Building models using real-world biological data
Shashank Ravichandran, and Megha Hegde
16:00-16:15 Coffee/Tea Break
16:15-17:15 Hands on: Integrating Data Viz and ML: Yellowbrick, Bokeh, Tensorboard, Scikit-plot, etc.
Farzana Rahman and Megha Hegde
17:15-18:00 Question and Answer session
Identify and highlight blocks of hands-on content in your submission

- top -

Tutorial VT3: Computational approaches for deciphering cell-cell communication from single-cell transcriptomics and spatial transcriptomics data

Room: Virtual
Date: July 15, 2025
Start Time: 09:00
End Time: 13:00

Organizer:
Giulia Cesaro

Speakers:
Giulia Cesaro, University of Padova
James Shiniti Nagai, RWTH Aachen University
Mayra Ruiz, RWTH University Hospital
Barbara di Camillo, University of Padova
Ivan G. Costa, RWTH University Hospital
Giacomo Baruzzo, University of Padova

Max Participants: 40

Description
Tissues and organs are complex and highly-organized systems composed of diverse cells that work together to maintain homeostasis, drive development and mediate complex disease progression as Myocardial Infarction (Kuppe et al. 2022). A key focus of modern biology is understanding how heterogeneous populations of cells coexist and communicate with each other (intercellular signaling), how they properly respond (intracellular signaling) within a tissue and organ system and how these processes vary across different experimental conditions (comparative analysis). Recently, a rapid expansion of computational tools exploring the expression of ligand and receptor has enabled the systematic inference of cell-cell communication from single-cell transcriptomics and spatial transcriptomics data (Armingol et al. 2021; Armingol, Baghdassarian, and Lewis 2024). These are crucial in unravelling the complex landscape of biological systems.

This tutorial aims to provide a comprehensive introduction to computational approaches for cell-cell communication inference using high throughput transcriptomics data. It covers the fundamental concepts of cellular communication and assumptions underlying analysis focusing on the main computational methods used in the field. This includes computational approaches for inter-cellular communication inference (CellphoneDB (Efremova et al. 2020); LIANA (Dimitrov et al. 2022, 2024)) and for intra-cellular signals communication (scSeqComm (Baruzzo, Cesaro, and Di Camillo 2022); NicheNet (Browaeys, Saelens, and Saeys 2019)). Next, we will describe approaches for comparative analysis of cell-cell networks in distinct biological conditions (CrossTalkeR (Nagai et al. 2021)) and methods for spatially resolved cell-cell communications (Ischia (Regan-KomitoDaniel 2024); DeepCOLOR (Kojima et al. 2024)).

In the first part of the tutorial, participants will be introduced to the theoretical basis of state-of-the-art computational approaches and will learn how to use representative tools for inferring intercellular signaling and intracellular signaling pathways. In the second part, we will focus on the comparative analyses, i.e. changes of cell-cell communication in two conditions, and subsequently highlighting the unique insights spatial transcriptomics data can provide for understanding tissue architecture and cellular communication. Both sections will be followed by a hands-on component based on the analysis of single cell and spatial transcriptomics data from the myocardial infarction atlas (Kuppe et al. 2022). To promote transparency, all the codes, software tools and the datasets used throughout the tutorial will be available and accessible through open-access repositories (e.g. GitHub repositories or Zenodo platforms).

Learning Objectives

  • Understanding and identifying the key theoretical concepts of cell-cell communication analysis
  • Learn the foundations of main computational approaches for cell-cell communication inference and develop critical thinking skills to choose and apply the most appropriate tools tailored to the specific research questions and analysis contents
  • Gain hands-on experience in applying these computational methods to real-world data and learn how to interpret and evaluate the results in the context of biological systems

Intended Audience and Level
This tutorial is aimed at Master’s or PhD students, as well as researchers in the fields of bioinformatics, computational biology, medical informatics, and related fields. It is designed for those who are users and/or developers of bioinformatics software and are interested in incorporating cell-cell communication analysis into their workflows. The tutorial is suitable for beginner to intermediate users of computational tools and those looking to expand their skill set in cell-cell communication analysis. Basic knowledge of R and Python is strongly recommended, as well as previous experience in gene expression and next-generation sequencing analysis.

Schedule

09:00-09:30 Introduction to computational inference of intercellular and intracellular cell-cell communication
Overview of cell-cell communication inference, assumptions and challenges, a priori biological knowledge, current approaches for intercellular and intracellular signaling
Di Camillo, Baruzzo
09:30-10:30 Hands-on analysis of Myocardial Infarction scRNA-seq
Presentation of case study: human myocardial infarction, tutorial on cell-cell communication analysis from single cell transcriptomics data
Cesaro (Helpers: Baldan Matteo and Tussardi Gaia)
10:30-10:45 Coffee Break
10:45-11:15 Introduction to comparative analysis of cell-cell communication and spatial analysis
Overview of cell-cell communication comparative analysis and computational approaches, spatial transcriptomics technologies and platforms: pro and cons
Costa
11:15-11:45 Hands on analysis of Myocardial Infarction ST and scRNA-seq (part 1)
Tutorial on comparative analysis from single-cell transcriptomics and spatial transcriptomics data using myocardial infarction case study
Nagai, Ruiz (Helpers: Thiago Maié and Kai Peng)
11:45-12:00 Coffee Break
12:00-12:45 Hands on analysis of Myocardial Infarction ST and scRNA-seq (part 2)
Tutorial on comparative analysis from single-cell transcriptomics and spatial transcriptomics data using myocardial infarction case study
Nagai, Ruiz (Helpers: Thiago Maié and Kai Peng)
12:45-13:00 Q&A Conclusion

- top -

Tutorial VT4: An applied genomics approach to crop breeding: A suite of tools for exploring natural and artificial diversity

Room: Virtual
Date: July 15, 2025
Start Time: 14:00
End Time: 18:00

Organizer:
Maria Skrabisova

Speakers:
Trupti Joshi, University of Missouri-Columbia
Yen On Chan, University of Missouri-Columbia
Maria Skrabisova, Palacky University in Olomouc
Jana Biova, Palacky University in Olomouc

Max Participants: 50

Description
The urgent need for crop improvement is hindered by the lack of precision in crop breeding. Although significant progress has been made in genomics, many causal genes of important agronomic and nutritional traits remain unknown. This is due to the inefficient identification of causal genetic features in candidate genes. With the growing body of sequencing data and phenotype information, current advances in genomics and the development of bioinformatics tools offer improvements in candidate gene selection. In this workshop, SoyHUB, the suite of tools, strategies, and solutions for soybean applied genomics, will be presented along with its extension to other crops at KBCommons. Our methodology for data integration, curation, reuse, and leveraging will be highlighted. Practical utilization of integrated data with the tools will be demonstrated. We will focus on the selection of best-performing markers, identification of causal genes, and exploration of alleles and genomic variation. We will cover simple Mendelian traits, showing how to analyze variation in protein-coding regions and promoters and touch on copy number variation. Solutions for complex cases, such as multiple independent alleles in a single gene or quantitative traits, will also be introduced. This workshop will highlight recent achievements in leveraging big data to improve precision in GWAS-driven discoveries and, therefore, accelerate the breeding of soybean and other crops.

Learning Objectives

  • To understand the basics of fine-mapping QTLs by post-GWAS analyses
  • To understand the importance of identifying causative mutations in causal genes
  • To learn principles of curating resequenced data gained from independent studies
  • To learn how to explore genetic variation
  • To understand the specifics of the Allele Catalog, GenVarEx, AccuTool, SNPViz v2.0 and MADis tools
  • To gain hands-on experience in applying tools and interpreting results for soybean as a model by using a web-based suite of tools, the SoyHUB

Intended Audience and Level
Graduate students, researchers, scientists, and practitioners in both academia and industry who are interested in applications of bioinformatic tools for crop breeding improvement. The tutorial is aimed towards entry-level participants with knowledge of the fundamentals of biology and bioinformatics (beginner). No prior experience with Python programming language is assumed, but familiarity with working on Unix-based systems is strongly recommended for the participants.

The tutorial slides and materials for hands-on exercises (e.g., links to demo, code implementation, datasets) will be posted online prior to the tutorial and made available to all participants.

Schedule

14:00-14:45 Translational Bioinformatics Frameworks and AI Solutions for Multiomics Research
Next-generation sequencing and multiomics data (bulk and single-cell) capturing molecular changes from genomics all the way to phenomics, have become an integral part of research in all domains including biomedical sciences, plants sciences, and others. This rapid revolution in the multiomics has posed a growing need for translational tools that can handle large amounts of data, are easily expandable, provide interpretable results and can be readily applied to any species. To address such translational needs, we have developed Soybean Knowledge Base (SoyKB) and Knowledge Base Commons (KBCommons) web-based frameworks, both fully equipped to handle the entire multiomics landscape for all organisms. SoyHUB within SoyKB provides access to our developed suite of tools such as AccuTool, Allele Catalog, GenVarX, MADis and others, specifically designed to provide the plant community with efficient data driven solutions for better breeding strategies. Additionally, our G2PDeep, deep learning method, provides a comprehensive web-based resource for phenotype predictions using multiomics data for all organisms.
Trupti Joshi
14:45-15:40 Diversity Panel Creation and Resequencing Data Curation Using SnakyVC Pipeline, Allele Catalog Pipeline, and Allele Catalog Tool (hands-on)
With the growing availability of large-scale genomic datasets, in silico identification of causal genes and crop improvement have become more achievable. However, extracting meaningful insights from these datasets often requires extensive data processing using various computational tools, which can be time-intensive due to sequential tool transitions. To address this challenge, we developed SnakyVC, a scalable variant calling pipeline to process large-scale whole-genome resequencing (WGRS) data, and an Allele Catalog Pipeline to annotate SNPs and Indels and generate Allele Catalog datasets. These high-efficiency and parallelizable pipelines can be deployed on standalone servers or high-performance computing clusters to significantly reduce computational time. To expand the benefits of the Allele Catalog datasets, the web-based Allele Catalog Tool has been developed and integrated into the SoyKB and KBCommons web platforms. This tool enables researchers to query and visualize alleles within genes, functional annotations, accession metadata, and phenotype distributions. Currently, this tool supports a wide range of organisms including soybean, Arabidopsis , poplar, rice, sorghum, and maize. With its extensive functionalities and new organism integration capabilities, the Allele Catalog Tool facilitates the discovery of novel alleles and the selection of plant accessions for improved breeding strategies and agricultural traits. Together, these tools empower researchers to efficiently process genomic data and enhance crop improvement efforts.
Yen On Chan
15:40-16:00 Coffee break
16:00-17:45 Utilization of the SoyHUB tools in Post-GWAS analyses (hands-on)
This hands-on tutorial will cover approaches and strategies in applied genomics that have been developed for Soybean and are available for adoption for other crops. Using SnakyVC, we created a panel of diverse resequenced soybean accessions. To explore the genetic diversity of Soy2939, we demonstrate the use of a gene-centric toolbox for post-GWAS analysis, SoyHUB. Specifically, we will present allele mining in protein-coding regions (Soybean Allele Catalog), gene regulatory regions (GeneVarEx), and intergenic regions (SNPViz v2.0). Additionally, we will introduce a post-GWAS strategy, the Synthetic Phenotype to Causative Mutation (SP2CM) approach, which increases the probability of identifying causative mutations. We will also demonstrate how to use AccuTool to calculate accuracy as a measure of the direct correspondence between phenotype and variant position. In this tutorial, we will perform demonstration analyses using each tool within the SoyHUB platform and discuss the results in the context of relevant biological insights. Together, the strategy and tools provide a powerful, bioinformatics-driven layer in the pre-breeding phase, that ultimately improves and accelerates crop breeding.
Maria Skrabisova
17:45-18:30 MADis: Genomic Analysis Tool for the Revelation of Multiple Alleles within a Single Gene (hands-on)
Understanding crop diversification through evolution and domestication is crucial for crop breeding. Genome-wide association study (GWAS) has emerged as a powerful tool for mapping genomic loci linked to important traits. However, GWAS often struggles to resolve complex genetic architectures, here we focus on a frequent situation when multiple independent causative alleles exist within a single gene. To address this GWAS limitation, we developed the MADis (Multiple Allele Discovery) tool, an innovative tool that computes a score for a combination of variant positions in a single candidate gene and, based on the highest score, identifies the best number and combination of CMs. In this hands-on tutorial, we will introduce the MADis tool and cover its functionalities and utilization. Participants will explore how MADis overcomes the limitations of traditional GWAS to accelerate precision breeding and enhance the understanding of complex genetic traits. Participants will also learn how to integrate MADis into their research workflow. The tool is available as a Python package on GitHub and as a web-based Soybean MADis tool specifically designed for a curated panel of 1066 soybean resequenced accessions.
Jana Biova

- top -

Tutorial VT5: Comprehensive Bioinformatics and Statistical Approaches for High-Throughput Sequencing Data Analysis, Including scRNA-seq, in Biomarker Discovery

Room: Virtual
Date: July 14, 2025
Start Time: 14:00
End Time: 18:00

Organizer:
Xiaoli Zhang

Speakers:
Xiaoli Zhang
Lianbo Yu

Max Participants: 50

Description
With the significant advancements in genomic profiling technologies and the emergence of selective molecular targeted therapies, biomarkers have played an increasingly pivotal role in both the prognosis and treatment of various diseases, most notably cancer. This workshop is designed to begin with an introductory overview of basic concepts of biomarkers, the diverse categories of biomarkers, commonly employed biotechnologies for biomarker detection, with a special focus on gene mutation and gene expression using DNA-seq, RNA-seq, and scRNA-seq data. Furthermore, we will discuss processes of biomarker discovery and development, and outlining the key steps involved and the current analytical methodologies utilized. Following this, we will discuss the identification of driver gene mutations and altered gene expression, using The Cancer Genome Atlas (TCGA) lung cancer data and PBMC scRNA-seq data as illustrative examples with using R code as practical demonstrations to enhance understanding. In the latter part of this workshop, we will discuss commonly utilized biostatistics and bioinformatics tools, including data visualization, survival analysis and machine learning methods, which are employed to predict disease progression and patient survival outcomes based on these critical biomarkers. By the conclusion of this course, participants will have acquired a broad and fundamental understanding of biomarker discovery, particularly in cancer. This encompasses key concepts, data sources, data analysis techniques, and interpretation strategies. Such expertise will equip participants with the knowledge necessary in contributing to the development of precision medicine in cancer patient treatment.

Learning Objectives
Through this tutorial, we expect that participants will have acquired a broad and fundamental understanding of biomarker discovery including key concepts, data types and sources, key steps in molecular biomarker discovery, commonly used biostatistics and bioinformatics methods in the field, and data analysis techniques for DNA-seq, RNA-seq, and scRNA-seq.

Intended Audience and Level
This introductory tutorial will be suitable for all bioinformaticians and biomedical researchers. R programming experience is preferred, but not required.

Schedule

14:00-14:45 Introduction of biomarkers
Definition of biomarkers, types of biomarkers, central dogma, molecular biomarkers, technologies for molecular biomarker detection, key steps in molecular biomarker discovery and related analytic methodologies involved.
14:45-15:30 Identification of driver mutations
Discuss publicly available data sources, data types, use lung cancer TCGA DNA-seq and clinical data as examples to illustrate how to identify driver gene mutations in tumor samples. Hands on examples with R code for analysis will be demonstrated.
15:30-15:45 Break
15:45-16:45 Detection of differential expression
Statistical methods for analyzing differential expression in bulk RNA-seq and scRNA-seq data, complemented by visualization techniques such as t-SNE, UMAP, and heatmaps. Hands-on examples will be provided, including R code demonstrations for data analysis.
16:45-17:00 Break
17:00-18:00 Machine learning methods for phenotype prediction using biomarkers
Further discussing visualization of gene expression data with volcano plots, heatmaps, PCA plots, t-SNE, and UMAP etc., oncoplot and mutplot for gene mutation, logistic regression and survival analysis to predict biomarker association with patient progress and survival outcomes, then discuss some classification methods such as KNN, SVM, and random forest etc., for patient subtype analysis based on genomics data. Hands on examples with R code for predictive modeling will be demonstrated.

- top -

Tutorial VT6: Beyond Bioinformatics: Snakemake for Versatile Computational Workflows

Room: Virtual
Date: July 14, 2025
Start Time: 09:00
End Time: 13:00

Organizer:
Carlos H. M. Rodrigues

Speakers:
Carlos H. M. Rodrigues, Australian Centre for Disease Preparedness

Max Participants: 20

Description
Snakemake is a powerful, Python-based workflow management system that revolutionises how computational tasks are designed, executed and reproduced. By allowing researchers to define workflows as a series of interconnected rules, Snakemake simplifies complex computational pipelines and ensures reproducibility across diverse scientific domains. This intensive workshop introduces participants to the full potential of Snakemake, moving beyond traditional bioinformatics applications to demonstrate its versatility in machine learning and data analysis.

Designed for researchers dealing with complex computational challenges, this tutorial will explore Snakemake’s capabilities through practical, hands-on examples that span multiple disciplines. Participants will learn how to create robust, scalable, and efficient workflows that can adapt to various research challenges. By the end of the workshop, attendees will have a comprehensive understanding of workflow management principles and the skills to implement sophisticated pipelines using Snakemake.

Learning Objectives

  • Understand the fundamental concepts of workflow management with Snakemake
  • Develop skills in creating flexible and reproducible computational pipelines
  • Apply Snakemake to multiple domains, including bioinformatics and data science
  • Implement advanced workflow features such as wildcards, parallel execution and configuration management
  • Recognise best practices for designing efficient and maintainable workflows

Intended Audience and Level
This tutorial is designed for individuals across various disciplines who manage or analyse data: bioinformaticians, data scientists, computational researchers, and anyone wanting to know more about efficient data workflow management. Basic familiarity with command-line operations, such as executing scripts or simple commands like “ls” or “pwd”, will be beneficial for active participation. No prior Snakemake experience is necessary; we will cover everything from basics to advanced applications. For hands-sessions, participants will need a GitHub account to interact with the Binder environment where we will run the workshops. While live coding is encouraged, attendees are also welcome to observe and learn through demonstration if they prefer.

Schedule

09:00-09:30 Introduction to Snakemake and Workflow Principles
09:30-10:45 Domain specific workflow examples
  • Bioinformatics workflow (Hands-on)
  • Data Science/Machine Learning workflow (Hands-on)
10:45-11:00 Coffee break
11:00-12:15 Advanced Snakemake Features
  • Wildcards and config files
  • HPC Environments
  • Parameterised simulation workflow (Hands-on)
12:15-12:45
  • Practical considerations, Best Practices and Q&A Identify and highlight blocks of hands-on content
    • Bioinformatics workflow (30-40 mins)
  • Live coding of an RNA-seq data processing pipeline
  • Participants create and modify Snakemake rules
  • Explore rule dependencies and workflow structure
    • Data Science/Machine Learning workflow (30-40 mins)
  • Build an end-to-end ML pipeline
  • Demonstrate data preprocessing, model training and evaluation
  • Hands-on exercise in creating dynamic workflow rules
    • Parameterised simulation workflow (30-40 mins)
  • Create a parameterised simulation workflow
  • Implement parameter sweeps
  • Explore advanced features like wildcards and config files

- top -

Tutorial VT7: Assessing and Enhancing Digital Accessibility of Biological Data and Visualizations

Room: Virtual
Date: July 15, 2025
Start Time: 14:00
End Time: 18:00

Organizer:
Sehi L’Yi

Speakers:
Sehi L’Yi
Lawrence Weru
Thomas Smits
Nils Gehlenborg

Max Participants: 40

Description
As computational biologists, we produce biological datasets, visualizations, and computational tools. Our shared goal is to make our data and tools widely usable and accessible. However, we often fail to meet the needs of certain groups of people. Our recent comprehensive evaluation [1] (https://inscidar.org) shows that biological data resources are largely inaccessible to people with disabilities, with severe accessibility issues in almost 75% of all 3,112 data portals included in the study. To address the critical accessibility barriers, it is important to increase awareness of accessibility in the community and teach the workforce practical ways to enhance the accessibility of biological data resources. While there are existing resources and training opportunities that focus on content that includes no or little data, there is a lack of solutions and resources that provide insights into how to make data-intensive content accessible.

Our tutorial is designed to help participants understand the importance of digital accessibility in computational biology and practice various approaches to test and implement digital accessibility of biological data and visualizations. We will demonstrate our evaluation results [1] to help participants understand the critical barriers in biological research and education for people with disabilities, such as those involving vision, cognitive, and physical function. We will use hands-on examples that are familiar to and widely used by computational biologists, such as computational notebooks, genome browsers, and other visualizations. Our tutorial will conclude by introducing open problems and recent innovations, such as the accessibility of interactive genomics data visualizations [2]. We will ensure that our tutorial and all the materials are accessible.

Learning Objectives

  1. Understand the importance of accessibility and current accessibility problems in the field of computational biology.
  2. Learn different approaches to evaluate the accessibility of biological data resources.
  3. Learn approaches to enhance the accessibility of biological data resources and visualizations.
  4. Learn open problems and innovations in accessibility

Intended Audience and Level
Anyone in the computational biology workforce can benefit from this tutorial. The target audience includes (under)graduate students, postdocs, staff scientists, data analysts, software engineers, UI/UX designers, and principal investigators.

  • This tutorial will be particularly helpful for people who create any form of digital content using biological data, such as visualizations, figures, websites, etc.
  • For the hands-on sessions, experience in Python is helpful but not required. No additional programming experience is needed.

Schedule

14:00-14:45 Background
  • What is digital accessibility?
  • What are accessibility techniques? (e.g., screen readers)
  • Why is digital accessibility important in computational biology?
  • Why should each of us care about this? How does this affect each of us?
    • Legal requirements
    • Practical benefits of increasing accessibility for your work
  • What is the state of digital accessibility in computational biology?
    • Our accessibility evaluation results on INSCIDAR (https://inscidar.org/) [1]
  • What should we do to make it better?
14:45-16:10 Hands-on session 1: Assessing the accessibility of biological data resources
This session will (i) introduce specific accessibility guidelines/tools/methods and (ii) practice evaluating the accessibility of biological data resources using both manual and automatic methods.
  • Data portals (cBioPortal and HuBMAP)
    • Keyboard-only navigation for Data Discovery tasks
    • WAVE extension
    • HeadingsMap extension (for evaluating the structure)
    • SiteImprove extension
  • Genome browsers (UCSC Genome Browser and JBrowse 2)
    • Keyboard-only navigation
  • Computational notebooks (Google Colab and Jupyter Notebooks)
    • Background and guidelines [5]
  • Other potential examples
    • PDF files (e.g., Bioinformatics, Nature)
    • Evaluating using Adobe Acrobat
    • Navigation using a screen reader
  • 15:30-15:45: Break
16:10-17:30 Hands-on session 2: Implementing accessibility standards
Writing alt-text of biomedical data visualization figures
  • Making computational notebooks accessible
  • Other potential examples
    • Making presentation slides and embedded biomedical data accessible
    • Provide written guidelines and ask each participant to improve accessibility
  • 16:45-17:00: Break
17:30-18:00 Open problems and innovations
  • Accessibility for complex biomedical data visualizations
  • Complex data discovery tasks in data portals
  • Towards better accessibility in computational notebooks
  • AI and LLM-based applications
  • Additional resources (e.g., Siteimprove)

- top -

Tutorial VT8: Generative AI for Single-Cell Perturbation Modeling: Theoretical and practical considerations SOLD OUT

Room: Virtual
Date: July 14, 2025
Start Time: 14:00
End Time: 18:00

Organizers:
Marina Esteban-Medina
Liya Zaygerman
Rosario Astaburuaga-García
Aspasia Orfanou
Vasileios Vasileiou

Speakers:
George Gavriilidis, Centre for Research and Technology Hellas
Konstantinos I. Giatras, National and Kapodistrian University of Athens
Gobikrishnan Subramaniam, Queen’s University Belfast
Sabrina Jagot, Institut NeuroMyoGen
Alejandro Madrid, Barcelona Supercomputing Center

Max Participants: 25

Description
Single-cell perturbation modelling is revolutionising how we understand the effects of genetic interventions, drugs, and cellular stimulants on molecular and cellular physiology. This half-day virtual tutorial will introduce participants to highly performant Generative AI tools—scGEN and scPRAM—designed to simulate perturbations on single-cell datasets and extrapolate to unseen conditions. Through concise presentations and hands-on exercises, attendees will explore the theoretical underpinnings of these tools, preprocess single-cell data, train generative models, and interpret results using advanced metrics such as R², E-distance, and Maximum Mean Discrepancy as well as dimensionality reduction techniques. Special focus will be placed on challenging scenarios like extrapolating to unseen patient responses and cross-species predictions, leveraging benchmarking insights from the EU BH 2024 Perturb-Bench initiative. Participants will leave with actionable knowledge to implement, evaluate, and benchmark generative perturbation models, supported by practical resources hosted on Google Colab, Jupyter Book, and GitHub.

Learning Objectives

  1. Understand Single-Cell Perturbation Modeling:
    Learn how generative AI models (e.g., scGEN, scPRAM) predict unseen perturbations and their role in disease research and drug repurposing
  2. Implement Generative AI Models:
    Gain hands-on experience with scGEN and scPRAM, including preprocessing single-cell data, training models, and interpreting outputs
  3. Utilise Cloud-Based Tools:
    Execute Python scripts in Google Colab to process curated single-cell datasets efficiently using GPU resources
  4. Design Perturbation Experiments:
    Create biologically relevant scenarios (e.g., cross-species predictions) and address key steps like normalisation and hyperparameter tuning
  5. Benchmark and Evaluate Models:
    Conduct mini-benchmarking of scGEN and scPRAM using Perturb-Bench to assess performance and reliability across cell types.

Intended Audience and Level
Post-graduate students and researchers with basic experience in single-cell analysis and Python programming who are interested in generative AI, single-cell perturbation biology, drug repurposing, and computational modeling. The workshop will be conducted in Google Colab, therefore participants do not need to meet any minimum computational specification to attend.

All tutorial materials will be distributed via Github, governed by a permissive open license (e.g. CC-BY). Practical sessions will be documented in Jupyter Book (https://jupyterbook.org/en/stable/intro.html) with accessible and coherent code examples included. Recorded versions of taught material will be available via YouTube after the conference.

Data used for tutorial examples will be publicly available single-cell transcriptomics datasets made accessible via the pertpy package (https://pertpy.readthedocs.io/en/latest/usage/usage.html) and the perturbase repository (http://www.perturbase.cn/); also, datasets from the scPRAM publication will be used (https://doi.org/10.1093/bioinformatics/btae265).

Schedule

14:00-14:10 Title: Welcome and Introduction to perturbation modelling for single-cell technologies
  • Short presentation (importance of perturbation modelling in single-cell biology, brief introduction to available tools, perturbation single-cell data, overview of workshop’s agenda)
Speaker: George Gavriilidis
14:10-15:30 Title: scGEN: a landmark generative model for unseen perturbations
  • Short presentation (Using autoencoders for manifold learning manifold learning, extrapolation to unseen events in single-cell perturbation data, scGEN architecture, possible real-world scenarios for deployment )
  • Hands-on practical: (an enhanced version of https://pertpy.readthedocs.io/en/latest/tutorials/notebooks/scgen_perturbation_prediction.html will be designed focusing on single-cell data pre-processing, designating training and testing sub-datasets, model hyper-parameter tuning (epochs, batch size), running scGEN, interpreting model outputs, using dimensionality reduction to evaluate perturbational extrapolations, implementing basic metric R2 to monitor the accuracy of perturbation prediction for differentially expressed genes and highly variable genes)
Speaker: Konstantinos I. Giatras
Trainers: Gobikrishnan Subramaniam, Konstantinos I. Giatras
15:30-15:45 Coffee Break
15:45-16:45 Title: scPRAM: a newer perturbation generative model based on attention mechanism and causal counterfactuals
  • Short presentation (Explore optimal transport principles in single-cell perturbation modelling, how scPRAM maps perturbations probabilistically between control and stimulated conditions, and its key components, including transport cost functions, regularisation, and interpretable metrics.)
  • Hands-on practical (an enhanced version of https://github.com/jiang-q19/scPRAM/blob/main/Tutorial/PBMC_cross_celltype_predict.ipynb will be designed to focus on preprocessing single-cell datasets, configuring scPRAM with key hyperparameters(noise robustness, learning rate, batch size, and optimal transport settings), evaluating model performance using R2, Wasserstein distance and dimensionality reduction and showcasing how attention prioritises the most relevant cells or features for predicting perturbation responses.
Speaker: Sabrina Jagot
Trainers: Sabrina Jagot, George Gavriilidis
16:45-17:00 11:45-12:00 AM Coffee Break
17:00-17:50 Title: Decentralised benchmarking of generative perturbation models
  • Short presentation (why benchmarking is important, selecting metrics beyond typical linear correlations, workflows and decentralised deployment)
  • Hands-on practical (scGEN vs scPRAM; based on code from the ongoing Perturb-Bench effort from EU BH 2024 to systematically benchmark scGEN vs scPRAM against more metrics like E-distance, Maximum mean discrepancy in challenging single-cell scenarios like extrapolation to unseen patient perturbation responses and perturbation predictions across-species)
Speaker: Alejandro Madrid
Trainers: Alejandro Madrid, Konstantinos I. Giatras
17:50-18:00 Title: Wrap up and discussion
  • Short presentation (Short recap of key take away messages)
  • Open discussion
Speaker: All

- top -

Tutorial VT9: Biomedical text mining for knowledge extraction

Room: Virtual
Date: July 15, 2025
Start Time: 09:00
End Time: 13:00

Organizer:
Jake Lever

Speakers:
Jake Lever, University of Glasgow
Zaiqiao Meng, University of Glasgow
Javier Sanz-Cruzado Puig, University of Glasgow

Max Participants: 60

Description
Modern bioinformatics analyses rely heavily on the existing knowledge of the role of genes and mutations in different diseases, as well as the complex interactions between genes, proteins and drugs. However, access to this information is often limited for many biomedical problems, especially niche areas, due to lack of knowledge bases and large curation costs. The information is often locked in the text of the original research papers. Machine learning methods, particular natural language processing techniques, offer an automated approach to extracting knowledge from the research literature to build bespoke knowledge bases for scientists’ needs. This tutorial will provide a hands-on introduction to the core tasks in biomedical natural language processing (BioNLP). These include identifying mentions of important concepts (e.g. phenotypes, cell-lines, etc) and extracting nuanced relationships between them. Finally, it will show how large language models have changed how information can be quickly extracted, but also highlight their challenges.

Learning Objectives

  1. Identify different sources of biomedical text for use in text mining
  2. Implement methods for extracting mentions of biomedical entities (e.g. drugs and genes) from biomedical text
  3. Demonstrate approaches to extracting relationships between entities in biomedical text
  4. Apply large language models to various information extraction tasks
  5. Appraise the abilities of large language models for different natural language processing tasks
  6. Discuss approaches for evaluating the performance and quality of extraction methods

Intended Audience and Level
Bioinformatics researchers with an interest in extracting knowledge from text, particularly for knowledge base construction. They should be comfortable with coding in Python. No prior knowledge about machine learning or deep learning is expected.

Schedule

09:00-09:15 Introduction to Biomedical Natural Language Processing
This introduction will cover the applications of biomedical text mining and outlining the plan for the day.
09:15-09:30 Lecture: Getting started in NLP
This talk will introduce the core concepts of getting computers to work with text, getting access to appropriate biomedical text and annotating it for building and evaluating text mining systems.
09:30-10:00 Hands-on: Getting started in NLP
This session will have attendees try out some introductory NLP tasks in Python and get access to biomedical text from PubMed through its API
10:00-10:15 Lecture: Identifying mentions of biomedical concepts using named entity recognition (NER)
This talk will go over the NER task and some of the different approaches. It will focus on transformer-based methods.
10:15-10:45 Hands-on: NER with Spacy and HuggingFace
This session will have attendees work through a Jupyter notebook to train an NER system given a provided set of annotations. They will get to work with transformer-based models using the HuggingFace library.
10:45-11:00 Coffee Break
11:00-11:15 Lecture: Extracting relations from biomedical text
This talk will outline the importance of extracting meaningful relations between entities and going beyond co-occurrences.
11:15-12:00 Hands-On: Relation Extraction with Co-occurrences and HuggingFace
This session will have attendees work through a notebook using data that has entities already extracted to extract associations.
12:00-12:15 Lecture: Using Large Language Models for Biomedical Text Mining
This talk will focus on the strengths and weaknesses of large language models for the tasks discussed in this tutorial. It will give a brief background on how they work and the common pitfalls when used for information extraction
12:15-12:45 Hands-On: LLMs for Entity Extraction and Relation Extraction
This session will have attendees work with a small LLM and apply it to pre-prepared data for several extraction tasks.
12:45-13:00 Lecture: LLMs and the future of information extraction
The final talk will lead a discussion on the benefits of LLMs for information extraction, but also what challenges remain.

- top -

Links within this page: Abstracts – Talks and Posters: AboutAbstracts Talks and Posters: Deadlines | Tracks: COSIs & Open Sessions | Abstracts Talks and Posters: Submission Guidelines | Abstracts Talks and Posters: Review Criteria | Poster details for ISMB | Late Breaking Posters | FAQ | Our Partnership with Showcare


Click Here to Submit

Abstracts – Talks and Posters

We invite abstracts for research that is topical to bioinformatics and computational biology, which is in progress (unpublished, formerly Late-Breaking Research) or previously published within the last 18 months (previously Highlights Track) for consideration for oral and/or poster presentation.

Your abstract should be submitted to the Communities of Special Interest (COSI) or Open Session Track that most accurately reflects the area of the work. (You will have the option to identify two other tracks that may also be suited to your research area during your submission). You can learn more about COSIs here.

Talks and posters are organized according to scientific topics which are covered by the COSIs and open sessions listed and described below. Please note, abstracts are not limited to COSI and open session topics and if the topic of your submission falls outside the listed COSIs you can choose the "General Computational Biology" area. Your abstract should convey a scientific result and should not be an advertisement for any commercial software package.

Multiple presentations: The same talk is not permitted to be given more than once as an oral presentation at ISMB. Research may be presented as both a talk and a poster if and only if it is submitted as two individual submissions with TWO different presenters. Having the same person deliver two (or more) different talks is NOT PERMITTED on the same topic. This allows for many people from one lab to present the work of the lab. A presenting author may present NO MORE THAN ONE talk or poster on the same topic.

For a variety of reasons, ISCB strongly prefers that scientific research accepted for oral presentation be presented in-person at the conference venue.  We understand that some presenters will have valid reasons to avoid in-person attendance. ISCB will grant remote presentation options for reasons associated with maternity/paternity leave, care for a family member, personal/medical disability, sickness, financial hardship, or potential visa problems.  If your research is accepted for oral presentation and you are unable to present in person, ISCB requires notification at the time of acceptance and no later than May 15, 2024. If unable to participate in person you will need to request a waiver by writing This email address is being protected from spambots. You need JavaScript enabled to view it.

Any pre-approved virtual presenters will be required to provide a pre-recorded talk for the virtual platform library in advance of the conference as a precaution due to possible technical issues.

ISMB/ECCB 2025 follows the ISCB policy for acceptable use of large language models (https://www.iscb.org/iscb-policy-statements/iscb-policy-for-acceptable-use-of-large-language-models).

- top -


Abstracts: Deadlines

Any recent research, published or unpublished is eligible for submission.  Different submission types have different deadlines and notification dates.  Be sure to check the below table or key dates page to find the dates relevant to your submission.

Poster only submissions DO NOT require any materials to be uploaded during submission. Submissions are evaluated on the 250 word text abstract provided during submission.

- top -


Tracks: Communities of Special Interest (COSIs) and Open Sessions


3DSIGBio-Ontologies BioInfo-CoreBioinformatics in UKBioVisBOSC: Bioinformatics Open Source ConferenceCAMDACompMSComputational and Systems ImmunologyDigital AgricultureEducationEquity and Diversity in Computational Biology ResearchEvolCompGen: Evolution and Comparative GenomicsFunctionGeneral Computational BiologyHitSeqiRNAMLCSBMICROBIOMENetBioRegSys SysModText MiningTransMedVarI • 

3DSIG: Structural Bioinformatics and Computational Biophysics

Abstract Area Chairs:
Douglas Pires, University fo Melbourne, Australia
R. Gonzalo Parra, Barcelona Supercomputing Center
It is impossible to fully understand biological systems without understanding the 3D structure of their constituting parts and their interactions. As such the topics relevant for 3DSIG are wide and include, but are not restricted to Structure-based drug discovery including polypharmacology and network pharmacology; Structure representation, classification and prediction;
Structure-based function prediction; Docking, analysis, prediction and simulation of biomolecular interactions such as protein-protein, protein-ligand and protein-nucleic-acid; Protein dynamics and disorder; Evolution studied through structures; Application of structure to systems biology; Macromolecular assemblies; Structural genomics; 3D databases and data mining; Molecular visualization; Relevant methods of structure determination particularly hybrid methods; Prediction and analysis of protein domains; Membrane protein structure analysis and prediction; The structural basis of immunology.


click to enlarge
BOKR: Bio-Ontologies and Knowledge Representation

Abstract Area Chairs:
Tiffany Callahan, IBM Research, USA
Augustin Luna, NLM, USA

Learn more about the Bio-Ontologies here
Bio-Ontologies & Knowledge Representation (BOKR) covers research related to building and using structured knowledge in bioinformatics, including ontologies, knowledge graphs, knowledge-enhanced learning, knowledge extraction, and, in general, any aspect related to the organisation, presentation, dissemination, and use of knowledge in biology, biomedicine, and the life sciences.



click to enlarge
BioInfo-Core

Abstract Area Chairs:
Lorena Pantano Rubino, Harvard School of Public Health, United States
Madelaine Gogol, Stowers Institute, United States
Alberto Riva, Human Technopole, Italy
Yuvanesh Vedaraju, Houston Methodist, United States

Learn more about Bioinfo-core here
Bioinfo-core is a worldwide body of people that manage or staff bioinformatics cores within organizations of all types including academia, academic medical centers, medical schools, biotechs and pharmas.
The major goal of this COSI is to offer a community to people who manage and staff bioinformatics cores around the world. Topics of interest to this group for our ISMB workshop include topics that core members would find interesting or useful. This includes but is not limited to: management topics relevant to cores, core formation and setup, workflow development, reproducibility, best practices, training, or exploring analysis of new types of data.


click to enlarge
Bioinformatics in the UK

Abstract Area Chairs:
Carole Goble, The University of Manchester, United Kingdom
Janet Thornton
, EBI/EMBL, United Kingdom

Learn more about Bioinformatic in the UK here
Recognising that the UK hosts the 2025 meeting, a one-day UK track will be dedicated to showcasing the UK’s research, innovation and talent, and includes a dedicated poster reception and networking event.

The track will focus on Biodiversity and Agritechnology and Health, and encouraging contributions on methods (including AI), standards and applications.
Abstracts for talks and posters in these areas are invited from Academic or Industry researchers in either Research career paths (faculty, fellows, postdocs, postgraduates), or Technical Professional paths (including but not limited to data stewards, data scientists and Research Software Engineers). Submissions are also encouraged from researchers and RTPs involved with data management and the data infrastructure, and bioinformaticians involved in interpreting the data and making an impact on health and biodiversity. The track will also feature invited keynotes and an industry panel.

An evening poster reception will provide an opportunity for socialising and networking.

The track is open to all conference delegates. All speakers, poster presenters and panel members will be drawn from UK institutions. The sessions will be coordinated by a Programme Committee, who will consider scientific and technical excellence as the primary consideration but will also seek to select papers that represent the breadth and diversity of the bioinformatics community in the UK, considering geography, scientific domain, career stage, and EDI principles. Only those based in the UK can apply for a poster and/or talk.
BioVis: Biological Data Visualization

Abstract Area Chairs:
Zeynep Gümüş, Icahn School of Medicine at Mount Sinai, United States
Robert Krueger, NYU Tandon, United States
Jen Rogers, Glasgow School of Art, United Kingdom
Qianwen Wang, University of Minnesota, United States

Learn more about the BioVis COSI here
The BioVis track aims to educate, inspire, and engage bioinformatics and biology researchers in state-of-the-art visualization research and visualization researchers in problems in biological data visualization.
The rapid adoption of data-intensive biology approaches creates enormous challenges for computational visualization techniques, which are needed to enable researchers to gain insight from their large and highly complex data sets. The goal of this session is to bring together researchers from the visualization, bioinformatics, and biology communities with the purpose of educating, inspiring, and engaging bioinformatics and biology researchers in state-of-the-art visualization research, as well as visualization researchers in problems in biological data visualization.

Keywords: Visualisation, Visual Analytics, Education


click to enlarge
BOSC: Bioinformatics Open Source Conference

Abstract Area Chairs:
Nomi L. Harris, Lawrence Berkeley National Laboratory, United States
Karsten Hokamp, Trinity College Dublin, Ireland
Deepak Unni, Swiss Institute of Bioinformatics, Switzerland
Hervé Ménager, Institut Pasteur, France
Jason Williams, Cold Spring Harbor Laboratory, United States
Monica Munoz-Torres, University of Colorado Anschutz Medical Campus, United States
Tazro Ohta, Chiba University
Jessica Maia, BD

Learn more about BOSC here

BOSC covers all aspects of open science / open source bioinformatics, including standards and ontologies; approaches that promote open science and sharing of data, results and software; bioinformatics tools and libraries; and ways to grow open source communities while promoting diversity within them.
Launched in 2000 and held yearly since then, BOSC is organized by the Open Bioinformatics Foundation (OBF), a non-profit group dedicated to promoting the practice and philosophy of open source software development and open science within the biological research community.
Keywords: Open source; Open science; Open data; Reproducible research; Interoperability; Data science; Workflows; Translational bioinformatics; Diversifying bioinformatics communities
You can learn more about the BOSC COSI here.


click to enlarge
CAMDA: Critical Assessment of Massive Data Analysis

Abstract Area Chairs:
Paweł P Łabaj, Małopolska Centre of Biotechnology of Jagiellonian University

(PLEASE note: CAMDA submitters should include a 3-5 page long abstract PDF)

Learn more about the CAMDA COSI here.
The large, complex data sets for the Critical Assessment of Massive Data Analysis (CAMDA) contest include built-in truths for calibration. In an open-ended competition, however, both seasoned researchers and cunning students push the boundaries of our field, with unexpected questions or angles of approach often bringing the most impressive advances.
The CAMDA track highlights and compares the latest methods and results in an international data analysis contest, with this year's topics including: (1) The Synthetic Clinical Health Records Challenge provides a rich set of highly realistic Electronic Health Records (EHR) tracing the diagnosis trajectories of diabetic patients, created with dual-adversarial auto-encoders trained on data from 1.2 million real patients in the Population Health Database of the Andalusian Ministry of Health. Predict relevant diabetes endpoints like blindness or cardiopathy from past diagnosis trajectories! (2) The Anti-Microbial Resistance Prediction Challenge features clinical isolates sequences. Predict resistance genes/markers and identify resistant bacteria! (3) The Gut Microbiome based Health Index Challenge features hundreds of WMS based taxonomic and functional profiles of healthy and unhealthy individuals. Take advantage of the Theater of Activity concept and compete already existing taxonomy based metrics!


click to enlarge
CompMS: Computational Mass Spectrometry

Abstract Area Chairs:
Wout Bittremieux, University of Antwerp
Timo Sachsenberg, University of Tübingen
Isabell Bludau, Heidelberg University Hospital
Lindsay Pino, Talus Bio
Marie Brunet, Sherbrooke University

Learn more about the CompMS COSI here.
The CompMS group promotes the efficient, high quality analysis of mass spectrometry data. The CompMS initiative covers various computational mass spectrometry application domains, including proteomics, metabolomics, and lipidomics.

click to enlarge
Computational and Systems Immunology

Abstract Area Chair:
Ferhat Ay,
La Jolla Institute for Immunology, United States
Jishnu Das,
University of Pittsburgh School of Medicine, United States


"The Special Session on Computational Immunology invites abstract submissions for short talks and poster presentations encompassing various aspects of computational immunology. This includes the development of machine learning, network systems and immunogenomic approaches for analyzing multi-dimensional/multi-omic data.
Data-driven computational studies addressing fundamental or translational questions in immunology are also of interest. Potential topics may cover, but are not limited to:
  • ingle-cell multi-omics for characterizing immune cell heterogeneity and regulatory mechanisms of immune cell function
  • Spatial genomics for studying immune microenvironments and interactions among immune cells.
  • Trajectory inference and lineage tracing in immune cells.
  • Analysis of receptor-ligand interactions.
  • Profiling, reconstruction, and analysis of T cell and B cell receptor repertoires.
We encourage researchers to submit abstracts showcasing innovative approaches and insights in these areas for consideration in our Special Session on Computational Immunology.
Education: Computational Biology Education

Abstract Area Chair:
Russell Schwartz,
Carnegie Mellon University
Patricia Palagi, SIB Swiss Institute of Bioinformatics
Wai Keat Yam, International Medical University

Learn more about the Education COSI here.

Education-COSI focuses on bioinformatics and computational biology education and training across the life sciences.
A major goal of this COSI is to foster a mutually supportive, collaborative community in which bioscientists can share bioinformatics education and training resources and experiences, and facilitate the development of education programs, courses, curricula, etc., and teaching tools and methods.
Keywords: education, training, training resources, curricula, competencies


click to enlarge
Equity and Diversity in Computational Biology Research

Abstract Area Chairs:

Casey Greene, University of Colorado’s Anschutz Medical Campus, United States
Alejandra Medina Rivera, Universidad Nacional Autonoma de Mexico, Mexico
Farzana Rahman, Kingston University London, United Kingdom


This area is for research that examines issues of equity, representation, diversity, or other elements related to datasets, methods, or the field of computational biology at large.
EvolCompGen: Evolution and Comparative Genomics

Abstract Area Chairs:
Nadia El-Mabrouk, University of Montreal, Canada
Katharina Jahn, Freie Universitat, Germany
Edward Braun, University of Florida, United States

Learn more about the EvolCompGen COSI here
Evolution and comparative genomics are deeply intertwined with computational biology. Computational evolutionary methods, such as phylogenetic inference methods or multiple sequence alignment are widely used, yet remain far from “solved” and are indeed intense areas of research.
At the same time, evolutionary and comparative genomics are inherently “transversal” disciplines in that work in many other biological areas of research have some evolutionary component (e.g. cancer genomics, epidemiology, toxicology, population genetics, functional genomics, structural biology just to name a few). The scope of this COSI is intentionally kept broad. The track will feature a mix of proceedings, highlight, and invited talks. Priority will be given to contributions which are relevant to more than a single area of application, or to contributions which are not covered by more specialised COSIs.

Keywords: Evolution, phylogeny, phylogenomics, comparative genomics, genome rearrangements, orthology, paralogy, recombination, lateral gene transfer, genome variation, population genomics, comparative epigenomics, epidemiology, cancer evolution genomics, genome annotation


click to enlarge
Function: Gene and Protein Function Annotation

Abstract Area Chairs:
Iddo Friedberg, Iowa State University, United States
Mark Wass, University of Kent, United Kingdom
Ana M Rojas Mendoza, Centro Andaluz de Biologia del Desarrollo, Spain
Dukka KC, RIT, United States
Jason McDermott, PNNL, United States

Learn more about the Function COSI here
The mission of the Function Community of Special Interest (Function-COSI) is to bring together computational biologists, experimental biologists, biocurators, and others who are dealing with the important problem of gene and gene product function prediction, to share ideas and create collaborations.
The Function COSI features the Critical Assessment of Function Annotation, an ongoing community challenge aimed at improving methods for protein function prediction. The Function-COSI holds annual meetings alongside ISMB. Also, we are conducting the multi-year Critical Assessment of protein Function Annotation, or CAFA, experiment.

Keywords: Protein function prediction, machine learning, performance assessment


click to enlarge
General Computational Biology
Abstract Area Chairs:
Ben Raphael, Princeton, United States
Simone Zaccaria, UCL Cancer Institute, United Kingdom

(If all other areas are insufficiently descriptive of your work, please submit to the area below. Please note that the submissions in this area can be moved, subject to chairs’ considerations.)
Novel techniques in emerging areas of computational biology, including intersections with other fields.
HitSeq: High-throughput Sequencing

Abstract Area Chairs:
Can Alkan, Bilkent University
Ana Conesa, Institute for Integrative Systems Biology
Francisco M. De La Vega, Stanford University
Dirk Evers, Dr. Dirk Evers Consulting
Kjong Lehmann, RWTH Aachen
Christina Boucher, University of Florida

Learn more about the HiTSeq COSI here
HiTSeq is a community of special interest devoted to the latest advances in computational techniques for the analysis of high-throughput sequencing (HTS) data. Sessions will be devoted to discussing the latest advances in computational techniques for the analysis of high-throughput sequencing (HTS) datasets and will provide a forum for in-depth presentations of the methods and discussions among the academic and industry scientists working in this field.
We seek contributions on any topic involving HTS data analysis including: genome assembly and haplotype phasing; transcriptome analysis; genetics and epigenetics variation; metagenomics and microbiome analysis; and new HTS platform data analysis (e.g. synthetic reads, long reads, nanopore). In addition to general sessions, we propose to have two specialized sessions to focus on current hot topics: a) long sequencing and mapping techniques, b) single cell sequencing applications, c) non-linear genome representations. Both of these topics have generated an enormous amount of interest recently.

Keywords: sequencing, next generation, high throughput, algorithms, assembly, phasing, variation discovery, genotyping, metagenomics, microbiome, long reads, short reads, nanopre, pan-genomes, graph genomes


click to enlarge
iRNA: Integrative RNA Biology

Abstract Area Chairs:
Michelle Scott, University of Sherbrooke, Canada
Athma Pai,
University of Massachusetts Medical School, United States
Hagen Tilgner, Weill Cornell Medicine, United States
Maayan Salton, Hebrew University of Jerusalem in Israel, Israel
Blake Sweeny, European Bioinformatics Institute, United Kingdom

Learn more about the iRNA COSI here
iRNA track covers the full range of research topics in the field of RNA Biology, from computational and high-throughput experimental methods development to their application in different aspects of RNA processing, structure, and function.
The goal of the Integrative RNA Biology session is to bring together experts in computational and experimental aspects of research in RNA Biology to cover new developments across this broad field of research. The meeting focuses on two major areas: (1) the development of computational and high-throughput experimental methods, and (2) the application of such methods to break new grounds in the study of RNA biology and disease. We aim to educate and inspire researchers in the field, novice and seasoned alike, by meshing together different aspects of Computational RNA Biology, and promoting cross-disciplinary collaborative research.

Keywords: Transcriptomics, RNA processing, post-transcriptional regulation, non-coding RNA, RNA 2D/3D structure, alternative-splicing, alternative polyadenylation, RNA translation degragation and localization, genetic variants effect on RNA processing, RNA and disease.


click to enlarge
MICROBIOME

Abstract Area Chair:
Zhogn Wang, DOE Joint Genome Institute, United States

Learn more about the MICROBIOME COSI here
The MICROBIOME Community of Special Interest aims at the advancement and evaluation of computational methods in microbiome research, especially metaomic approaches. Based on the Critical Assessment of Metagenome Interpretation (CAMI), the COSI supplies users and developers with exhaustive quantitative data about the performance of methods in relevant scenarios.
It therefore guides users in the selection and application of methods and in their proper interpretation. Furthermore, the COSI provides a platform for exchange and networking between method developers, and provides valuable information allowing them to identify promising directions for their future work.
Keywords: microbiome, computational metaomics, microbial networks, pathogens, symbionts, (meta)omics-based diagnostics, microbial ecology and evolution

The MICROBIOME Community of Special Interest aims at the advancement and evaluation of computational methods in microbiome research, especially metaomic approaches.

Keywords: microbiome, genome reconstruction, metagenomics, metatranscriptomics, metaproteomics, microbial networks, microbial traits, symbionts, pathogens, genome-based diagnostics


click to enlarge
MLCSB: Machine Learning in Computational and Systems Biology

Abstract Area Chairs:
Barbara Engelhardt, Stanford University
Sara Mostafavi, University of Washington

Learn more about the MLCSB COSI here

Systems Biology and Machine Learning meet in the MLCSB COSI. The community is the place for researchers of these areas to exchange ideas, interact and collaborate.
Molecular biology and all the biomedical sciences are undergoing a true revolution as a result of the emergence and growing impact of a series of new disciplines and tools sharing the -omics suffix in their name. These include in particular genomics, transcriptomics, proteomics and metabolomics, devoted respectively to the examination of the entire systems of genes, transcripts, proteins and metabolites present in a given cell or tissue type. Taking advantage of this wealth of genomic information has become a conditio sine qua non for whoever has the ambition to remain competitive in molecular biology and in the biomedical sciences in general. Machine learning naturally appears as one of the main drivers of progress in this context, where most of the targets of interest deal with complex structured objects: sequences, 2D and 3D structures or interaction networks. At the same time, bioinformatics and systems biology have already induced significant new developments of general interest in machine learning, for example in the context of learning with structured data, graph inference, semi-supervised learning, system identification, and novel combinations of optimization and learning algorithms.

Keywords: Machine Learning, Data Mining, Deep Learning, Artificial Intelligence in Bioinformatics and Computational Biology


click to enlarge
NetBio: Network Biology

Abstract Area Chair:
Deisy Morselli Gysi, Federal University of Parana, Brazil
Scooter Morris, University of California, San Francisco, United States
Martina Summer-Kutmon, Maastricht University, Netherlands

Learn more about the NetBio COSI here
As large scale, systems-level data are becoming increasingly available, modeling and analyzing them as networks is widespread. Network Biology Community serves to introduce novel methods and tools, identify best practices and highlight the latest research in the growing and interdisciplinary field of network biology.

Keywords: networks, pathways, systems, network analysis, disease networks, disease pathways


click to enlarge
RegSys: Regulatory and Systems Genomics

Abstract Area Chairs:
Marcel Schulz, Uniklinikum and Goethe University Frankfurt
Anthony Mathelier, University of Oslo

Learn more about the RegSys COSI here
Regulatory genomics involves the study of the genomic control system, which determines how, when and where to activate the blueprint encoded in the genome. Regulatory genomics is the topic of much research activity worldwide. Since computational methods are important in the study of gene regulation, the RegSys COSI meeting focuses on bioinformatics for regulatory genomics.
An important goal of the meeting is to foster a collaborative community wherein scientists convene to solve difficult research problems in all areas of computational regulatory genomics.

Keywords:
1. Genomic and epigenomic regulatory elements (transcription factors, enhancers, histones, gene promoters, regulatory motifs, non-coding RNAs, etc.)
2. NGS methods in regulatory genomics (e.g., ChIP-seq, RNA-seq, ATAC-seq)
3. Epigenetics and epitranscriptomics
4. Alternative splicing
5. Regulatory networks and pathway analysis
6. Genetic, molecular, and phenotypic variation in human disease
7. DNA shape
8. Single-cell transcriptomics (and other single cell assays)
9. 3D genomics (e.g., Hi-C and ChIA-PET)
10. Regulatory evolution or comparative regulatory genomics


click to enlarge
SysMod: Computational Modeling of Biological Systems

Abstract Area Chairs:
Shaimaa Bakr, Stanford University
Matteo Barberis, University of Surrey
Chiara Damiani, Università degli Studi di Milano-Bicocca
Reihaneh Mostolizadeh, Justus Liebig University Giessen
Bhanwar Lal Puniya, University of Nebraska–Lincoln
Meghna Verma, AstraZeneca

Learn more about the SysMod COSI here
The Computational Modeling of Biological Systems (SysMod) aims to create a forum for systems modelers and bioinformaticians to discuss common research questions and methods. The session will focus on the conjoint use of mathematical modeling and bioinformatics to understand biological systems functions and dysfunctions.
The meeting is open to the full range of methods used in systems modeling, including qualitative and quantitative modeling, dynamical and steady-state modeling, as well all applications of systems modeling including basic science, bioengineering, and medicine.

Keywords: computational modeling, constraint-based models, logical models, kinetic models, multi-scale models


click to enlarge
Text Mining: Text Mining for Healthcare and Biology

Abstract Area Chairs:
Robert Leaman, NCBI/NLM/NIH
Lars Juhl Jensen, University of Copenhagen
Zhiyong Lu, NCBI/NLM/NIH

Learn more about the Text Mining COSI here
The Text Mining COSI brings together researchers that create text mining tools with researchers who use text mining tools to make new discoveries. Our community discussions include both text mining use cases - concrete problems and applications in the biological/biomedical domain - and methodology development.

click to enlarge
TransMed: Translational Medical Informatics

Abstract Area Chairs:
Irina Balaur, University of Luxembourg

Learn more about the TransMed COSI here.
TransMed covers the current developments in the field of clinical and translational medicine informatics. Analysis of large amounts of multi-omics, imaging (medical and molecular), mobile sensor, clinical and health records data is paving the way for precision medicine. In the TransMed track, we will explore the current status of computational biology and advance machine learning approaches within the field of clinical and translational medicine.
This opens the debate on current state-of-the-art data infrastructures for translational medicine data integration and analysis. A variety of computational approaches are currently being used to harmonize and relate molecular data to clinical outcomes in order to better understand disease conditions. These methods also have the potential to discover biomarkers for early detection of disease, and targets for drug discovery, and to be used predictively to help to suggest personalised therapeutic strategies for patients. In this session we will bring scientists from both academia and industry to exchange knowledge and foster networking, to help in building up of the translational medicine community. .

Keywords: Translational medicine; clinical and ‘omics data integration; curation and harmonization; stratification of patients; informatics for integrating biology and the bedside; medical informatics; ontology-driven data representation; patient centered outcomes research; cohort data; deep learning


click to enlarge
VarI: Variant Interpretation

Abstract Area Chairs:
Emidio Capriotti, University of Bologna
Hannah Carter, University of California San Diego
Antonio Rausell, Imagine Institute

Learn more about the VarI COSI here
The VarI COSI meeting is dedicated to the recent advances in the analysis and interpretation of the genetic variants.
The meeting aims to organize a research network (VarI-COSI) facilitating the exchange of ideas and the establishment of new collaborations between researchers with different expertise. The VarI-COSI meeting is broadly divided in two sessions (“Genetic variants as markers: evolution, populations, GWAS” and “Genetic variants as effectors: function, structure, and regulation”) that encompass the four major research topics of the field: 1) Databases, data mining algorithms and visualization tools for variants analysis. 2) Methods for predicting regulatory/structural/functional impacts of SNVs. 3) Personal Genomics, GWAS studies and SNV prioritization. 4) Population genomics and phylogenetic analysis.


click to enlarge

- top -


Abstracts: Submission Guidelines

All submission MUST select a presenting author. The presenting author MUST be marked as a corresponding author. Presenters of accepted abstracts are required to make the presentation themselves, and must register and pay to attend the conference. Submitters may choose to have their submission reviewed for: talk or poster or poster only. If you select talk or poster and are unsuccessful for a talk your abstract will automatically be considered for a poster presentation. Submission deadlines are posted in the deadlines section or key dates page.

During the submission process you will submit to the COSI or open session track that most reflects your research area. You will have the option to identify two other tracks that may also be suited to your research area during your submission.

  • All abstracts must be submitted using the conference submission site by the abstract deadlines, posted in the deadlines section or key dates page. (You have until 11:59 p.m. Any Time Zone).

  • Accepted abstracts (maximum 250 words) will be posted on the conference website. For authors requesting a talk you should upload a long abstract of up to two (2) pages in PDF format during the submission process. *Do not submit more than two pages* It is best to check individual COSI websites for specific information regarding program goals and guidelines.

    PLEASE note: CAMDA submitters should include a 3-5 page long abstract PDF

  • The 250 word abstract should be a brief and concise summary of the background/motivation for the study, method and result(s), as well as conclusion/statement of significance. The abstract is submitted through the EasyChair submission system. If you are submitting for a talk a PDF (2 pages maximum) can be uploaded to provide additional details including figures and graphics that support the main points of the abstract.

    Your abstract must not contain your title/position or any personal information (e.g. affiliation). These details will be collected through the EasyChair submission system.

    Selected Abstract talks will be presented in tracks that run in parallel at the conference. The length of a talk is determined by individual COSI tracks. Oral presentation schedules for abstracts will be announced after acceptance notifications directly by the COSI track organizers.

- top -


 

Abstracts: Review Criteria

All submissions will be evaluated by a program committee. Submitters may choose to have their submission reviewed for talk or poster or poster only. If you select talk or poster and are unsuccessful for a talk your abstract will automatically be considered for a poster.

Posters advertising commercial software will be rejected from this poster session and/or removed from the conference without notice. There is a separate space for such posters in the exhibition area. If you have an industry poster, contact This email address is being protected from spambots. You need JavaScript enabled to view it..

- top -


Poster Details for ISMB/ECCB 2025

Scientific Research Exchange - Posters Presentation Hours
10:00 AM - 11:00 AM & 4:00 PM - 5:00 PM July 21-24, 2025

Poster schedule can be found on the poster page once available.

Poster Display Size

Recommended poster size is A0 size (84.1cm wide and 118.9cm tall).

Click to enlarge
*No onsite poster printing is available*

- top -


Late-breaking Posters

After the initial submission deadline, late-breaking posters can be submitted.  The submission window is notably smaller, be sure not to miss deadlines posted in the deadlines section or key dates page, as extensions cannot be granted.

Institutional Research

Not-for-profit research institutes and universities are able to submit their research findings as part of the call for posters. This will be displayed within the topic area of Institutional Research.

Reviews of fields or techniques

Individuals may choose to submit a poster presenting a review of a specific field or technique. These submissions do not need to contain original work from the author and are intended to be an introduction for individuals not familiar with the field or technique. These review are not meant to be advertisements (e.g. it is not acceptable that the poster is an advertisement for a book or other type of commercial publication).

- top -


FAQ

Q: Can I submit more than one abstract?
A: Yes, but the same person may not deliver more than one talk on the same topic. We encourage labs to involve multiple presenters. The same talk is not permitted to be given more than once as an oral presentation at ISMB. Research may be presented as both a talk and a poster if and only if it is submitted as two individual submissions with TWO different presenters. Having the same person deliver two (or more) different talks is NOT PERMITTED on the same topic. This allows for many people from one lab to present the work of the lab. A presenting author may present NO MORE THAN ONE talk or poster on the same topic.

Q: Should I submit a PDF of my poster when completing my submission?
A: No, please only submit the poster abstract. Closer to the conference all accepted poster presenters will upload a Poster PDF and short 5-7 minute MP4 talk to the virtual conference platform.

Q: Can previously accepted work be considered for a poster?
A: Yes, we do ask that you note the year the work was published and provide the Digital Object Identifier (DOI) during the submission process.

- top -


Our Partnership with Showcare

Showcare is a trusted partner of ISCB. ISCB has authorized Showcare to build external pages like this one in hopes to streamline your experience and enhance every step of your journey with ISCB.

- top -

Links within this page: Key Dates | Details | Format | Acceptance and Confirmation



Click Here to Submit

Prior to submitting please review the ISCB Exhibitor and Technology Track guidelines for presentations available here.

The Technology Track allows organizations to showcase their software and/or hardware relevant to the bioinformatics/molecular biology community.

For a variety of reasons, ISCB strongly prefers that talks accepted for oral presentation be presented in-person at the conference venue. Of course, we understand that some presenters will have good reasons to avoid in-person attendance. ISCB will grant remote presentation options for reasons associated to maternity/paternity leave, care for a family member, personal/medical disability, sickness, financial hardship, or potential visa problems. If your research is accepted for oral presentation and you are unable to present in person, ISCB requires notification at the time of acceptance.You will be asked during your confirmation of participation to confirm your in-person participation. If unable to participate you will need to request a waiver by writing This email address is being protected from spambots. You need JavaScript enabled to view it.

- top -


Key Dates

- top -


Details

To conduct a Technology Track presentation, organizations must complete the on-line sign-up form including a brief description (50 words maximum) of the talk and a one (1) page Technology Track presentation overview (PDF) which will provide the delegate with additional details about the technology being presented and include a one page CV of the presenter as related to the topic. The presenter of the Technology Track must be a registered and paid delegate of the conference.

The cost to present a Technology Track is:

For-profit organization 20 minutes: $2000 USD
For-profit organization 40 minutes: $3000 USD
Not-for-profit organization 20 minutes: $620 USD
Not-for-profit organization 40 minutes: $840 USD

  • Please note payment is due no later than June 20, 2025, to ensure the talk is included in the conference program schedule
  • Presentation fees do not include conference registration
  • The Technology Track schedule is set by the organizers with sponsors given scheduling preference for presentation time slots
  • Each presentation is listed on the conference website and includes an abstract and a full page PDF overview

- top -


Format

Presentations occur as part of the parallel track system. Individual time slots are on a 20 minutes schedule (3 talks per hour).

Within your time slot, you are at liberty to demonstrate your software in any way that you see fit: for example, you can combine one or more structured talks with live demos and question-and-answer sessions.

- top -


Acceptance and Confirmation

All submissions will be reviewed by the Technology Track selection committee and notification of acceptance will be forwarded no later than May 20, 2024. Technology Track presenters will be contacted by the conference organizers regarding presentation acceptance, Technology Track schedule, logistics, and payment. Payment should not be made until organizations receive notification of acceptance.

For information on Technology Tracks contact This email address is being protected from spambots. You need JavaScript enabled to view it.

- top -

Links within this page: John Jumper | Amos BairochJames Zou | Charlotte DeaneDavid Baker | Fabian Theis



John Jumper

Google DeepMind
United Kingdom

Introduced by: TBD
Time: Sunday, July 20, 2025 at 18:30-19:30
Room: 1A

Title TBD

Abstract coming soon.

Biography

John Jumper received his PhD in Chemistry from the University of Chicago, where he developed machine learning methods to simulate protein dynamics. Prior to that, he worked at D.E. Shaw Research on molecular dynamics simulations of protein dynamics and supercooled liquids. He also holds an MPhil in Physics from the University of Cambridge and a B.S. in Physics and Mathematics from Vanderbilt University. At Google DeepMind, John is leading the development of new methods to apply machine learning to protein biology. John has won numerous awards for his work, including the Lasker Award, Breakthrough Prize in Life Sciences, the Canada Gairdner International Award, and the 2024 Nobel Prize in chemistry.

- top -

ISCB 2025 Accomplishments by a Senior Scientist winner:

Amos Bairoch

SIB Swiss Institute of Bioinformatics
Switzerland


Introduced by: TBD
Time: Monday, July 21, 2025 at 09:00-10:00
Room: 1A

Plus ça change, plus c'est la même chose": from Swiss-Prot to Cellosaurus, 40 years of biocuration

Abstract coming soon.

Biography

Amos Bairoch is an Emeritus Professor at the University of Geneva and a group leader at the SIB Swiss Institute of Bioinformatics. A pioneer in bioinformatics, he is best known for developing foundational protein and genome databases, including UniProtKB/Swiss-Prot, which have transformed how biological data is curated and used worldwide.

Throughout his career, Amos has led efforts to enhance protein sequence annotation and develop widely used computational tools, shaping the field of molecular biology. His contributions to knowledge sharing in bioinformatics continue to influence research and innovation globally.

- top -


ISCB 2025 Overton Prize winner:

James Zou

Stanford University
United States


Introduced by: TBD
Time: Tuesday, July 22, 2025 at 09:00-10:00
Room: 1A

Title TBD

Abstract coming soon.

Biography

James Zou is an associate professor of biomedical data science at Stanford University, where he develops cutting-edge machine learning and AI techniques for applications in genomics and biomedical research. His work spans algorithmic advancements, ethical AI in healthcare, and precision medicine.

James has made key contributions to deep learning for biological data, interpretable AI, and fair and robust machine learning models for clinical applications. His research helps bridge computational methods with real-world medical impact.

- top -


Charlotte Deane

University of Oxford
United Kingdom
https://www.stats.ox.ac.uk/~deane/

Introduced by: TBD
Time: Wednesday, July 23, 2025 at 09:00-10:00
Room: 1A

Title TBD

Abstract coming soon.

Biography

Charlotte Deane MBE is a Professor in the Department of Statistics at the University of Oxford and the Executive Chair of the Engineering and Physical Sciences Research Council (EPSRC).

From 2022 to 2023, Charlotte was Chief AI Officer at Exscientia, a biotech with ~450 employees, where she led its computational scientific development.

She served on SAGE, the UK Government’s Scientific Advisory Group for Emergencies, during the COVID-19 pandemic, and acted as UK Research and Innovation’s COVID-19 Response Director.

At Oxford, Charlotte leads the Oxford Protein Informatics Group (OPIG), who work on diverse problems across immunoinformatics, protein structure and small molecule drug discovery; using statistics, AI and computation to generate biological and medical insight.

Her work focuses on the development of novel algorithms, tools and databases that are openly available to the community. These tools are widely used web resources and are also part of several Pharma drug discovery pipelines. Charlotte is a member of several advisory boards and has consulted extensively with industry. Additionally, she has established a consulting arm within her research group as a way of promoting industrial interaction and use of the group’s software tools.

- top -


David Baker

University of Washington
United States
https://www.bakerlab.org/

Introduced by: TBD
Time: Thursday, July 24, 2025 at 16:20-18:00
Room: 1A

Title TBD

Abstract coming soon.

Biography

Nobel Laureate David Baker is a professor of biochemistry, HHMI investigator, and the director of the Institute for Protein Design at the University of Washington School of Medicine. The Baker Lab develops protein design software and uses it to create molecules that solve challenges in medicine, technology, and sustainability. Among his recent work is the development of powerful machine-learning methods for generating functional proteins.

David is also an adjunct professor of genome sciences, bioengineering, chemical engineering, computer science, and physics at the University of Washington. He has published over 640 research papers, co-founded 21 companies, and been awarded more than 100 patents. Ninety of his mentees have gone on to independent faculty positions.

David is an elected member of the National Academy of Sciences and a recipient of numerous awards, including the 2024 Nobel Prize in Chemistry. TIME named him among the world’s 100 Most Influential People in health.

He received his PhD in biochemistry with Randy Schekman at the University of California, Berkeley, and did postdoctoral work in biophysics with David Agard at UCSF.

- top -


ISCB 2025 Innovator Award winner:

Fabian Theis

Helmholtz Munich
Germany
http://comp.bio



Introduced by: TBD
Time: Thursday, July 24, 2025 at 16:20-18:00
Room: 1A

Decoding cellular systems: From observational atlases to generative interventions

Over the past decade, the field of computational cell biology has undergone a transformation — from cataloging cell types to modeling how cells behave, interact, and respond to perturbations. In this talk, I will review and explore how machine learning is enabling this shift, focusing on two converging frontiers: integrated cellular mapping and actionable generative models.

I’ll begin with a brief overview of recent advances in representation learning for atlas-scale integration, highlighting work across the Human Cell Atlas and beyond. These efforts aim to unify diverse single-cell and spatial modalities into shared manifolds of cellular identity and state. As one example, I will present our recent multimodal atlas of human brain organoids, which integrates transcriptomic variation across development and lab protocols.

From there, I’ll review the emerging landscape of foundation models in single-cell genomics, including our work on Nicheformer, a transformer trained on millions of spatial and dissociated cells. These models offer generalizable embeddings for a range of tasks—but more importantly, they set the stage for predictive modeling of biological responses.

I’ll close by introducing perturbation models leveraging generative AI to model interventions on these systems. As example I will show Cellflow, a generative framework that learns how perturbations such as drugs, cytokines or gene edits — shift cellular phenotypes. It enables virtual experimental design, including in silico protocol screening for brain organoid differentiation. This exemplifies a move toward models that not only interpret biological systems, but help shape them.

Biography

Fabian Theis is the Director of the Computational Health Center at Helmholtz Munich and a full professor at the Technical University of Munich. A leading expert in computational biology, he applies machine learning to biomedical data, with a particular focus on single-cell analysis and its implications for precision medicine.

Fabian’s work bridges AI, genomics, and healthcare, pioneering methods that drive biological discovery and advance our understanding of human health. His research has contributed to major breakthroughs in single-cell transcriptomics and the integration of deep learning into biomedical sciences.

- top -