Leading Professional Society for Computational Biology and Bioinformatics
Connecting, Training, Empowering, Worldwide

Birds of a Feather (BoF) - ISMB 2018

On Leadership and Management: focus on mentorship

Room: TBA Saturday July 7 (12:45 pm - 1:45 pm)

Organizers:

Leader: Lucia Peixoto, Washington State University, United States

Overview

In this Career Development session we will discuss what it means to be a good mentor and how it influences the success of an independent research group.

Panel

Trey Ideker, University of California San Diego, United States
Casey Greene, University of Pennsylvania, United States
Lucia Peixoto, Washington State University, United States
Terry Gaasterland, University of California San Diego, United States


Informatics for Precision Medicine

Room: TBA Saturday July 7 (12:45 pm - 1:45 pm)

Organizers:

Leader: Jake Chen, University of Alabama at Birmingham, United States

Overview

What roles can informatics play in precision medicine? Among these roles, which areas have been successful so far? What are the most critical technologies that still need to be developed? How should we prepare for the upcoming challenges?


ISCB Equity, Diversity, and Inclusion Task Force Update and Breakout Sessions

Room: TBA Saturday July 7 (12:45 pm - 1:45 pm)

Organizers:

Leaders: Madelaine Gogol, Kieran O'Neill, Aurora Blucher

Overview

The ISCB Equity, Diversity, and Inclusion Task Force will provide a brief update on current efforts and hold breakout sessions to brainstorm solutions and solicit feedback on priority action items such as EDI workshops at ISMB and conference childcare.

Co-Leaders:
Madelaine Gogol, Stowers Institute for Medical Research
Kieran O'Neill, University of British Columbia / BC Cancer
Aurora Blucher, Oregon Health & Science University


Cytoscape Community Meeting: Latest updates and Roadmap

Room: TBA Saturday July 7 (12:45 pm - 1:45 pm)

Organizers:

Leader: Barry Demchak, University of California at San Diego

Overview

The Cytoscape Consortium will host an open public meeting for the community of users, app developers and scripters to learn about the latest features and to engage with core developers on the roadmap for the future. Whether you are new to Cytoscape or a long-time power user, you are welcome to join.


JPI Career Development: Funding opportunities for Early Career Researchers

Room: TBA Monday, July 9 (12:45 pm - 1:45 pm)

Organizers:

Leader: Lucia Peixoto, Washington State University, United States

Overview

Establishing independently funded lines of research is a top priority for young principal investigators. Join program officers from NIH and NSF to find out more about funding opportunities relevant to the computational biology community, with an emphasis on early career investigators.

Panelists:

Jennifer Walsh Weller, PhD, National Science Foundation, United States
Xujing Wang, PhD, NIH/NIDDK, United States
Haluk Resat, PhD, NIH/NIGMS, United States


Critical assessment communities

Room: TBA Monday, July 9 (12:45 pm - 1:45 pm)

Organizers:

Leader: Steven E Brenner, UC Berkeley, United States

Overview

Community assessment has emerged as an effective framework for evaluating and developing methodologies, especially through experiments in which participants are challenged to solve biological problems such as determining the phenotypic consequences of genomic variation, protein structure, and system perturbations. This BoF represents an unprecedented gathering of a diverse range of critical assessment organizations.


ISMB 2018 - Tutorials

Attention Presenters - please review the Speaker Information Page available here

 

Tutorial AM1: Single cell RNA-seq toolkit

(SOLD OUT)
July 6, 2018, 9:00 am - 1:00 pm

Room: Grand Ballroom A

Presenters

Tyler Faits, Boston University, United States
Matan Hofree, Broad Institute, United States
Ayshwarya Subramanian, Broad Institute, United States
Alex Tsankov, Broad Institute, United States

Overview

Single cell transcriptomics has emerged as a powerful tool to identify and interrogate novel cell types in homeostatic and perturbed states. Unlike bulk transcriptomics, single cell data provides resolution at the level of individual cells while working with much smaller quantities of RNA. As such, analysis of single cell RNA sequencing (scRNA-seq) data presents challenges of scale and technical noise, while providing the resolution necessary to pursue novel questions that earlier technologies did not allow.

The objective of the tutorial is to provide an overview of the laboratory and computational challenges involved in generating and analyzing scRNA-seq data. Participants will be introduced to popular molecular technologies for generating scRNA-seq data, and gain hands-on experience with existing software tools and computational methods for its analysis. The tutorial will briefly introduce approaches for preprocessing of scRNA-seq data, including demultiplexing, sequence alignment, and quality control. Then, starting from a cell x gene expression matrix, participants will learn standard methods to infer heterogeneity by identifying clusters of cells and perform analyses to assign cell identity and function. Participants will also be introduced to specialized analytical methods for exploring expression signatures of cell states, cellular differentiation trajectories, inference of cellular localization, and modern methods targeted towards better understanding of cancer biology. Analyses will be performed by executing commands in RStudio as well as leveraging newly developed point-and-click graphical R/Shiny interfaces.
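As a rough illustration of the quality-control and normalization steps that precede clustering of a cell x gene matrix, here is a minimal sketch in plain Python. It is not the tutorial's material (the session used RStudio and R/Shiny); the function names, the toy counts, the `min_genes` cutoff and the 10,000-count scale factor are all assumptions chosen for the example.

```python
import math

def filter_cells(counts, min_genes=2):
    """Keep cells (rows) in which at least `min_genes` genes have a nonzero count."""
    return [cell for cell in counts if sum(1 for c in cell if c > 0) >= min_genes]

def normalize_log(counts, scale=10_000):
    """Scale each cell to `scale` total counts, then apply log(1 + x)."""
    normalized = []
    for cell in counts:
        total = sum(cell)
        if total == 0:
            # A cell with no counts cannot be scaled; keep it as all zeros.
            normalized.append([0.0] * len(cell))
        else:
            normalized.append([math.log1p(c * scale / total) for c in cell])
    return normalized

# Hypothetical toy matrix: 3 cells (rows) x 3 genes (columns).
toy_counts = [
    [1, 3, 0],
    [0, 0, 5],
    [2, 2, 4],
]
kept = filter_cells(toy_counts, min_genes=2)  # drops the second cell
normalized = normalize_log(kept)
```

Library-size scaling followed by a log transform is one common convention for making cells with different sequencing depths comparable before clustering; dedicated scRNA-seq packages offer more refined alternatives.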

Audience

Familiarity with basic RNA-Seq data analysis and working knowledge of R.

Requirements

The tutorial will utilize web/cloud-based computing infrastructure with all software preinstalled, such that the only user requirement will be a personal laptop with the Google Chrome web browser installed. From within the Chrome web browser, users will access RStudio and additional web-based utilities for computation.

Schedule Overview
9:00-9:45 am Introduction: Tutorial infrastructure setup; Technologies for scRNA-seq data generation; Description of course datasets, case study and analysis questions
9:45-10:15 am Quality-control and preprocessing; introduction to scRNA-seq data structures in R
10:15-11:00 am Basic analyses of scRNA-seq data; batch effect correction, clustering and inference of cell-types
11:00-11:15 am Coffee Break
11:15-11:45 am Cell cluster-based differential expression and pathway analysis
11:45 am-12:30 pm Interactive tools for visualization and scRNA-seq analysis
12:30-1:00 pm Specialized scRNA-seq applications, currently available resources, and data repositories
   

Capacity

50

Presenter Bios

Matan Hofree, Broad Institute, United States
Dr. Hofree completed his PhD at UC San Diego under the supervision of Trey Ideker, developing approaches for improved inference, classification and biological subtype discovery in cancer, using prior biological knowledge encoded in gene interaction networks. He received his B.Sc. in Computer Science and Computational Biology at the Hebrew University of Jerusalem, Israel. Presently, Dr. Hofree is working under the mentorship of Dr. Regev, developing computational techniques for single cell transcriptomics and studying how transcriptional plasticity and heterogeneity drive diverse tumors.

Ayshwarya Subramanian, Broad Institute, United States
Ayshwarya Subramanian completed her PhD at Carnegie Mellon University under the supervision of Russell Schwartz. Her dissertation focused on developing computational methods for resolving heterogeneity in high-throughput data from tumors, detecting progression markers, and using these markers for phylogenetic inference. Her postdoctoral training was completed at the Harvard T.H. Chan School of Public Health’s Department of Biostatistics, with Curtis Huttenhower and Rafael Irizarry, where she developed probabilistic models of transcriptional activity states from bulk transcriptome sequencing data and approaches for analysis of metagenomic data. She is a computational scientist at the Broad Institute with Anna Greka and Aviv Regev, working on understanding kidney biology and disease using single cell transcriptomics.

Alex Tsankov, Broad Institute, United States
Alex Tsankov completed his M.S. and Ph.D. in electrical engineering and computer science at the Massachusetts Institute of Technology. Under the mentorship of Aviv Regev and Oliver Rando, his Ph.D. thesis characterized the role of nucleosome positioning in the evolution of Ascomycota yeasts. His postdoctoral training in Alex Meissner’s lab at Harvard University focused on understanding transcription factor and epigenetic dynamics during differentiation of human embryonic stem cells into the three germ layers and also led to the creation of a quantitative assay of functional potency called the ScoreCard. Presently, Dr. Tsankov is a computational scientist at the Broad Institute and is using single cell transcriptomics to build a cellular atlas of the human lung and to study transcriptional heterogeneity and metastasis in lung cancer.

Tyler Faits, Boston University, United States
Tyler Faits is a Ph.D. candidate in the Bioinformatics Program at Boston University. His thesis, supervised by Dr. Evan Johnson, is focused on creating tools for the optimization of single cell RNA-sequencing experimental designs, and developing interactive portals for transcriptomic data in applications that include single cell RNA-sequencing and metatranscriptomic data analysis.


Tutorial AM2: Machine learning methods in the analysis of genomic and clinical data

(SOLD OUT)
July 6, 2018, 9:00 am - 1:00 pm

Room: Grand Ballroom B

Presenters

Felipe Llinares-López, ETH Zurich, Basel, Switzerland
Damian Roqueiro, ETH Zurich, Basel, Switzerland

Website: https://www.bsse.ethz.ch/mlcb/education/tutorial-ismb18.html

Overview

This tutorial covers various machine learning (ML) tools that have been developed for the analysis of genomic and clinical data. It is an intermediate level tutorial targeted to an audience with previous experience in diverse bioinformatics methods such as: i) genome-wide association studies, ii) comparison of structured data such as graphs or time-series, and iii) traditional text mining. State-of-the-art methods and their applications are presented. We will also discuss illustrative examples of how deep learning is currently being used in the analysis of biomedical data.
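To make the idea of comparing structured data such as graphs more concrete, the sketch below implements one of the simplest graph kernels, the vertex label histogram kernel: two graphs are compared by the inner product of their node-label counts. This is an illustrative assumption about what such a comparison can look like, not code from the tutorial; the function names and toy label lists are hypothetical.

```python
from collections import Counter
import math

def vertex_histogram_kernel(labels_a, labels_b):
    """Inner product of the node-label count vectors of two graphs."""
    hist_a = Counter(labels_a)
    hist_b = Counter(labels_b)
    # Counter returns 0 for labels that are absent from the other graph.
    return sum(count * hist_b[label] for label, count in hist_a.items())

def normalized_kernel(labels_a, labels_b):
    """Cosine-style normalization so that every graph has self-similarity 1."""
    k_ab = vertex_histogram_kernel(labels_a, labels_b)
    k_aa = vertex_histogram_kernel(labels_a, labels_a)
    k_bb = vertex_histogram_kernel(labels_b, labels_b)
    return k_ab / math.sqrt(k_aa * k_bb)

# Hypothetical node labels of two small molecular graphs (atom types).
graph_1 = ["C", "C", "O", "N"]
graph_2 = ["C", "O", "O"]

similarity = vertex_histogram_kernel(graph_1, graph_2)  # 2*1 (C) + 1*2 (O) + 1*0 (N) = 4
```

This kernel ignores edges entirely; richer kernels (for example Weisfeiler-Lehman subtree kernels) also account for neighborhood structure at a higher computational cost.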

Audience

Beginner or intermediate. For hands-on sessions, programming experience in R or Python is required.

Collect your name badge July 6 between 8:00 am - 8:45 am at the Conference Registration Desk, Ballroom Foyer, East Tower (lower level) Hyatt Regency Chicago.

Participant Requirements

For the hands-on session, if you wish to follow the steps we present, you will need one of the following installed on your laptop:
- R 3.4 or newer
(or)
- Python 2.7 or 3

Schedule Overview
9:00 - 9:10 am Damian Roqueiro Introduction: ML in Bioinformatics. Overview of topics presented in the session.
9:10 - 10:10 am Felipe Llinares-López Module I: Significant pattern mining (SPM) and pruning the search space in association studies to increase statistical power
10:10 - 11:00 am Damian Roqueiro Hands-on session: applying SPM on genomic data with the package “sigPatSearch”
11:00 - 11:15 am Coffee break
11:15 - 12:00 pm Damian Roqueiro Module II: ML methods to compare structured biomedical data such as strings, graphs and time series.
12:00 - 12:30 pm Felipe Llinares-López Hands-on session: Computing graph kernels with the package “graphKernels”
12:30 - 1:00 pm Damian Roqueiro and Felipe Llinares-López Module III: Deep learning and its applications to biomedical data. Illustrative examples, with a focus on text mining and processing of electronic health records.
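Module I's theme of pruning the search space to gain statistical power can be sketched with Tarone's testability argument: a pattern whose smallest attainable p-value already exceeds the corrected threshold can never be significant, so it need not count toward the multiple-testing correction. The code below is a simplified sketch under stated assumptions (a one-sided Fisher's exact test for enrichment in cases, pattern supports given up front); it is not the implementation used by the sigPatSearch package, and all function names are hypothetical.

```python
from math import comb

def min_attainable_p(support, n_cases, n_total):
    """Smallest one-sided Fisher p-value for a pattern present in `support` samples.

    The most extreme table places as many occurrences as possible among the cases.
    """
    n_controls = n_total - n_cases
    a_max = min(support, n_cases)
    return comb(n_cases, a_max) * comb(n_controls, support - a_max) / comb(n_total, support)

def tarone_threshold(supports, n_cases, n_total, alpha=0.05):
    """Return (threshold, k): the smallest k such that at most k patterns are testable at alpha / k."""
    min_ps = [min_attainable_p(s, n_cases, n_total) for s in supports]
    k = 1
    while sum(p <= alpha / k for p in min_ps) > k:
        k += 1
    return alpha / k, k

# Hypothetical study: 20 samples, 10 cases, four candidate patterns with these supports.
threshold, k = tarone_threshold([1, 2, 5, 10], n_cases=10, n_total=20)
```

In this toy setting only two of the four patterns are testable, so the corrected threshold is 0.05 / 2 = 0.025 rather than the Bonferroni value 0.05 / 4 = 0.0125.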

Capacity

40

Presenter Bios

Felipe Llinares-López, ETH Zurich, Basel, Switzerland
Felipe Llinares-López is a PhD student in the Machine Learning and Computational Biology lab at ETH Zurich. The main focus of his PhD research has been the development of algorithms to assess the statistical association between a target of interest and high-order interactions between features, and applying these methods to selected problems in computational biology, such as genome-wide association studies.

Damian Roqueiro, ETH Zurich, Basel, Switzerland
Damian Roqueiro is a postdoc at the Machine Learning and Computational Biology lab at ETH Zurich. His research has been focused on the development and application of machine learning techniques to better understand the association between specific diseases and the genetic makeup of individuals afflicted by those diseases.


Tutorial AM3: Integrated network analysis: Cytoscape automation using R and Python

(SOLD OUT)
July 6, 2018, 9:00 am - 1:00 pm

Room: Columbus IJ

Presenters

Alexander Pico, Gladstone Institutes, United States
John “Scooter” Morris, UCSF, United States
Barry Demchak, UCSD, United States

Overview

Cytoscape is one of the most popular applications for network analysis and visualization. In this workshop, we will demonstrate new capabilities to integrate Cytoscape into programmatic workflows and pipelines using R and Python. We will begin with an overview of network biology themes and concepts, and then we will translate these into Cytoscape terms for practical applications. The bulk of the workshop will be a hands-on demonstration of accessing and controlling Cytoscape from R and Python to perform a network analysis of tumor expression and variant data.
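For readers unfamiliar with what driving Cytoscape from a script involves, the following sketch builds a CyREST Commands request using only the Python standard library. It assumes Cytoscape is running locally with CyREST on its default port 1234 and that the stringApp is installed; the argument names in the usage comment (`disease`, `cutoff`, `species`, `limit`) follow the stringApp's "disease query" command as we understand it and should be checked against the installed version. The helper names are hypothetical.

```python
import json
import urllib.parse
import urllib.request

CYREST_BASE = "http://localhost:1234/v1"  # assumed default CyREST address

def command_url(namespace, command, base=CYREST_BASE):
    """Build the Commands endpoint URL, e.g. .../commands/string/disease%20query."""
    return f"{base}/commands/{urllib.parse.quote(namespace)}/{urllib.parse.quote(command)}"

def build_command_request(namespace, command, args, base=CYREST_BASE):
    """Create (but do not send) a POST request carrying the command arguments as JSON."""
    body = json.dumps(args).encode("utf-8")
    return urllib.request.Request(
        command_url(namespace, command, base),
        data=body,
        headers={"Content-Type": "application/json"},
        method="POST",
    )

def run_command(namespace, command, args, timeout=60):
    """Send the command to a running Cytoscape instance and decode the JSON reply."""
    request = build_command_request(namespace, command, args)
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))

# Example call (requires a running Cytoscape with stringApp; argument names are assumptions):
# run_command("string", "disease query",
#             {"disease": "breast cancer", "cutoff": "0.9",
#              "species": "Homo sapiens", "limit": "150"})
```

Dedicated client packages for R and Python wrap these HTTP calls; the raw requests are shown here only to make the mechanism visible.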

Learning Objectives

By the end of the tutorial, you should be able to:
• Know when and how to use Cytoscape in your research area
• Identify and discriminate between relevant sources of interactions, networks and datasets
• Control Cytoscape programmatically
• Integrate Cytoscape into your bioinformatics pipelines
• Publish, share and export networks
• Generalize network analysis methods to multiple problem domains

Schedule Overview
9:00-9:20 am Introductory (20 min)
  • Quick introductions: presenters & audience
  • General network biology perspective and applications
  • Cytoscape introduction
9:20-10:30 am Getting relevant networks
  • Types of networks, sources, and relevant apps
  • How to choose a network source
  • Hands-on exercise: STRING, NDEx, WikiPathways
10:30-11:00 am Intermediate (30 min)
  • Driving Cytoscape from R and Python
    • Overview of Cytoscape automation
    • Launch Cytoscape and connect
  • Getting Disease Networks
    • Query STRING database from R and Python via CyREST
11:00-11:15 am Coffee break
11:15 am-12:15 pm Advanced (60 min)
  • Interacting with Cytoscape following R and Python vignettes
    • CyREST and Commands
    • R and Python packages
  • Visualizing data on networks
    • Loading multiple data types into Cytoscape
    • Setting visual styles
  • Subnetwork selection
    • Data-driven and diffusion-based subnetworks
  • Saving, sharing and publishing
    • Session files, images and web export
12:15-1:00 pm Additional Topics and Q&A (45 min)
  • More docs, more exercises
  • New features planned for Cytoscape 3.7
  • CyBrowser & web integration

Intended audience

This tutorial is intended for an audience that has prior experience with at least one of the following:
• Cytoscape software
• Network biology concepts
• Bioinformatics analysis using R or Python

Participant requirements

Participants are required to bring a laptop with Cytoscape, R, RStudio and Python installed. Installation instructions will be provided in the weeks preceding the tutorial.

Capacity

40

Presenter Bios

Alexander Pico, Gladstone Institutes, United States
Alex is the Executive Director of the National Resource for Network Biology, the Vice President of the Cytoscape Consortium, and Associate Director of Bioinformatics at Gladstone Institutes. He has been a contributing member to Cytoscape since 2006 and has led numerous Cytoscape and Network Biology workshops and mentoring programs over the past 10 years.

John “Scooter” Morris, UCSF, United States
Scooter is the Executive Director of the Resource for Biocomputing, Visualization, and Informatics at UCSF, the “Roving Engineer” for Cytoscape, and an Adjunct Assistant Professor of Pharmaceutical Chemistry at UCSF. He has given numerous presentations on using and extending Cytoscape and is a Cytoscape core developer as well as the developer of over a dozen Cytoscape apps, including chemViz, structureViz, clusterMaker, and cddApp.

Barry Demchak, UCSD, United States
Barry is the Chief Architect of Cytoscape, Secretary/Treasurer of the Cytoscape Consortium and Project Manager in the Ideker lab at UCSD. He has been a contributing member to Cytoscape development since 2012 and has led numerous Cytoscape and Network Biology workshops and mentored projects over the past 5 years.


Tutorial AM4: Computational methods for comparative regulatory genomics

(SOLD OUT)
July 6, 2018, 9:00 am - 1:00 pm

Room: Columbus KL

Presenters

Saurabh Sinha, Institute of Genomic Biology, University of Illinois, Urbana-Champaign, United States
Colin Dewey, Genome Center of Wisconsin, University of Wisconsin-Madison, United States
Siavash Mirarab, Center for Microbiome Innovation, University of California, San Diego, United States
Ferhat Ay, La Jolla Institute for Allergy and Immunology, University of California, San Diego, United States
Sushmita Roy, Wisconsin Institute for Discovery, University of Wisconsin-Madison, Madison, United States

Learning Objectives

• Gain an overview of key challenges arising in the comparative analysis of molecular data at the sequence, expression, chromatin, and network level.
• Learn about recent algorithms, software tools and their applications to tackle these challenges.

Schedule Overview
9:00-9:40 am Whole genome alignment
9:40-10:20 am Identification and comparative analysis of regulatory sequence elements
10:20-11:00 am Phylogenetic tree construction
11:20-12:00 pm Comparative analysis of chromatin state and 3D genome organization
12:00-12:40 pm Inference and comparative analysis of transcriptional regulatory networks
12:40-1:00 pm Tutorial wrap up

Audience

Beginner or intermediate

Participant requirements

None

Capacity

40

Presenter Bios

Saurabh Sinha, Institute of Genomic Biology, University of Illinois, Urbana-Champaign, United States
Saurabh Sinha is a Professor of Computer Science at the University of Illinois Urbana-Champaign. His research focuses on regulatory and comparative genomics and has been supported by NIH, NSF and USDA. He is co-Director of the NIH BD2K Center of Excellence at the University of Illinois. He chairs the M.S. Bioinformatics program of the department, and leads the educational program of the Mayo Clinic-University of Illinois Alliance. He serves as Program co-Chair of the RECOMB Regulatory and Systems Genomics (RSG) conference. He is an NSF CAREER award recipient and was recognized as a University Scholar in 2018.

Colin Dewey, Genome Center of Wisconsin, University of Wisconsin-Madison, United States
Colin Dewey is an Associate Professor in the Department of Biostatistics and Medical Informatics at UW-Madison, which he joined in 2006. His research focuses on the development of computational and statistical methodology for the analysis of biological sequence data, with RNA-seq data and whole genome sequences of particular interest. Among the methods his group has developed are RSEM (for RNA-seq transcript quantification), DETONATE (for de novo transcriptome assembly evaluation), and Mercator (for multiple whole-genome orthology mapping).

Siavash Mirarab, Center for Microbiome Innovation, University of California, San Diego, United States
Siavash Mirarab is an Assistant Professor in the Department of Electrical and Computer Engineering at the University of California, San Diego, where he has been since 2015. He obtained his Ph.D. from the Computer Science department at UT-Austin, where he was advised by Prof. Tandy Warnow. His dissertation received an honorable mention for the 2015 ACM Doctoral Dissertation Award and he is a recipient of the 2017 Sloan Research Fellowship in Computational & Evolutionary Molecular Biology. His lab develops methods for evolutionary computational biology, mostly targeting large-scale datasets. His specific areas of research span many topics, including reconstruction of species trees from gene trees (phylogenomics), large-scale multiple sequence alignment, HIV transmission network reconstruction, and metagenomic analyses using phylogenetic approaches.

Ferhat Ay, La Jolla Institute for Allergy and Immunology, University of California, San Diego, United States
Ferhat Ay is the Institute Leadership Assistant Professor of Computational Biology at the La Jolla Institute for Allergy and Immunology and an Assistant Adjunct Professor at the UC San Diego School of Medicine. His primary research areas are bioinformatics, computational biology, epigenomics, regulatory genomics and 3D/4D Nucleome. He has developed several methods to model the 3D structure of chromatin and its relation to gene regulation in several diseases including malaria and cancer.

Sushmita Roy, Wisconsin Institute for Discovery, University of Wisconsin-Madison, Madison, United States
Sushmita Roy is an assistant professor in the Biostatistics and Medical Informatics department and a faculty member at the Wisconsin Institute for Discovery, University of Wisconsin-Madison. Her lab focuses on the development and application of methods for inference and analysis of gene regulatory networks and their dynamics on developmental and evolutionary lineages. Sushmita is a recipient of the 2014 Alfred P. Sloan Foundation Fellowship, an NSF CAREER award, and a James S. McDonnell Foundation Scholar award.


Tutorial PM5: Visualization of large biological data

(SOLD OUT)
July 6, 2018, 2:00 pm - 6:00 pm

Room: Columbus IJ

Presenters

G. Elisabeta Marai, University of Illinois at Chicago, United States
Kay Nieselt, Center for Bioinformatics, University of Tübingen, Germany
Michael Krone, Center for Bioinformatics, University of Tübingen, Germany

Overview

The aim of this tutorial is to familiarise the participants with modern visual analytics methodologies applied to biological data and to provide simple hands-on training. Questions such as what is data visualization, what is visual analytics and how can biological data be visualised to gain insight are addressed, so that hypotheses can be generated or explored and further targeted analyses can be defined. Topics covered are:
• Digital/Electronic Visualisation of data
• Understanding color
• Visual Design Principles
• Examples of visualisation of biological data
• Challenges of large-scale biological data visualisation

Learning Objectives

• Understand the relationship between visual analysis and bioinformatics
• Use principles of human perception and cognition in visual biological data analysis
• Understand and use visual design principles
• Know the basics and do’s and don’ts of visualisation
• Critically evaluate data visual representations and suggest improvements and refinements
• Create simple web-based interactive visualizations using HTML 5, JavaScript, and possibly D3

Schedule Overview
2:00-2:15pm Welcome & Introduction to tutorial structure
2:15-2:45pm What is (electronic) visualization - Understanding color
  • Luminance
  • Color Choice: mapping Data to Color
2:45-3:30pm Visual design principles
  • Tufte’s design principles
  • small multiples
  • Shneiderman’s mantra
3:30-4:00pm Visualization software
  • general: D3, prefuse, javascript, and more
  • specific tools for biological data
4:00-4:15pm Break
4:15-4:45pm BioVis examples
  • Sequences
  • Macromolecules
  • Multivariate Data
  • Networks
4:45-5:15pm Introduction to HTML5 and Javascript
  • Generate a simple interactive, web-based visual analysis tool
5:15-6:00pm Introduction to D3
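The "Understanding color" slot above lists luminance as a topic. As a small illustration of why it matters when mapping data to color, the sketch below computes the relative luminance of an sRGB color using the standard sRGB linearization and Rec. 709 channel weights. It is written in Python rather than the JavaScript used in the session, and the function names are hypothetical.

```python
def srgb_to_linear(channel):
    """Convert one sRGB channel value in [0, 1] to linear light."""
    if channel <= 0.04045:
        return channel / 12.92
    return ((channel + 0.055) / 1.055) ** 2.4

def relative_luminance(red, green, blue):
    """Relative luminance in [0, 1] of an 8-bit sRGB color."""
    r, g, b = (srgb_to_linear(value / 255) for value in (red, green, blue))
    # Green contributes most to perceived brightness, blue the least.
    return 0.2126 * r + 0.7152 * g + 0.0722 * b

# Pure green is far brighter than pure blue at the same channel intensity.
green_luminance = relative_luminance(0, 255, 0)
blue_luminance = relative_luminance(0, 0, 255)
```

Because fully saturated hues differ so much in luminance, a color scale that varies hue alone can make some data values look more prominent than others, which is one reason luminance is usually considered when choosing a colormap.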

Audience

This course is designed for everyone who would like to learn and apply visualization techniques in the analysis of large biological data sets. The course provides useful background material on data visualization principles, but the focus is on methods and tools for visualization of next-generation sequencing data, other omics data and network data.

Participant Requirements

None, if participants just wish to listen. Those who would also like to work on the programming part should bring a laptop and have programming knowledge (e.g., Java, C++ or similar).

Capacity

40

Presenter Bios

G. Elisabeta Marai, University of Illinois at Chicago, United States
G. Elisabeta Marai is an Associate Professor of Computer Science at the University of Illinois at Chicago, affiliated with the Electronic Visualization Laboratory. Her research interests are in biomedical imaging, biology data visualization, and data visual analysis. Liz is a recipient of an NSF CAREER award, of multiple NSF and NIH R01 awards, and of multiple Outstanding Paper awards, and has co-created open-source software (RuleBender, MOSBIE) used by biologists at more than 40 institutions. She received her Ph.D. from Brown University in 2007.
http://evl.uic.edu/marai

Kay Nieselt, Center for Bioinformatics, University of Tübingen, Germany
Kay Nieselt received her PhD in Mathematics at the Max Planck Institute for Biophysical Chemistry in Göttingen, Germany. She has been a group leader at the Center for Bioinformatics Tübingen since 2002. Her main research interests are transcriptomics, small non-coding RNAs, ancient pathogenomics and visual analytics of life science data. Her visual analytics software products include Mayday, an open-source framework for transcriptome data analysis, GenomeRing for visualisation of multiple genomes, and Pan-Tetris, an interactive platform for pan-genomes. In 2015, together with Liz Marai, she was General Chair of the Symposium on Biological Data Visualization (BioVis, http://www.biovis.net) at ISMB.


Tutorial PM6: Deep learning for network biology

(SOLD OUT)
July 6, 2018, 2:00 pm - 6:00 pm

Room: Grand Ballroom A

Presenters

Marinka Zitnik, Stanford University, United States
Jure Leskovec, Stanford University, United States

Overview

Networks are ubiquitous in biology where they encode connectivity patterns at all scales of organization, from single-cell to population level. Network approaches have been used many times to combine and amplify signals from individual genes, and have led to remarkable discoveries in biology, including drug discovery, protein function prediction, disease diagnosis, and precision medicine. Mathematical machinery that is central to these approaches is machine learning on networks. The main challenge in machine learning on networks is to find a way to extract information about interactions between nodes and to incorporate that information into a machine learning model. To extract information from networks, classic machine learning approaches often rely on summary statistics (e.g., degrees or clustering coefficients) or carefully engineered features to measure local neighborhood structures (e.g., network motifs). These classic approaches can be limited because these hand-engineered features are inflexible, they often do not generalize to networks derived from other organisms, tissues and experimental technologies, and can fail on datasets with low experimental coverage.
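The summary statistics mentioned above are straightforward to compute by hand. The sketch below derives node degree and the local clustering coefficient from an undirected graph stored as a dictionary of neighbor sets; the toy graph and function names are hypothetical and serve only to illustrate the kind of hand-engineered feature the classic approaches rely on.

```python
def degree(adjacency, node):
    """Number of neighbors of `node`."""
    return len(adjacency[node])

def clustering_coefficient(adjacency, node):
    """Fraction of pairs of neighbors of `node` that are themselves connected."""
    neighbors = list(adjacency[node])
    k = len(neighbors)
    if k < 2:
        return 0.0  # fewer than two neighbors: no pairs to close
    links = sum(
        1
        for i in range(k)
        for j in range(i + 1, k)
        if neighbors[j] in adjacency[neighbors[i]]
    )
    return 2.0 * links / (k * (k - 1))

# Hypothetical undirected interaction network: a triangle A-B-C plus a pendant node D.
toy_network = {
    "A": {"B", "C"},
    "B": {"A", "C"},
    "C": {"A", "B", "D"},
    "D": {"C"},
}
features = {n: (degree(toy_network, n), clustering_coefficient(toy_network, n)) for n in toy_network}
```

Each node ends up described by a fixed pair of numbers, which is easy to feed into a classifier but captures only a narrow view of the node's neighborhood.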

Recent years have seen a surge in approaches that automatically learn to encode network structure into low-dimensional representations using transformation techniques based on deep learning and nonlinear dimensionality reduction. The idea behind these representation learning approaches is to learn a data transformation function that maps nodes to embeddings, points in a low-dimensional space. Deep representation learning methods have revolutionized the state-of-the-art in network science. This tutorial will investigate methods and case studies for analyzing biological networks and extracting actionable insights, and in doing so, it will provide attendees with a toolbox of next-generation algorithms for network biology.

Tutorial Website: http://snap.stanford.edu/deepnetbio-ismb

2:00-2:30 pm Part 1 – Introduction and overview of network biology
  • Biological network maps and interaction resources
  • Concepts of network theory
  • Organizing principles of network biomedicine (hubs, local principle, network parsimony principle, shared components principle)
  • Standard prediction tasks (node classification, link prediction, and node clustering)
2:30-3:30 pm Part 2 - Matrix factorization and network propagation
  • Matrix factorization and Laplacian eigenmaps
  • Random-walk embeddings (e.g., DeepWalk, node2vec, metapath2vec, struc2vec)
  • Integrative matrix factorization and propagation methods to improve performance
3:30-4:00 pm Part 3 - Introduction to graph autoencoders
  • Principles of graph autoencoder approaches (encoding, message passing, decoding)
4:00-4:15 pm Coffee Break
4:15-5:00 pm Part 4 - Graph autoencoders and deep representation learning
  • Detailed description of graph convolutional networks (GCNs)
  • Embedding nodes, entire graphs, and extensions for multimodal graphs
5:00-6:00 pm Part 5 - Applications in network biology and new directions
  • Single-cell genomics and gene regulation (e.g., clustering of cells, biomarker discovery)
  • Human disease (e.g., disease pathway discovery, multi-omic and clinical data)
  • Tissue-specific protein function prediction
  • Computational pharmacology (medical indications, polypharmacy side effects, drug repurposing)
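
To give a flavor of the graph convolutional networks listed in Part 4, here is a minimal sketch of a single propagation step using the commonly cited rule ReLU(D^-1/2 (A + I) D^-1/2 X W). It is written in plain Python for readability. The toy graph, feature matrix, and weight values are assumptions made for illustration and are not taken from the tutorial materials.

```python
import math


def matmul(a, b):
    """Multiply two matrices given as lists of rows."""
    return [
        [sum(a[i][k] * b[k][j] for k in range(len(b))) for j in range(len(b[0]))]
        for i in range(len(a))
    ]


def gcn_layer(adj, features, weights):
    """One propagation step: ReLU(D^-1/2 (A + I) D^-1/2 X W)."""
    n = len(adj)
    # Add self-loops so each node keeps its own features when aggregating.
    a_hat = [[adj[i][j] + (1.0 if i == j else 0.0) for j in range(n)] for i in range(n)]
    inv_sqrt_deg = [1.0 / math.sqrt(sum(row)) for row in a_hat]
    # Symmetric degree normalization of the adjacency matrix.
    norm = [
        [inv_sqrt_deg[i] * a_hat[i][j] * inv_sqrt_deg[j] for j in range(n)]
        for i in range(n)
    ]
    aggregated = matmul(norm, features)    # message passing over neighbors
    transformed = matmul(aggregated, weights)  # linear transformation
    return [[max(0.0, v) for v in row] for row in transformed]  # ReLU


# Hypothetical 3-node path graph 0 - 1 - 2 with 2 input features per node.
adjacency = [[0, 1, 0], [1, 0, 1], [0, 1, 0]]
features = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
weights = [[1.0, -1.0], [0.5, 0.5]]  # fixed here; learned by training in practice

hidden = gcn_layer(adjacency, features, weights)
```

Stacking several such steps lets information travel along longer paths in the network. In a real model the weights would be fitted by gradient descent rather than fixed by hand.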

Participant Overview

The tutorial will be of broad interest to researchers who work with network data coming from biology, medicine, and the life sciences. Graph-structured data arise in many different areas of data mining and predictive analytics, so the tutorial should be of theoretical and practical interest to a large part of the data mining and network science communities.

The tutorial will not require prior knowledge beyond fundamental concepts covered in introductory machine learning and network science classes. Attendees will come away with the broad knowledge necessary to understand state-of-the-art representation learning methods and to use these methods to solve central problems in network biology.

Presenter Bios

Marinka Zitnik, Stanford University, United States Marinka Zitnik is a postdoctoral fellow in Computer Science at Stanford University. Her research focuses on network science and representation learning methods for biomedicine. She received her PhD in Computer Science from the University of Ljubljana in 2015 while also conducting research at Imperial College London, the University of Toronto, and Baylor College of Medicine. She received outstanding research awards at the ISMB, CAMDA, RECOMB, and BC2 conferences, and is involved in projects at the Chan Zuckerberg Biohub.
Jure Leskovec, Stanford University, United States Jure Leskovec is an Associate Professor of Computer Science at Stanford University and a Chan Zuckerberg Biohub Investigator. His recent research focuses on biological and biomedical problems and on applications of network science to biomedicine and health. Jure received his PhD in Machine Learning from Carnegie Mellon University in 2008 and spent a year at Cornell University. His work has received five best paper awards, won the ACM KDD Cup, and topped the Battle of the Sensor Networks competition.


Tutorial PM7: High-throughput sequencing: Identification of disease variants in exomes and genomes

July 6, 2018, 2:00 pm - 6:00 pm

Room: Grand Ballroom B

Presenters

Francisco De La Vega, D.Sc. Stanford University & Fabric Genomics
Chad Huff, Ph.D. The University of Texas, MD Anderson Cancer Center
Suzanne Leal, Ph.D. Baylor College of Medicine
Mark Yandell, Ph.D. University of Utah
Yao Yu, Ph.D. The University of Texas, MD Anderson Cancer Center

Overview

With the advent and the continuous drop in cost of next-generation sequencing, whole exome (WES) and whole genome sequencing (WGS) have become the platforms of choice for the diagnosis of Mendelian disease. New clinical applications of genome sequencing continue to appear, such as the diagnosis of idiopathic disease and the rapid diagnosis of rare childhood diseases in neonatal/pediatric intensive care units. In the research setting, this technology makes it possible to explore the role of rare genetic variation in common, complex diseases through the sequencing of patient cohorts and case/control studies. A number of research studies are now generating WES or WGS data for sample sizes ranging from hundreds to thousands of cases. As the cost of sequencing drops further, the number of cases sequenced is expected to reach the hundreds of thousands, providing the statistical power to identify disease associations with rare variants. For example, Genomics England is well on its way to achieving its goal of sequencing 100,000 genomes, about half of these for rare genetic diseases and half for cancer patients. Regeneron Pharmaceuticals, in collaboration with the Geisinger Healthcare system, has already sequenced about 100,000 exomes, with the goal of reaching 250,000. In addition, Regeneron has recently proposed to build a coalition to sequence the ~250K cases of the Wellcome Trust Case Control Study. Many healthcare systems around the world are starting to conceive similar projects. A key aspect of these initiatives is that cases are sampled as part of the patients' healthcare: the data obtained will be used in aggregate to look for findings that can drive drug development and new therapeutic approaches, but a diagnostic result of immediate value to the patient should be provided as well. Finally, the NIH “All of Us” million-person project is starting to move forward, with the ultimate goal of sequencing the genomes of all participants.

Motivation

Identification of disease-causing variants, whether in clinical diagnostics or research studies, requires algorithms and statistical methods to score variants with respect to their likely relevance to the disease at hand. In diagnostics, these scores should allow clinicians to focus quickly on a relatively small number of candidate variants, examine the evidence for each, and classify them as either pathogenic or benign with respect to the patient’s disease phenotype. In research studies, the goal is to understand the role of genetic variation in complex disease, using methods that can aggregate the burden of many rare deleterious variants in key genes to assess their contribution to the trait. In addition, analysis methods should be able to identify donors harboring a Mendelian-like version of the disease, with ultra-rare variants of very strong effect: natural knock-outs of genes that, while they may not result in early developmental disease, may significantly influence late-onset disease, either by accelerating its onset or by protecting against it. A notable success of this paradigm was the finding of homozygotes for deleterious variants in the PCSK9 gene, a very rare genotype that protects carriers against cardiovascular disease (CVD). This finding led to the development of the latest class of CVD drugs, and the search for more cases like it is driving much of the genome sequencing undertaken by pharmaceutical companies. Analysis strategies for each of these cases are different, and yet they need to be considered together in the analysis of projects generating large-scale WGS/WES data from patients.
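
The idea of aggregating the burden of rare variants within a gene can be sketched with a simple collapsing test: each individual is flagged as a carrier if they hold at least one rare allele in the gene, and carrier counts in cases and controls are compared with a one-sided Fisher exact test. This is a generic illustration chosen for brevity, not the method implemented in VAAST, VAT, or the other tools covered in this tutorial. The genotypes, allele frequencies, and the 1% frequency threshold below are hypothetical.

```python
from math import comb


def collapse_carriers(genotypes, allele_freqs, maf_threshold=0.01):
    """Flag each individual carrying at least one rare allele in the gene.

    genotypes: one list per individual of alternate-allele counts (0, 1, or 2),
               with one entry per variant in the gene.
    allele_freqs: population allele frequency of each variant.
    """
    rare = [i for i, freq in enumerate(allele_freqs) if freq < maf_threshold]
    return [any(g[i] > 0 for i in rare) for g in genotypes]


def fisher_exact_greater(a, b, c, d):
    """One-sided Fisher exact p-value for the 2x2 table [[a, b], [c, d]].

    Returns P(X >= a) under the hypergeometric null, where X is the number
    of carriers among cases.
    """
    n_cases, n_carriers, n = a + b, a + c, a + b + c + d
    upper = min(n_cases, n_carriers)
    total = comb(n, n_carriers)
    tail = sum(
        comb(n_cases, x) * comb(n - n_cases, n_carriers - x)
        for x in range(a, upper + 1)
    )
    return tail / total


# Hypothetical gene with three variants; the second one is common and is ignored.
allele_freqs = [0.002, 0.2, 0.005]
cases = [[1, 0, 0], [0, 1, 1], [0, 0, 1], [0, 2, 0]]
controls = [[0, 1, 0], [0, 0, 0], [0, 2, 0], [0, 0, 1]]

case_flags = collapse_carriers(cases, allele_freqs)
control_flags = collapse_carriers(controls, allele_freqs)
a, b = sum(case_flags), len(cases) - sum(case_flags)            # carriers / non-carriers in cases
c, d = sum(control_flags), len(controls) - sum(control_flags)   # same for controls
p_value = fisher_exact_greater(a, b, c, d)
```

With only eight individuals the test has almost no power, which illustrates why rare variant association studies need the large cohorts described in the overview.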

Goals of the Tutorial

The goal of this tutorial is to present an overview of the current state of disease variant identification approaches, describe the most common methods used to interpret variants from WGS/WES patient datasets for clinical diagnostics, as well as the statistical methods applied to the analysis of large cohorts of patients with WES/WGS data for finding novel disease genes. To ensure we deliver practical knowledge, we will discuss in some detail specific tools that the presenters have developed, explaining the fundamentals of the algorithms underlying them, how to use them in real use cases, and how these tools compare to other available tools and approaches. Since the scale of the genome datasets keeps growing, it is also important to understand the techniques to make these analyses scalable.

Learning Objectives

At the end of the tutorial, participants will have an understanding of: 1) the challenges of analyzing WES/WGS data for clinical diagnostics and disease association studies; 2) how variant prioritization can be performed probabilistically and why it is superior to empirical filtering schemes; 3) how to take advantage of family structures and phenotype information in these endeavors; 4) the difficulties in the analysis of rare variants for disease gene finding; 5) the typical and most advanced tools for rare variant analysis; and 6) the novel approaches for the analysis of disease cohorts, both for identifying rare variants influencing common disease and for identifying ultra-rare homozygotes with very strong effects.

Intended Audience

The participants of this tutorial will be bioinformaticians, statisticians, or geneticists who anticipate being involved in the analysis of WES/WGS data for either clinical diagnostics or case/control association studies with an emphasis on rare variants. This tutorial will appeal to participants with either an academic or an industry (e.g., pharmaceutical industry or clinical diagnostic labs) background.

Participant requirements

This is a theoretical tutorial, and the only requirements are familiarity with the basics of next-generation sequencing of genomes and exomes, the basics of human genetics, and ideally an understanding of how classical GWAS for common variants work.

Schedule Overview
Timing Presenter Topic
2:00-2:50 pm F. De La Vega Introduction to variant prioritization in Mendelian disease diagnostics
  • Variant annotation and effects
  • Assessment of deleteriousness
  • Leveraging population allele frequencies
  • Variant interpretation schemes
  • Challenges of annotation of small vs structural variants
3:00-3:50 pm G. Wang Analysis of Large-Scale Rare Variant Association Studies
  • Common variant vs rare variant disease susceptibility
  • Rare variant study design and power
  • Rare variant association tests
  • Burden vs variance component tests
  • VAT - quality control and analysis of population-based exome association studies
4:00-4:15 pm Coffee Break
4:15-5:15 pm M. Yandell Discovery of rare and ultra-rare disease variants in case/control and cohort studies
  • VAAST and VVP algorithms
  • Rare and common variants association from WES/WGS with VAAST
  • Power consideration of ratios of cases & controls
  • Challenges in finding Mendelian genotypes embedded in case/control studies
5:15-6:15 pm C. Huff and Yao Yu Rare variant prioritization and association analysis with VAAST, XPAT, PHEVOR, and related tools
  • Rare variant association studies with VAAST
  • Familial studies with pVAAST
  • Cross-platform sequencing association studies with XPAT
  • Leveraging phenotype information with PHEVOR
References

Mendelian disease analysis by WGS/WES

Eilbeck, K., Quinlan, A. & Yandell, M. Settling the score: variant prioritization and Mendelian disease. Nature Publishing Group 1–14 (2017). doi:10.1038/nrg.2017.52

Wright, C. F., FitzPatrick, D. R. & Firth, H. V. Paediatric genomics: diagnosing rare disease in children. Nature Publishing Group 10, 1–16 (2018).

Coonrod, E. M., Margraf, R. L., Russell, A., Voelkerding, K. V. & Reese, M. G. Clinical analysis of genome next-generation sequencing data using the Omicia platform. Expert Rev Mol Diagn 13, 529–540 (2013).

Rare Variant Association Tests

Nicolae, D. L. Association Tests for Rare Variants. Annu. Rev. Genom. Human Genet. 17, 117–130 (2016).

Lee, S., Abecasis, G. R., Boehnke, M. & Lin, X. Rare-Variant Association Analysis: Study Designs and Statistical Tests. The American Journal of Human Genetics 95, 5–23 (2014).

Auer PL et al (2016) Guidelines for Large-Scale Sequence-Based Complex Trait Association Studies: Lessons Learned from the NHLBI Exome Sequencing Project, Am J Hum Genet. 99 (4): 791-801.

Analysis Tools

F. Anthony San Lucas, Gao Wang, Paul Scheet, and Bo Peng (2012) Integrated annotation and analysis of genetic variants from next-generation sequencing studies with variant tools, Bioinformatics 28 (3): 421-422.

Gao Wang, Bo Peng and Suzanne M. Leal (2014) Variant Association Tools for Quality Control and Analysis of Large-Scale Sequence and Genotyping Array Data, The American Journal of Human Genetics 94 (5): 770–83.

Yandell M, Huff C, Hu H, Singleton M, Moore B, Xing J, Jorde LB, Reese MG. A probabilistic disease-gene finder for personal genomes. Genome Res 2011, 21(9):1529-1542.

Singleton M., Guthery SL., Voelkerding KV., Chen K., Kennedy BJ., Margraf RL., Durtschi J., Eilbeck K., Reese MG., Jorde LB., Huff CD., Yandell M. Phevor Combines Multiple Biomedical Ontologies for Accurate Identification of Disease-Causing Alleles in Single Individuals and Small Nuclear Families. Am J Hum Genet. 2014 Apr 3;94(4):599-610.

Flygare S, Hernandez EJ, Phan L, et al. The VAAST Variant Prioritizer (VVP): ultrafast, easy to use whole genome variant prioritization tool. BMC Bioinformatics. 2018;19:57.

Yu, Y. et al. XPAT: a toolkit to conduct cross-platform association studies with heterogeneous sequencing datasets. Nucleic Acids Research 1–11 (2017). doi:10.1093/nar/gkx1280

Di Zhang et al. SEQSpark: A Complete Analysis Tool for Large-Scale Rare Variant Association Studies Using Whole-Genome and Exome Sequence Data. The American Journal of Human Genetics 101, 115–122 (2017).

Links to Tools and Code

VAAST 2.0 and pVAAST at Yandell Lab: http://www.yandell-lab.org/software/vaast.html
PHEVOR 2.0 web service: http://weatherby.genetics.utah.edu/phevor2/index.html
Variant Tools: http://varianttools.sourceforge.net/
XPAT at Huff Lab: http://www.hufflab.org/software/xpat/
Materials and guides at Leal lab: https://statgen.research.bcm.edu/index.php/Tutorials

Presenter Bios

Francisco M. De La Vega, D.Sc. Stanford University School of Medicine & Fabric Genomics, United States Adjunct Professor at the Department of Biomedical Data science of Stanford, and SVP of Genomics at Fabric Genomics. De La Vega is a geneticist and computational biologist with interests in cancer, population, and clinical genomics, and with extensive experience in the life sciences industry. Dr. De La Vega has led the development of new methods and software for the analysis of next-generation sequencing data and has been involved in major population-scale sequencing projects such as the 1000 Genomes Project, the PanCancer Analysis of Whole Genomes project of the ICGC, and standard-setting public-private partnerships such as the NIST Genome-in-a-Bottle Consortium.
Chad Huff, Ph.D., The University of Texas MD Anderson Cancer Center, United States Associate Professor, Department of Epidemiology, The University of Texas MD Anderson Cancer Center. He works on understanding human evolution and the genetic basis of human disease through statistical, computational, and population genomics. His current focus is on developing new methods to analyze genomic data and applying these methods to discover novel insights about the genetic basis of human disease, with particular emphasis on identifying and characterizing genes that increase the risk of developing common cancers.
Suzanne Leal, Ph.D., Baylor College of Medicine, United States Professor in the Department of Molecular and Human Genetics at Baylor College of Medicine and Director of the Center for Statistical Genetics, and also an adjunct Professor in the Department of Statistics at Rice University and a Senior Research Associate at The Rockefeller University. Dr. Leal's interests lie in statistical genetics and genetic epidemiology, and she has worked extensively on developing methods to aid in gene identification and in understanding disease etiology. Her current focus is the development of methods to analyze rare variants. Dr. Leal is also pioneering big-data architectures to process large WES/WGS datasets from case/control studies more effectively.
Mark Yandell, Ph.D., University of Utah, United States Professor of Human Genetics and H.A. and Edna Benning Presidential Endowed Chair at the University of Utah. Dr. Yandell develops computational algorithms and software tools to analyze genomics data and uses these tools to identify disease-causing variants in clinical settings, to understand the molecular basis of gene dysfunction, and to understand evolution. He spent three years at the Genome Sequencing Center at Washington University School of Medicine, St. Louis, and then three years at Celera Genomics, where he led the Annotation Software Research and Development group. Mark has led the development of innovative variant prioritization tools and of novel methods that take advantage of a patient's disease phenotype by leveraging biomedical phenotype ontologies, and more recently has been extending these tools to make them more efficient and applicable to large cohort studies.
Yao Yu, Ph.D., The University of Texas MD Anderson Cancer Center, United States Computational Scientist at the Department of Epidemiology, The University of Texas MD Anderson Cancer Center. His research interests cover a wide range of topics in computational biology, including genetics, genomics, transcriptomics, and metabolomics. He is the lead developer of the Cross-Platform Association Toolkit (XPAT), a suite of tools designed to support and conduct large-scale association studies with heterogeneous sequencing datasets.


Tutorial PM8: Ontologies in computational biology

July 6, 2018, 2:00 pm - 6:00 pm

Room: Columbus KL

Presenters

Michel Dumontier, Maastricht University, Netherlands
Robert Hoehndorf, King Abdullah University of Science and Technology, Kingdom of Saudi Arabia

Overview

Ontologies have long provided a core foundation in the organization of biomedical entities, their attributes, and their relationships. With over 500 biomedical ontologies currently available, a number of exciting new opportunities are emerging in using ontologies for large-scale data sharing and data analysis. This tutorial will help you understand what ontologies are and how they are being used in computational biology and bioinformatics.

Learning Objectives

This is an introductory-level course on ontologies and ontology-based data analysis in bioinformatics. In this tutorial, participants will learn:
- what ontologies are and where to find them
- how to understand and use ontology semantics through automated reasoning
- how to measure semantic similarity
- how to incorporate ontologies and semantic similarity measures in bioinformatics analyses
- recent developments in bio-ontologies
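
As a rough illustration of measuring semantic similarity, the sketch below compares two ontology terms by the Jaccard overlap of their superclass sets. Collecting all superclasses through is-a links is a very simple stand-in for the subclass inferences an automated reasoner would provide. The toy ontology and term names are hypothetical, and Jaccard overlap is only one of many possible measures. The tutorial's own hands-on materials use a Jupyter Notebook with a SciJava kernel rather than this code.

```python
def ancestors(term, parents):
    """All superclasses of `term`, including the term itself, via is-a links."""
    seen = set()
    stack = [term]
    while stack:
        current = stack.pop()
        if current not in seen:
            seen.add(current)
            stack.extend(parents.get(current, ()))
    return seen


def jaccard_similarity(term1, term2, parents):
    """Shared superclasses divided by all superclasses of either term."""
    a = ancestors(term1, parents)
    b = ancestors(term2, parents)
    return len(a & b) / len(a | b)


# Hypothetical toy ontology: each term maps to its direct superclasses.
parents = {
    "phenotype": [],
    "cardiac phenotype": ["phenotype"],
    "arrhythmia": ["cardiac phenotype"],
    "cardiomyopathy": ["cardiac phenotype"],
    "renal phenotype": ["phenotype"],
}

sim_close = jaccard_similarity("arrhythmia", "cardiomyopathy", parents)
sim_far = jaccard_similarity("arrhythmia", "renal phenotype", parents)
```

Terms that share a specific superclass score higher than terms that only share the root, which is the basic intuition behind ontology-based similarity.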

Intended audience:

The tutorial will be of interest to any researcher who will use or produce large structured datasets in computational biology. The tutorial will be at an introductory level, but will also describe current research directions and challenges that will be of broad interest to researchers in computational biology.

Requirements:

The tutorial will contain a hands-on part. If you want to participate (instead of just watching the presentation), please download and install Jupyter Notebook (http://jupyter.org/) with a SciJava kernel. For latest updates on this tutorial, see https://github.com/bio-ontology-research-group/ontology-tutorial

Capacity

50

Presenter Bios

Michel Dumontier, Maastricht University, Netherlands Michel Dumontier is a Distinguished Professor of Data Science at Maastricht University. His research focuses on the development of computational methods for scalable integration and reproducible analysis of FAIR (Findable, Accessible, Interoperable and Reusable) data across scales - from molecules, tissues, organs, individuals, populations to the environment. His group combines semantic web technologies with effective indexing, machine learning and network analysis for drug discovery and personalized medicine. Dr. Dumontier leads a new inter-faculty Institute for Data Science at Maastricht University with a focus on accelerating discovery science, empowering communities, and improving health and well being. He is the editor-in-chief for the IOS press journal Data Science and an associate editor for the IOS press journal Semantic Web. He is the scientific director for Bio2RDF, an open source project to generate Linked Data for the Life Sciences and is a technical lead for the FAIR (Findable, Accessible, Interoperable, Re-usable) data initiative. He has published over 125 articles in top rated journals and international conferences. He is internationally recognized for his contributions in bioinformatics, biomedical informatics, and semantic technologies as evidenced by awards, keynote talks at international conferences, and collaborations on international projects.
Robert Hoehndorf, King Abdullah University of Science and Technology, Kingdom of Saudi Arabia Robert Hoehndorf is an Assistant Professor in Computer Science at King Abdullah University of Science and Technology in Thuwal. His research focuses on the applications of ontologies in biology and biomedicine, with a particular emphasis on integrating and analyzing heterogeneous, multimodal data. Dr. Hoehndorf has developed the PhenomeNET system for ontology-based prioritization of disease genes using model organism phenotypes, and contributed to the development of the AberOWL ontology repository. He is an associate editor for the Journal of Biomedical Semantics, BMC Bioinformatics, and Applied Ontology, and an editorial board member of the IOS Press journal Data Science. He has published over 90 papers in journals and international conferences, and has presented previous tutorials on ontologies and their applications at ISMB, OWL-ED, and ECCB.

ISCB Outstanding Contributions Award

Russ Altman

Professor, Director, Biomedical Informatics Training Program, Stanford University,
Co-Principal Investigator, FDA Center for Excellence in Regulatory Science & Innovation
United States

Presentation Title: TBA



The Outstanding Contributions to ISCB Award recognizes an ISCB member for her or his outstanding service contributions toward the betterment of ISCB through exemplary leadership, education, and service.

This award debuted in 2015, and the 2018 winner is Russ Altman.

Biography:

Russ Altman is a professor of bioengineering, genetics, medicine, and biomedical data science (and of computer science, by courtesy) and past chairman of the bioengineering department at Stanford University. Altman received an A.B. from Harvard College in 1983, a Ph.D. in medical information sciences from Stanford in 1989 and M.D. from Stanford Medical School in 1990. He also became board certified in 1991 in internal medicine and in clinical informatics.

Altman was on the ISCB Board of Directors from 1997 to 2005, and was the ISCB president from 2002 to 2005. He has provided service to the ISCB membership through his leadership in establishing and helping to organize the annual Pacific Symposium on Biocomputing. Altman is the Editor of the Journal of Biomedical Informatics (since 2009), and he is a current member of the editorial boards for many major journals in bioinformatics, including Bioinformatics and PLOS Computational Biology. He served on the steering committee for the IEEE-ACM Transactions on Computational Biology (TCBB) from 2009 to 2011. He is also an executive editor of Biomedical Computation Review, which covers the latest research wherever computation, biology, and medicine intersect.

Altman serves on the Advisory Committee to the NIH Director, Francis Collins (since 2012) and was Chair of the Science Board to the FDA Commissioner (2013-2014). He is a member of the National Academy of Medicine (formerly the Institute of Medicine), Fellow of ISCB, Fellow of AAAS, Fellow of the American College of Physicians, Fellow of the American College of Medical Informatics, and Fellow of the American Institute of Medical and Biological Engineering. He is also the winner of the PECASE award.

ISCB Innovator Award Keynote

M. Madan Babu

Programme Leader, MRC Laboratory of Molecular Biology,
Cambridge, United Kingdom

http://mbgroup.mrc-lmb.cam.ac.uk/
https://www2.mrc-lmb.cam.ac.uk/group-leaders/a-to-g/m-madan-babu/

Presentation Title: How Does Protein Disorder Enable Phenotypic Diversity?
Time: Monday July 9, 8:30 am - 9:30 am
Room: Grand Ballroom C-F



2016 marked the launch of the ISCB Innovator Award, which is given to a leading scientist who is within a decade and a half of receiving her or his PhD degree, has consistently made outstanding contributions to the field, and continues to forge new directions. M. Madan Babu is the 2018 winner of the ISCB Innovator Award.

Abstract:

Understanding how the amino acid sequence of a protein contributes to its function (sequence–function relationship) is a fundamental problem of long-standing interest. In the 1960s, Christian Anfinsen postulated that the amino acid sequence of a protein determines its three-dimensional structure, which in turn determines its function. This work laid the foundation for the sequence–structure–function paradigm. However, a class of polypeptide segments called intrinsically disordered regions does not conform to this postulate. In this presentation, I will first describe established and emerging ideas about how disordered regions contribute to protein function. I will then discuss molecular principles by which regulatory mechanisms, such as alternative splicing and asymmetric localization of transcripts that encode disordered regions, can increase the functional versatility of proteins. I will also present IDR-Screen, which is a high-throughput experimental and computational approach for discovering functional disordered regions in a biologically relevant context and identifying features of functional sequences through statistical learning. Finally, I will discuss how disordered regions contribute to human disease and the emergence of cellular complexity during organismal evolution.

Biography:

M. Madan Babu is a Group Leader at the MRC Laboratory of Molecular Biology, Cambridge, UK. He obtained his undergraduate degree in 2001 from the Centre for Biotechnology, Anna University, India with fellowships from the Indian Institute of Science and the Indian Academy of Sciences. He then received an LMB-Cambridge International Fellowship and a Trinity College Research Scholarship to carry out his doctoral research at the Medical Research Council’s Laboratory of Molecular Biology (MRC-LMB) in Cambridge, UK.

Babu’s research group aims to gain a detailed understanding of how regulation is achieved at distinct levels of organization in cellular systems by placing a particular emphasis on understanding how the precise structure and intrinsically disordered regions of proteins contribute to cellular regulation. Specifically, he investigates regulation at three levels of organization: molecules, processes and genomes. At the molecular level, Babu aims to discover novel features of regulatory and signalling proteins. At the process level, he aims to understand how the different regulatory mechanisms contribute to cellular homeostasis. At the genome level, he studies the interplay between regulation and genome evolution.

Babu's work has also been recognized with national and international awards, including the most recent Blavatnik Awards Life Sciences Laureate (2018), Francis Crick Medal and Lecture from the Royal Society (2015), Protein Society Young Investigator Award (2014), Lister Prize (2014), Biochemical Society of UK’s Colworth Medal (2013), Royal Society of Chemistry’s Molecular BioSystems Award (2011), British Genetics Society’s Balfour Award (2011), and the EMBO Young Investigator Award (2010). Madan is an executive editor of Nucleic Acids Research, an elected member of EMBO (2016), and a Fellow of the Royal Society of Chemistry (2017).

ISCB Overton Prize Keynote

Cole Trapnell

Assistant Professor, Department of Genome Sciences, University of Washington
United States

Presentation Title: Reconstructing and deforming developmental landscapes
Time: Saturday July 7, 8:30 am - 9:30 am
Room: Grand Ballroom C-F



The Overton Prize recognizes the research, education, and service accomplishments of early to mid-career scientists who are emerging leaders in computational biology and bioinformatics. The Overton Prize was instituted in 2001 to honor the untimely loss of G. Christian Overton, a leading bioinformatics researcher and a founding member of the ISCB Board of Directors. Cole Trapnell is being recognized as the 2018 winner of the Overton Prize.

Abstract:

Developing embryos are comprised of highly plastic individual cells that shift from one functional state to another, often reversibly so. A cell executes a different gene expression program for each of its possible roles, switching between them as needed throughout its life. How does the genome encode the developmentally intended sequence of program switches? Which gene regulatory events are crucial for a given cell fate decision? Quantifying each gene’s contribution in governing even one developmental step is a staggeringly difficult challenge. However, massively scalable single-cell transcriptome and epigenome profiling offers a way to quantitatively dissect developmental regulatory circuits. I will discuss new assays and algorithms developed by my laboratory to realize this goal, and offer some lessons from several of our recent projects.

Biography:

Cole Trapnell is an Assistant Professor in the Department of Genome Sciences at the University of Washington. Trapnell received his bachelor’s degree and PhD in Computer Science from the University of Maryland. As a graduate student, he was co-advised by Steven Salzberg and by Lior Pachter of the University of California, Berkeley, where he spent several years as a visiting student. While working with Salzberg and Pachter, Trapnell wrote TopHat and Cufflinks, and assisted Ben Langmead with Bowtie.

Dr. Trapnell studies stem cells and differentiation, primarily using high throughput transcriptome sequencing. He is the principal developer of several widely used open-source software tools for analyzing high-throughput sequencing experiments. At the University of Washington, his lab focuses on finding genes that govern stem cell maintenance and cell differentiation, primarily through single-cell genomics.

ISCB Accomplishments by a Senior Scientist Award Keynote

Ruth Nussinov

Senior Principal Investigator, National Cancer Institute, National Institutes of Health, United States;
Professor, School of Medicine, Department of Human Genetics, Tel Aviv University, Israel

Presentation Title: A woman’s computational biology journey
Time: Tuesday July 10, 5:00 pm - 6:00 pm
Room: Grand Ballroom C-F



The ISCB Accomplishments by a Senior Scientist Award recognizes leaders in the fields of computational biology and bioinformatics for their significant research, education, and service contributions. Ruth Nussinov is being honored as the 2018 winner of the Accomplishments by a Senior Scientist Award.

Abstract:

From the dynamic programming algorithm to fold RNA, to unraveling the hallmarks of oncogenic signaling, it has been a long and fascinating journey which aspired to tackle significant and pressing questions where computational biology can make a difference. My adventures began when revolutionary sequencing methods produced the first long DNA sequences with the development of an efficient algorithm to fold RNA, followed by pioneering bioinformatic DNA sequence analyses. They continued with the principles of protein-protein interactions and harnessing the unraveled interface architectures for prediction, and proposing fundamental biophysical principles based on a dynamic view of protein conformational ensembles. This view led us to suggest that all (dynamic) proteins are allosteric, and the universal “conformational selection and population shift” mechanism in molecular recognition, replacing the text-book “induced-fit” paradigm. Finally, my inspirational journey confronts oncogenic Ras signaling, a problem at the center of the NCI initiative, which will be the focus of my talk.

Biography:

Ruth Nussinov is the Senior Principal Scientist and Principal Investigator at the National Cancer Institute, National Institutes of Health, and a Professor in the Department of Human Genetics, School of Medicine at Tel Aviv University. Nussinov received her B.Sc. in Microbiology from the University of Washington in 1966, her M.Sc. in Biochemistry from Rutgers University in 1967, and her Ph.D. in Biochemistry from Rutgers in 1977.

Besides her work on nucleic acid secondary structure prediction, Nussinov is also regarded as a pioneer in DNA sequence analysis for her work in the early 1980s. Nussinov’s algorithm for the prediction of RNA secondary structure is still the leading method. She proposed ‘Conformational Selection and Population Shift’ as an alternative to the textbook ‘Induced-Fit’ model in molecular recognition. Her recent studies unveiled the key role of allostery under normal conditions and in disease and the principles of allosteric drug discovery.

Dr. Nussinov serves as the Editor-in-Chief of PLOS Computational Biology and is an elected Fellow of the Biophysical Society and the International Society for Computational Biology. She is a Highly Cited Researcher (ranking among the top 3000 researchers or 1% across all fields according to Thomson Reuters Essential Science Indicators, http://highlycited.com/ December 2015), a mark of exceptional impact.

She also won an award from the AACR in 2017 for her paper on The Key Role of Calmodulin in KRAS-Driven Adenocarcinomas.

Beware ISMB 2018 Conference Hotel Room Scam

For those of you interested in attending the ISMB 2018 conference in Chicago, please be aware of a potential scam being perpetrated by a company called Exhibition Hotel Management. They are representing themselves as the Hyatt Regency Chicago and offering discounted room rates, and they should not be used. They will pressure you into making your room reservation and then charge you immediately for their services. They are not endorsed by or affiliated with ISMB 2018.

onPeak is the only official hotel provider associated with the conference. Hotel reservations can be secured online or by calling them directly - more information can be found at https://www.iscb.org/ismb2018-accommodation. onPeak will not call you to solicit reservations.

ISMB OFFICIAL HOTEL WEBSITE

Workshops

Attention Presenters - please review the Speaker Information Page available here
Bioinformatics training in the FAIR era
Date: Saturday, July 7, 2018, 10:15 am - 12:40 pm Room: Columbus EF
Organized by the ISCB Education Committee:

Dr. Annette McGrath is a principal research scientist at CSIRO Data61, Australia. She has been actively involved in developing and delivering national bioinformatics training programs in Australia. She is an executive on GOBLET and ABACBS and a member of the ISCB Education Committee.

Dr. Michelle D. Brazas is the Senior Program Manager for Adaptive Oncology at the Ontario Institute for Cancer Research. She was previously the lead for the Canadian Bioinformatics Workshops (bioinformatics.ca) and Manager of Bioinformatics Education at OICR. She is also an executive on GOBLET and a member of the ISCB Board of Directors and Education committees.

Presentation Overview:

In recent years, there has been a growing focus on the necessity of making published research data easier to discover and reuse for subsequent analyses by other researchers. This is not limited to the life sciences. Open data sharing is a core principle of many public research funding bodies worldwide. Discovery and accessibility of research data are essential to enable others to perform downstream analyses and integration of data. This means that research data can generate value in the research community far beyond the original author's lab and focus. International efforts culminated in the publication of the FAIR Data Principles in 2016. FAIR stands for “Findable, Accessible, Interoperable and Reusable”. These principles act as guidelines for best practices in data stewardship for those who wish to enhance the discoverability and reusability of their research data. They have received worldwide recognition by organisations such as FORCE11, NIH, ELIXIR and the European Commission as a useful framework to maximise data sharing, use and reuse.

With the steep drop in the cost of generating data, life scientists are generating ever-increasing amounts of data via next generation sequencing and other activities. Training life scientists in the analysis of these datasets, particularly sequencing data, is a core activity for a large proportion of the bioinformatics training and education community. As a community, we spend a great deal of time teaching people how to analyze data using specific tools and best-practice workflows for these types of data. However, it is commonplace for some researchers to take the resulting gene sets or conclusions forward to further experiments but give little further thought to the raw data from which they gained these insights.

Against the backdrop of reusable data and good data stewardship practices, are our training programs keeping pace with the changing landscape? Are bioinformatics trainers aware of these international initiatives in data use and reuse? As trainers are we equipping our students to make the most of their own data by understanding both the value of their own data and its potential value to others? What steps are we taking to help trainees better manage and value their data?

Through a series of presentations showcasing current practices and identifying future needs in data practices that enable reuse of research data within the life science community, this workshop aims to highlight how we as a community can be most effective in bringing best practices in data management to trainees and students in educational environments.

This workshop will consist of three presentations on topics ranging from the basics of FAIR principles and how we can apply them to bioinformatics training programs, to examples of application and how we can further FAIR principles by teaching workshops on them.

Bioinformatics Core Workshop:
Date: Saturday, July 7, 2018, 2:00 pm - 4:00 pm Room: Columbus EF
Organizers

Madelaine Gogol, Stowers Institute, United States
Hemant Kelkar, UNC-Chapel Hill, United States
Alastair Kerr, University of Edinburgh, Scotland
Brent Richter, Partners HealthCare of Massachusetts General and Brigham and Women’s Hospitals, United States
Alberto Riva, University of Florida, United States

Presentation Overview:

The bioinformatics core workshop is a workshop by practitioners and managers of Core Facilities for all members of core facilities, including scientists, engineers, analysts, operations and management staff. In this 15th year of bringing the Core community together at ISMB, we will explore three topics relevant to bioinformatics core facilities in depth, through lightning talks that broadly explore each area followed by small-group breakout discussions, with insights brought back to the full audience for further discussion and knowledge sharing.

We have partitioned this 2-hour workshop into four sections: three sections devoted to lightning talks (Parts A, B, C) introducing the topic areas, and a longer, in-depth fourth section (Part D) that further explores the topic areas within breakout sessions and full-audience discussions.

Part A: Strategies for Hiring, Recruiting, and Interviewing new bioinformaticians

Section Description
Methods to find, interview and hire highly successful staff and bioinformaticians for a core facility. Speakers will introduce experiences and challenges, including finding and hiring people, interview techniques and questions, and best practices for recruiting candidates.

Part B: Containerization, Clouds, and Workflows

Section Description
Topics to be covered include cloud infrastructure recommendations and limitations, key datasets of value hosted in the cloud, containerization technology that works, and workflow tool development and results.

Part C: When good experiments go bad: Negotiating experiment quality failures

Session Description
A non-exhaustive survey of methods and successes in detecting failures and exploring guidelines for terminating bad projects.

Part D: Small group discussion

Session Description
During this longer session, audience members will divide into groups based on their own interests. Groups will come up with their main takeaway points and bring them back to the main audience for knowledge sharing and further discussion. Topics may include all previous presentation areas as well as other areas of interest to running or working within a bioinformatics core facility, such as single cell analysis or long read analysis.

Schedule
