ISMB/ECCB 2011 Posters

19th Annual International Conference on
Intelligent Systems for Molecular Biology and
10th European Conference on Computational Biology

Accepted Posters

Category 'P'- Ontologies'

Poster P01

Benchmarking Ontologies: Bigger or Better?

Lixia Yao Columbia University

Andrey Rzhetsky (University Of Chicago, Dept of Medicine); Anna Divoli (University Of Chicago, Dept of Medicine); Ilya Mayzus (University Of Chicago, Dept of Medicine); James Evans (University Of Chicago, Sociology Department);

Short Abstract: An ontology represents the concepts and their interrelation within a knowledge domain. Many ontologies have been developed in biomedicine, providing standardized vocabularies to describe genes and proteins, anatomical structures, physiological phenotypes or diseases, and many other phenomena. Scientists use them to encode observations and experimental results, and to perform integrative analysis to discover new knowledge. A remaining challenge is to evaluate how well an ontology represents the underlying knowledge domain. We introduce a family of metrics, including breadth and depth, to capture the conceptual and relational coverage and parsimony of an ontology. We test these measures using four commonly used medical ontologies and seven popular English thesauri (ontologies of synonyms) with respect to text from medicine, news and novels. Results demonstrate that both medical ontologies and English thesauri have a small overlap in concepts and relations, and suggest further efforts to tighten the fit between ontologies and biomedical knowledge domain.

Long Abstract: Click Here

Poster P02

Functional coherence assessment for protein groups and its application to pathway assignment

Meghana Chitale Purdue University

Daisuke Kihara (Purdue University, Departmen of Biological Sciences, Department of Computer Science); Shriphani Palakodety (Purdue University, Department of Computer Science);

Short Abstract: With the availability of large scale omics data, computational function elucidation of proteins needs to take a step further to understand the systems level interplay of proteins. Traditional analyses had mainly focused on analyzing single gene function at a time, but to understand the governing biological principles behind the functional units, we would like to identify functionally coherent groups of proteins working in co-ordination.

We have developed and analyzed two coherence measures for identifying biologically relevant protein groups. Our scores quantify functional coherence of proteins by considering association of Gene Ontology (GO) terms observed in two actual biological contexts, co-occurrences in protein annotations and co-mentions in literature. The first score is named the Co-occurrence Association Score (CAS) while the second one is named the PubMed Association Score (PAS). Importantly, functional coherence is not identical with functional similarity because proteins with different function work for a same pathway in a coordinated fashion. Our scores have shown to have the superior ability for accurately discriminating biologically relevant protein groups, such as proteins in the same pathway, proteins in the same complex, and proteins which co-localize in a cell when compared with the other state-of-the-art methods. Our scores were also effectively applied to detect physically interacting pairs of proteins. Remarkably, our scores were successful in assigning 90-94% of proteins to correct pathways within the top ranks based on their coherence.

Poster P03

A Modular Annotation Framework for Mathematical Models

Michael Cooling University of Auckland

Short Abstract: The last ten years has seen the development of model encoding schemes for documenting and sharing mathematical models. Now hundreds of published computational biology models are available for free, online, encoded in languages such as the XML-based CellML.
Having succeeded in availability, a next challenge is to increase utility through the addition of semantic meaning to model code. The kinds of metadata that would be immediately useful include: biological (allowing computer-assisted model composition from smaller models, and intelligent search and retrieval of repositories), authorship (so one knows who to credit, and/or contact with queries), citation (describing the connections between the model and academic works), licensing and general comments, among others.
In collaboration with the CellML team (http://www.cellml.org/community/team), I have developed a metadata specification framework to better enable the annotation of CellML models with metadata. The framework consists of a core specification describing, in general terms, how annotations should be attached using RDF/XML, and how satellite specifications drawing on existing orthogonal ontologies should be described. Satellite specifications for the kinds of annotation described above have been developed using elements from the Dublin Core, FOAF (Friend-Of-A-Friend), BIBO (Bibliographic Ontology) and Biomodels Qualifiers. The development of this framework allows the attribution of semantic meaning to sharable, reusable mathematical models

Poster P04

An ontology of bioinformatics tool functions, data types and formats

Jon Ison EMBL-EBI

Peter Rice (EMBL-EBI, Rice Group); Matus Kalas (University of Bergen, Bergen Center for Computational Science); James Malone (EMBL-EBI, Functional Genomics Team); Hamish McWilliam (EMBL-EBI, External Services); Steve Pettifer (The University of Manchester, Advanced Interfaces Group);

Short Abstract: Advances in the discovery, comparison and connection of bioinformatics resources require consistent machine-understandable descriptions of the underlying tools and data. A comprehensive controlled vocabulary of common concepts that is broadly applicable is therefore urgently required.

EDAM is a simple ontology of bioinformatics tool functions, data types, formats and identifiers. It provides a controlled vocabulary of over 2000 defined terms for annotating diverse entities such as Web services, standalone tools, data schema, data objects and databases. EDAM provides coarse coverage of Web services and data from BioCatalogue and is ready for pilot annotations.

EDAM is available in OBO (Open Biomedical Ontologies) format from http://sourceforge.net/projects/edamontology/files/. For documentation and license see http://edamontology.sourceforge.net/.

Poster P05

Multiple Ontology Based System for Integrating and Display Rat Phenotype Data

Mary Shimoyama Medical College of Wisconsin

Rajni Nigam (Medical College of Wisconsin, Rat Genome Database, Human and Molecular Genetics Center); Melinda Dwinell (Medical College of Wisconsin, Rat Genome Database, Human and Molecular Genetics Center);

Short Abstract: The strength of the rat as a model organism lies in its utility in pharmacology, biochemistry, disease, and physiology research. Data resulting from such studies is difficult to represent in databases and the creation of user-friendly data mining tools has proven difficult. Typically clinical measurement data for model organisms such as the rat is housed by experiment or combined only when researchers follow identical protocols. This prevents researchers from accessing and analyzing the vast amount of valuable phenotype data that has been generated for the rat. To address this issue, multiple ontologies and data formats were developed at the Rat Genome Database to standardize this data. The ontologies include the Clinical Measurement Ontology, Measurement Method Ontology, Experimental Condition Ontology and the Rat Strain Ontology. All are available at RGD (rgd.mcw.edu) or the National Center for Biomedical Ontology (bioportal.bioontology.org/). These ontologies and accompanying data fields were used to create standardized formats for the major components of phenotype records: who was measured, what was measured, how was it measured and under what conditions was it measured. The resulting PhenoMiner system (rgd.mcw.edu/phenotypes/) consists of three components. The first is an ontology based database which has incorporated more than 14,000 records from multiple sources. A second component consists of data mining and visualization tools and the third component is the ontology based curation software for creating and editing the data. The PhenoMiner system provides an outstanding resource for the rat research community.

Poster P06

ontoCAT R package for ontologies traversal and search

Natalja Kurbatova European Bioinformatics Institute

Natalja Kurbatova (European Bioinformatics Institute, Functional Genomics Group); Tomasz Adamusiak (European Bioinformatics Institute, Functional Genomics Group); Misha Kapushesky (European Bioinformatics Institute, Functional Genomics Group);

Short Abstract: An R package called ''ontoCAT'' provides a simple interface to any ontology described in OWL or OBO formats. The ontoCAT R package supports these two formats widely used for ontology creation and provides unified, format independent access to the ontology terms and ontology hierarchy.

Our package's focus is the basic methods that allows ontology traversal and search: search of the terms, getting paticular term's parents and/or children, showing paths to the term from the root element of the ontology, showing flat tree representations of the ontology hierarchy, list and search of relations, etc.

Additional functionality of the ontoCAT R package allows to work with group of ontologies and with the major ontology repositories like OLS and Bioportal, namely search terms within batch of ontologies, list available ontologies and load selected ontology for futher analysis when needed.

The ontoCAT R package has been used in a number of projects at EBI and has proven its usefulness.

Considering all mentioned above we believe the ontoCAT R package can be extermly useful for the bioinformatics society working in the R environment.

The package is based on the Ontology Common API Tasks Java library (http://www.ontocat.org).

We are providing two versions of the ontoCAT R package:
- Light ontoCAT package version is available in Bioconductor starting from the release 2.7. It includes all functionality except methods to work with batch of ontologies, since Bioconductor package size is limited with 2 MB.
- Full ontoCAT package version includes batch methods and is available from the ontoCAT website http://www.ontocat.org/wiki/r

Poster P07

What does semantic similarity mean in terms of protein function comparison? The analysis of relation between functional and structural similarity.

Bogumil Konopka Wroclaw University of Technology

Tomasz Golda (Wroclaw University of Technology, Faculty of Fundamental Problems of Technology, Institute of Biomedical Engineering and Instrumentation); Pawel Wozniak (Wroclaw University of Technology, Faculty of Fundamental Problems of Technology, Institute of Biomedical Engineering and Instrumentation); Malgorzata Kotulska (Wroclaw University of Technology, Faculty of Fundamental Problems of Technology, Institute of Biomedical Engineering and Instrumentation);

Short Abstract: Functions of a substantial number of proteins in wwPDB and the Uniprot databases are represented by Gene Ontology terms. The hierarchical structure of GO allows to quantify the similarity of protein functions by applying semantic similarity algorithms. Their result can be used to discover evolutionary relationships between proteins or aid proper protein classification. However its usefulness is limited since relation between the structural similarity and functional similarity of proteins, described by GO terms, has not been yet thoroughly investigated. The aim of this work is to define the exact meaning of protein function similarity given by measures of semantic similarity.
In order to estimate the actual value of a result of functional comparison between two proteins, we calculated the similarities between sets of GO terms randomly chosen from the ontology. We also calculated pairwise similarities between randomly chosen proteins. We propose a Z-score statistic to measure the significance of pairwise similarities by relating the score to the distribution of results of those random comparisons. Using the Z-score metric we ran a systematic analysis of functional similarity of proteins within SCOP classes at different levels of structural similarity.
Results of functional comparison of random proteins show a distribution that could be approximated with an extreme values distribution. The study showed that on the same level of structural similarity there are SCOP classes where function is better conserved. This heterogeneity should be taken into consideration in development of techniques that benefit from the structure?function relationship.

Poster P08

A Semantic Web Infrastructure for Bioinformatics of Staphylococcus aureus

Cândida Delgado Instituto de Tecnologia Química e Biológica

Jonas Almeida (University of Alabama at Birmingham, Department of Pathology); Maria Miragaia ( Instituto de Tecnologia Química e Biológica, Laboratório de Genetica Molecular); Hermínia de Lencastre (The Rockefeller University, Laboratory of Microbiology); Study Group on Molecular Epidemiology of Staphylococcus (Instituto de Tecnologia Química e Biológica, Laboratório de Genetica Molecular); Helena F Deus (Digital Enterprise Research Institute, Health Care and Life Sciences Domain);

Short Abstract: Staphylococcus aureus, a leading cause of human bacterial infections worldwide, can cause a wide range of diseases. In Europe, hospital-acquired methicillin-resistant Staphylococcus aureus infections are estimated to affect more than 150,000 patients annually. Although much is known about this pathogen, the efficient sharing and linking of knowledge is still hampered by scarce bioinformatics infrastructures.
Linked Data consists of a set of best practices for enabling standardized access and query over aggregated data. This integrative potential can be employed in the development stage of bioinformatics apparatus that support molecular epidemiology research. Furthermore, recent developments have made JavaScript a powerful prototype-based object-oriented scripting language, ideal for web-based user interfaces. In this report we exploit the Simple Sloppy Semantic Database (S3DB), as the basis for an extensible, usage-oriented, bioinformatics infrastructure geared towards the study of Staphylococcus aureus epidemiology.
The key outcomes of this work are an Ontology model for Staphylococcus epidemiology and a JavaScript system designed to simplify visualization and interpretation of high-throughput datasets submitted to S3DB (S3DB Ontology Analytics). Both were primarily designed based on requirement assembly and regular evaluation by its ultimate users, the biology experts. A core infrastructure that will integrate multiple views over the same ontologies and support user-friendly interaction with the dataset (viewing, editing or creating) is under development. This system will be reusable in various molecular domains, including cancer research, as a model for publishing and consuming Linked Data. This design model has been shown to be proficient in the management of multiple biomedical datasets.

Poster P09

A standard for cooperative interactions

Kim Van Roey European Molecular Biology Laboratory

Toby Gibson (European Molecular Biology Laboratory, Structural and Computational Biology Unit);

Short Abstract: Cells must continuously monitor external and internal cues, integrate the wide variety of signals they perceive, and translate these inputs into proper outputs. This process of signal transduction depends on the formation of large, dynamic macromolecular complexes. Assembly of these signaling complexes depends on multiple transient, low-affinity interactions. Many of these interactions are mediated by short linear peptide motifs, some of which can be additionally regulated by post-translational modifications, and have been observed to be highly cooperative. Such cooperative interactions provide the dynamic plasticity that is required for cells to integrate multiple input signals, robustly and reliably transmit information, and rapidly generate appropriate responses.

Despite the importance of cooperative interactions in molecular biology, formalisms that can adequately capture and represent such interactions are currently lacking. By means of a literature survey, we will collect cooperative interaction data and use it to define classes of information to be stored. From this analysis, we will develop a data exchange format and create a controlled vocabulary that can describe the relevant features of the collected data set. Once we can computationally describe cooperative interactions, a relational database will be developed, and a prototype will be populated with cooperative interaction data extracted from the literature. As a guideline for experimentalists to unambiguously describe cooperative interactions, the Minimum Information About a Cooperative Interaction (MIACI) will be defined. Together, these tools will facilitate the systematic capture, comparison, exchange and verification of cooperative interaction data.

Poster P10

Mining Gene Ontology data with AGENDA

Guvanch Ovezmyradov University of Goettingen

Martin Goepfert (University of Goettingen, Cellular neurobiology);

Short Abstract: With the emergence of novel genetic techniques and the exponential accumulation of genomic data, the need for powerful bioinformatics tools is growing accordingly. The Gene Ontology (GO) database is one of the most essential information sources in biology. Here we present AGENDA (Application for mining Gene Ontology data), a novel web-based tool that provides practical access to the GO database. AGENDA can be used to efficiently explore genetic information within and across species. The application comes with a user friendly interface and offers novel possibilities for the user. AGENDA provides diverse search options and offers possibilities to bookmark or download the results. The program undergoes active development to suit the needs of the research community. We anticipate that this tool will provide novel insights into the genetic basis of diverse biological processes and facilitate comparisons in-between. AGENDA is open source software that is freely available for non-commercial use. AGENDA can be queried online at http://bioagenda.uni-goettingen.de.

Poster P11

BioPAX – A Community Standard for Pathway Data Sharing

Nadia Anwar Memorial Sloan-Kettering Cancer Center

Emek Demir (MSKCC, cBio); Gary Bader (University of Toronto, The Donnelly Centre); Igor Rodchenkov (University of Toronto, The Donnelly Centre); Ozgun Babur (MKSCC, cBio); Chris Sander (MKSCC, cBio);

Short Abstract: The BioPAX ontology (www.biopax.org) is a standard language for formally representing biological pathways and is available to the biological community to enable exchange and integration of pathway data. BioPAX level 3 supports the representation of metabolic pathways, signal transduction pathways, protein-protein interaction networks, gene regulatory networks and genetic interactions.

Data exchange and integration continues to be a challenge given the complex nature of both pathway data and data sources. Biological pathways are constructs that biologists use to represent relationships between and within chains of cellular events. BioPAX was developed to address these issues and to ease the access, use, exchange and aggregation of pathway data.

This poster will outline data representation in BioPAX and the use of the BioPAX ontology in integration, analysis and visualization of pathway data, which enables more efficient use and reuse of these data. We will outline progress on the further work highlighted by the BioPAX community, including: visualization and exchange of co-ordinates, semantic web applications, enhancing the use of external taxonomies and controlled vocabularies within BioPAX, aligning BioPAX to SBML for quantitative modeling and the continued development of the core ontology to support novel data.

Poster P12

Infer Novel Genes and Pathways that Modulate Sphingolipid Pathway from a Novel Yeast Gene Network Derived from Ontology Fingerprints

Tingting Qin Medical University of South Carolina

Wenjin Zheng (Medical University of South Carolina) Tingting Qin (Medical University of South Carolina, Biochemistry); Lam Tsoi (University of Michigan, Biostatistics); Nabil Matmati (Medical University of South Carolina, Biochemistry & Molecular Biology); Bidyut Mohanty (Medical University of South Carolina, Biochemistry & Molecular Biology); Andrew Lawson (Medical University of South Carolina, Medicine); Yusuf Hannun (Medical University of South Carolina, Biochemistry & Molecular Biology);

Short Abstract: We integrated biomedical literature, ontology, network analysis and experimental investigation to infer novel genes that can modulate yeast sphingolipid pathway. Such modulations may play important roles in regulating the cellular functions of bioactive lipids. We first constructed novel gene networks by performing pairwise comparison of all yeast genes’ Ontology Fingerprints—a set of ontology terms overrepresented in the PubMed abstracts linked to a gene along with their corresponding enrichment p-value. The comparison generated a weighted undirected gene network where genes are nodes and the similarity scores between genes are weighted edges. The network was further refined by applying a Bayesian hierarchical model to distinguish biologically relevant connections from those that are not. To infer novel genes potentially involved in sphingolipid pathway modulation, we identified a subnetwork of well known yeast sphingolipid genes together with non-sphingolipid genes that have no “sphingo” prefix or “ceramide” in PubMed abstracts and descriptions associated with these gene. These non-sphingolipid genes are considered as candidate genes that can modulate sphingolipid pathway. We tested the Myriocin sensitivity of the deletion strains of the top and bottom ranked candidate genes. The proportion of top ranked candidates genes whose deletion showing altered sphingolipid pathway activity (Myriocin sensitivity) is significantly higher than that among lower ranked ones, and the lipidomic profiles of these deletion strains are significantly different from that of wide type. Our novel network analysis provides a powerful tool to study pathway modulation and can be applied to study human disease related pathways.

Accepted Posters

Preparing your Poster - Information and Poster Size
Poster Schedule
Vienna Poster Printing Services
Poster Categories
Search for a Poster

Attention Poster Authors: The ideal poster size should be max. 1.30 m (130 cm) high x 0.90 m (90 cm) wide. Fasteners (Velcro / double sided tape) will be provided at the site, please DO NOT bring tape, tacks or pins. View a diagram of the the poster board here

Posters Display Schedule:

Odd Numbered posters:

Set-up timeframe: Sunday, July 17, 7:30 a.m. - 10:00 a.m.
Author poster presentations: Monday, July 18, 12:40 p.m. - 2:30 p.m.
Removal timeframe: Monday, July 18, 2:30 p.m. - 3:30 p.m.*

Even Numbered posters:

Set-up timeframe: Monday, July 18, 3:30 p.m. - 4:30 p.m.
Author poster presentations: Tuesday, July 19, 12:40 p.m. - 2:30 p.m.
Removal timeframe: Tuesday, July 19, 2:30 p.m. - 4:00 p.m.*

* Posters that are not removed by the designated time may be taken down by the organizers and discarded. Please be sure to remove your poster within the stated timeframe.

Delegate Posters Viewing Schedule

Odd Numbered posters:
On display Sunday, July 17, 10:00 a.m. through Monday, June 18, 2:30 p.m.
Author presentations will take place Monday, July 18: 12:40 p.m.-2:30 p.m.

Even Numbered posters:
On display Monday, July 18, 4:30 p.m. through Tuesday, June 19, 2:30 p.m.
Author presentations will take place Tuesday, July 19: 12:40 p.m.-2:30 p.m

Want to print a poster in Vienna - try these options:

Repacopy- next to the congress venue link [MAP]

Also at Karlsplatz is in the Ring Center, Kärntner Str. 42, link [MAP]

If you need your poster on a thicker material, you may also use a plotter service next to Karlsplatz: http://schiessling.at/portfolio/

View Posters By Category

Search Posters:

↑ TOP