ISMB/ECCB 2011 Posters

19th Annual International Conference on
Intelligent Systems for Molecular Biology and
10th European Conference on Computational Biology

Accepted Posters

Category 'F'- Evolution'

Poster F01

Multi-species integrative biclustering

Richard Bonneau New York University

Short Abstract: A key challenge in the analysis of functional genomics data is the identification of modules of genes with similar regulatory controls, a non-trivial problem due to the complexity of regulatory networks. Recent works that compare functional genomics datasets for closely related species reveal that many co-regulated gene groups are conserved across several species. This suggests that comparative analysis of multiple-species functional genomics datasets could prove powerful in accurately identifying those conserved modules.
We describe an extension of our cMonkey algorithm that allows for the simultaneous biclustering of heterogeneous data collections spanning multiple species. We present results from the multi-species biclustering of a group of Gram positive bacteria containing Bacillus subtilis, Bacillus anthracis, and Listeria monocytogenes. We identify conserved groups of orthologous genes, yielding evolutionary insights into the formation and surprisingly high degree of conservation of regulatory modules across these three species. We report a temporal difference between the two Bacillus species in the expression of a conserved bicluster of metabolic genes required for spore formation. In addition, we discuss the unexpected identification of a highly expressed flagellum assembly bicluster in non-motile B. anthracis. Analysis of biclusters obtained revealed a large number of gene groups with conserved modularity and high biological significance as judged by several measures of cluster quality. We show that the method provides a framework that allows data and insights from well-studied organisms to complement the analysis of related but less well studied organisms.

Long Abstract: Click Here

Poster F02

Evolutionary comparisons between sequential datasets with horizontal dependencies infer tumor evolution

Roland Schwarz University of Cambridge

Short Abstract: In Schwarz et al 2010 we showed that rational kernels based on
finite-state transducers can perform pairwise sequence comparisons
with horizontal dependencies and extract accurate distance information
from divergent sets of sequences. We demonstrated the power of our
approach on the ambiguous phylogeny of the Chlorophyceae and are
currently extending it to infer tumor evolution within a single cancer
patient.

Copy number variation profiling is a powerful tool for measuring the
heterogeneity in cancer samples directly linked to patient
survival. Reconstructing the evolutionary history of a cancer within
one patient allows to discover driver events for resistance
development. We have obtained multiple Ovarian cancer samples from the
same patient and have developed a new distance measure based on the
method published in Schwarz et al 2010 to reconstruct the evolutionary
tree and the unsampled internal nodes including the cancer
precursor. Our model includes horizontal dependencies,
i.e. amplification and deletion events of arbitrary length over the
full CNV profiles.

Long Abstract: Click Here

Poster F03

Domain architecture conservation in orthologs

Kristoffer Forslund Stockholm Bioinformatics Centre

Erik Sonnhammer (Stockholm University, Stockholm Bioinformatics Centre); Isabella Pekkari (Stockholm University, Stockholm Bioinformatics Centre);

Short Abstract: As orthologous proteins are expected to retain function more often than other homologs, they are often used for functional annotation transfer between species. While the assumption of higher similarity in various respects of orthologs over paralogs at the same evolutionary separation is widespread and supported by many isolated examples, it has not been systematically evaluated, and some recent publications have questioned its general validity. Because of this, we set out to consistently validate the specific effect of orthologous versus paralogous evolution on the conservation of domain architectures in homologous proteins.

We carried out a large-scale study of domain architecture conservation or change on a large-scale dataset. We designed a score to measure domain architecture similarity and used it to analyze differences between orthologs and close outparalogs as defined by InParanoid. To compensate for differences between the pair types in average evolutionary divergence, we divided the protein comparisons into bins based on Jukes-Cantor corrected protein sequence identity, and analyzed the difference between ortholog and paralog pairs within each bin separately.

The analysis shows that orthologs exhibit significantly greater domain architecture conservation than non-orthologous homologs, even when differences in evolutionary divergence are compensated for. We interpret this as an indication of a stronger selective pressure on orthologs than on paralogs to retain the domain architecture required to perform a specific function. This supports the notion that orthologs are functionally more similar than other types of homologs at the same evolutionary distance.

Poster F04

The Operonic Location of Auto-Transcriptional Repressors is Highly Conserved in Bacteria

Tal Pupko Tel-Aviv University

Nimrod Rubinstein (Tel-Aviv Univeristy, Cell Research and Immunology); David Zeevi (Tel-Aviv Univeristy, Cell Research and Immunology); Yaara Oren (Tel-Aviv Univeristy, Cell Research and Immunology); Gil Segal (Tel-Aviv University, Molecular Microbiology and Biotechnology Department );

Short Abstract: Bacterial genes are commonly encoded in clusters, known as operons, which share transcriptional regulatory control and often encode functionally related proteins that take part in certain biological pathways. Regulatory related operons are known to colocalize in the genome, suggesting that their spatial organization is under selection for efficient expression regulation. However, the internal order of genes within operons is believed to be poorly conserved and hence expression requirements are claimed to be too weak to oppose gene rearrangements. In light of these opposing views we set out to investigate whether the internal location of the regulatory genes within operons is under selection. Our analysis shows that transcription factors are preferentially encoded either first or last in their operons, in the two diverged model bacteria Escherichia coli and Bacillus subtilis. In a higher resolution, we find that transcription factors that repress transcription of the operon in which they are encoded (autorepressors), contribute most of this signal by specific preference of the first operon position. We show that this trend is strikingly conserved throughout highly diverged bacterial phyla. Moreover, these autorepressors regulate operons that carry out highly diverse biological functions. We propose a model according to which autorepressors are selected to be located first in their operons in order to optimize transcription regulation. Specifically, the first operon position helps autorepressors to minimize leaky transcription of the operon structural genes, thus minimizing energy waste. Our robust statistical evidence corroborated with our suggested model thus reveals a paradigm of bacterial operon expression control.

Poster F05

Evolution of transition-metal binding domains

Arye (Arik) Harel Rutgers University

Arik Harel (Rutgers University) Arik Harel (Rutgers, Institute of Marine and Coastal Science & the Department of Biochemistry and Microbiology); Cheong Xin Chan (Rutgers, Department of Ecology, Evolution and Natural Resources); Debashish Bhattacharya (Rutgers, Department of Ecology, Evolution and Natural Resources); Paul Falkowski (Rutgers, Institute of Marine and Coastal Science); Yana Bromberg (Rutgers, Department of Biochemistry and Microbiology);

Short Abstract: Protein redox reactions are among the most important biochemical reactions due to their association with key biological processes like energy homeostasis and metabolism. Their basic machinery evolved in microbes early in our planet's history and has remained conserved throughout time. As amino acids are weak electron transfer reaction mediators, most of these reactions are performed using cofactors such as organic ligands and transition-metals. Recently, it has been suggested that the differential availability of different metal types in the Achaean ocean influenced both the evolution of metal-binding architectures and the selection of preferred binders. That is, iron, manganese, and molybdenum were preferentially used before the Great Oxidation Event (GOE), while copper and zinc utilization evolved after the GOE.
To further study the evolution of transition-metal binding domains and its relationship to the environmental trends, we extracted from Swiss-Prot 35,446 Oxidoreductases (Enzyme Commission group 1) and analyzed them for transition-metal binding. 298 InterPro transition-metal binding domains were identified in 7,227 proteins. As InterPro domains overlap, we further manually filtered this set to the most representative subset of 106 domains (median size 129 residues) in 7,201 proteins. 86% of these sequences contain annotations confirming the presence of the metal binding residues. In order of decreasing abundance, domains binding iron, copper, molybdenum, manganese, and vanadium are present in our dataset. Here we describe our efforts to understand the origins of these domains, their relationships, and respective function evolution, independent of carrier organisms, using phylogenetic analysis and functional data reported in the literature.

Poster F06

Transcriptomics in an ecological key species: Adaptive variations in seagrass responses to rising water temperatures

Susanne Franssen Leibniz-Institute for Marine Sciences

Thorsten Reusch (Leibniz-Institute for Marine Sciences) Jenny Gu (University of Münster, Institute for Evolution and Biodiversity); Nina Bergmann (Leibniz-Institute for Marine Sciences, Evolutionary Ecology of Marine Fishes); Gidon Winters (University of Münster, Institute for Evolution and Biodiversity); Ulrich Klostermeier (Christian-Albrechts-University Kiel, Institute of Clinical Molecular Biology); Philip Rosenstiel (Christian-Albrechts-University Kiel, nstitute of Clinical Molecular Biology); Erich Bornberg-Bauer (University of Münster, Institute for Evolution and Biodiversity);

Short Abstract: RNA-seq offers the opportunity to perform global transcription profiling of a key ecological species, predicting ecologically relevant responses under global warming. The seagrass Zostera marina, occurring along a thermal cline, provides the unique opportunity to assess temperature effects on gene expression as a function of their long term adaptation to local temperature regimes.

Here, natural populations from cold and warm adapted seagrass populations (Denmark, Italy) of Zostera marina were exposed to a realistic heat wave scenario in a common stress garden setup, capturing a two-point thermal reaction norm. Transcriptomic responses were obtained by RNA-seq of eight cDNA libraries, each comprising ~125 000 reads. The expression profiles were assessed subsequent to transcriptome de novo assembly and gene identification via orthologous plant genes.

Expression profiles revealed similar acute stress responses of both populations, with a focus on heat shock proteins. Population differences, however, became apparent at immediate heat recovery, characterised by a convergence to control expression of the warm adapted populations, while profiles of cold adapted genotypes diverged further from controls as well as acute heat responses. This divergent expression was characterized by a diverse set of gene functions dominated by protein degradation and RNA transcription regulation.

Results implicate that ecological experiments addressing gene expression differences of locally adapted populations may be misleading when only acute stress responses are considered. Moreover, we propose expressional flexibility to return to control expression as an important measure to predict the plasticity of individuals and hence, the fate of local populations under increasing climatic extremes.

Poster F07

Phylogenetic analysis of mapped sequence reads

Jean-François Dufayard CIRAD Montpellier

Manuel Ruiz (CIRAD Montpellier, Hérault);

Short Abstract: The aim of the project is to document variation in genomic patterns among a wide range of crops and wild relative species, during the domestication process and at a larger evolutionary scale. We analyse transcriptomes of several individuals for eleven different crops. For each crop, we assemble a transcriptome of one individual, and then we map the sequence reads of the different individuals on this reference.
The main weakness of this method resides in the analyses of mapping results: most frequently, the only mapping relationships taken into account are the 1:1 ones. It means that many paralogy, polymorphism or allelism problems may break-off the analysis.
We propose a methodological improvement in three steps:
-a global clustering of consensus sequences from the individuals, and from the reference,
-a phylogenetic analysis of each cluster (MAFFT, GBlocks, PhyML + branch support statistics),
-a mapping based on tree topologies and branch supports.
The mapping of transcripts on the reference using phylogenetical methods permits a rigorous analysis of multigenic, multi-individual or allelic data.
For each mapping we are able to provide additional information:
-the number of paralogies in the family,
-the maximum or the mean divergence of the family,
-the maximum or the mean divergence of the mapped transcripts with the reference,
-the presence of highly supported incongruences in the topologies.
Such methods are able to infer rich mapping relationships, and important information to detect even individual-specific evolutionary events.

Poster F08

Transcriptomic and metabolomic investigation of thermal stress response through metabolic modulations in seagrasses

Jenny Gu University of Muenster

Susanne Franssen (University of Muenster, Institute for Evolution and Biodiversity); Katrin Weber (Heinrich-Heine University, Plant Biochemistry); Elisabeth Klemp (Heinrich-Heine University, Plant Biochemistry); Andreas Weber (Heinrich-Heine University, Plant Biochemistry); Erich Bornberg-Bauer (University of Muenster, Institute for Evolution and Biodiversity); Gidon Winters (University of Muenster, Institute for Evolution and Biodiversity); Thorsten Reusch (Leibniz-Institute for Marine Sciences, Marine Ecology);

Short Abstract: While chaperones are known to stabilize proteins against denaturing effects, metabolic adjustments modulating compatible osmolytes may prove to be a more viable strategy to counteract chronic thermal stress. A recent proteomic analysis of extremophiles with eScape, a protein stability predicting algorithm, suggests that global fitness of organisms to thermal stress may be achieved via interactions between the metabolome and the proteome. This hypothesis is further investigated in seagrasses (Zostera marina and Zostera noltii) which have been subjected to longer heat waves with the recent climate change. Metabolic adaptation may be the key alternate strategy for many organisms, particularly those that are sessile in aqueous environments and are therefore subjected to longer dosage effects of thermal stress due to the heat capacity of water compared to terrestrial organisms. Metabolomics and transcriptomic studies using next generation sequencing for seagrasses collected from two populations (Denmark and Italy) were conducted. Seagrasses were subjected to an ecologically relevant heat regime controlled within a mesocosm. Significant metabolic adjustment during the thermal stress response of these two populations with different thermal tolerance has been observed, particularly with sucrose and fructose.

References:
Jenny Gu and Vincent J. Hilser. Mol. Biol. Evol. 26(10):2217-2227. 2009

Poster F09

Positive Feedback and Tradeoff as Sources for Complexity in Social Interaction System

Tomoyuki Yamamoto Osaka University

Yuya Yoshimura (Osaka University, Engineering Science); Hiroshi Ishiguro (Osaka University, Grad. School of Engineering Science);

Short Abstract: A theoretical model of social system is investigated for the origin complexity of the social interaction system.
While "small world" theory seems to suggest existence of a positive feedback phenomenon in social systems, the phenomenon implies winner-takes-all situation that leads to simplification of the system falling into minimization problem on a energy landscape.
It does not correspond to a real social system, such as human society.
There must be another feature that originates diversity.

We hypothesized that the diversity of society is a tradeoff feature between social strength parameters, like diversion aspects of strength and weakness form a personality.

The simulation model is based on "DODOM" interaction model introduced by Hogeweg.
In our model, DOM value is extended to 2 or 3 dimensional vector.

While this interaction is a two-person game with a dilemma, a randomness and a positive feedback, our extension is adaptation of tradeoff between DOM components.
The interaction is done in only one component and other updated inversely; if an individual wins, DOM component used for the interaction increase and one of the rest decrease.

As the results, we found grouping and oscillating phenomenon in 2D system.
The grouping, differentiation into two groups, is maintained both of inter- and intra- group interaction, where cutoff of one of them results in reunification.
Also, in 3D, we obtained clustering and dynamical interchanging of members in addition to above results. We believe this self organizing hierarchical structure is obtained, as a source of complexity of society.

Poster F10

Novel phosphorylation modification sites in human proteins that originated after the human-chimpanzee divergence

Yoonsoo Hahn Chung-Ang University

Short Abstract: Phosphorylation modifications of specific protein residues are involved in a wide range of biological processes such as modulation of intracellular signal networks. Here, we present the development and application of a bioinformatics procedure for systematic identification of human-specific phosphorylation sites in proteins that may have occurred after the human-chimpanzee divergence. We collected annotated human phosphorylation sites and compared each site to orthologous mammalian proteins across taxa including chimpanzee, orangutan, rhesus macaque, marmoset, mouse, dog, cow, elephant, opossum, and platypus. We identified 39 human-specific gain of annotated phosphorylation sites in 37 proteins: 24 serines, 12 threonines, and three tyrosines. In many cases, the novel phosphorylation sites are situated in highly conserved segments of the protein. Proteins with novel phosphorylation sites are involved in crucial biological processes such as cell division (AURKB, CASC5, MKI67, and PDCD4) and chromatin remodeling (HIRA, HIRIP3, HIST1H1T, NAP1L4, and LRWD1). Modified phosphorylatable residues produce novel target sites for protein kinases such as cyclin-dependent kinases and casein kinases, possibly resulting in rewiring of phosphorylation regulatory networks. We propose that the gain of phosphorylation sites during human evolution may have altered protein interaction networks and resulted in the development of novel human-specific phenotypes.

Poster F11

Large tandem, higher order repeats (HOR) and regularly dispersed repeat units contribute substantially to divergence between human and chimpanzee Y chromosomes

Matko Glun?i? University of Zagreb, Faculty of Science

Matko Glunčić (University of Zagreb, Faculty of Science) Vladimir Paar (Faculty of Science, University of Zagreb, Theoretical Physics Dept. ); Ivan Basar (Faculty of Science, University of Zagreb, Theoretical Physics Dept. ); Marija Rosandi? (University Hospital Rebro, Department of Internal Medicine); Mislav Cvitkovi? (Faculty of Science, University of Zagreb, Theoretical Physics Dept.); Petar Paar (Faculty of Electrical Engineering and Computing, Theoretical Dept.);

Short Abstract: Comparison of human and chimpanzee genomes has received much attention recently, because of paramount role for understanding the evolutionary step distinguishing us from our closest living relative. In order to contribute to insight into Y chromosome evolutionary history, we study and compare tandems, HORs and regularly dispersed repeats in human and chimpanzee Y chromosome contigs, using robust Global Repeat Map (GRM) algorithm. We find Human Accelerated HOR Regions (HAHORs). In peripheral domains of 35mer human alphoid HORs we find riddled features containing ten additional repeat monomers. In chimpanzee we identify 30mer alphoid HOR. We construct alphoid HOR schemes showing significant human-chimpanzee difference, revealing rapid evolution after human-chimpanzee separation. We identify and analyze over twenty other large repeat units, most of them reported here for the first time: chimpanzee and human ~1.6 kb 3mer secondary repeat unit (SRU) and ~23.5 kb tertiary repeat unit (for ~0.55 kb primary repeat unit (PRU)); human 10848, 15775, 20309, 60910, and 72140 bp PRUs; human 3mer SRU (for ~2.4 kb PRU); 715mer and 1123mer SRUs (for 5mer PRU); chimpanzee 5096, 10762, 10853, 60523 bp PRUs; chimpanzee 64624 bp SRU (for 10853 bp PRU) and so on. We show that substantial human-chimpanzee differences are concentrated in large repeat structures, at the level of as much as ~70% divergence, sizably exceeding previous numerical estimates for some selected noncoding sequences. Smeared over the whole sequenced assembly (25 Mb) this gives ~14% human-chimpanzee divergence. This is significantly higher estimate of divergence between human and chimpanzee than previous estimates.

Poster F12

Distribution patterns of genomic elements in bacteria using genome linguistic methods.

Oliver Bezuidt University of Pretoria

Oleg Reva (University of Pretoria, Biochemistry);

Short Abstract: DNA molecules encoding functional enzymes, transcriptional regulators and virulence factors are fluxing through the bacterial taxonomic walls. They endow environmental and clinical strains of bacteria with new unexpected properties. Lateral genetic exchange, particularly of drug tolerance genes has been recognized for a long time; however the ontology of genomic islands and their donor-recipient relations remain generally obscure because of methodological problems. Prediction of insertion sites of these elements in host chromosomes still remain to be a challenge. Horizontally transferred genes are highly mutable and the mobilome entities having been inserted into host chromosomes undergo multiple events of fragmentation, partial duplications and deletions. This work is based on novel genome linguistics methods to study and visualize intrinsic relationships between mobile genetic elements in bacteria. The method also determines the relative insertion time of genomic islands in bacterial chromosomes using analytical stratigraphic approaches.

Poster F13

Gene expression evolution on the emergence of pathogenicity in Ascomycetes

Aminael Sanchez Catholic University of Leuven

Aminael Sanchez-Rodriguez (Catholic University of Leuven) Aminael Sanchez-Rodriguez (Catholic University of Leuven, Microbial and Molecular Systems); Kristof Engelen (Catholic University of Leuven, Microbial and Molecular Systems); Riet De Smet (Ghent University, Plant Systems Biology); Qiang Fu (Catholic University of Leuven, Microbial and Molecular Systems); Yan Wu (Catholic University of Leuven, Microbial and Molecular Systems); Kathleen Marchal (Catholic University of Leuven, Microbial and Molecular Systems);

Short Abstract: The Ascomycetes form the largest phylum in the fungal kingdom. They are of special interest due to their broad spectrum of life styles including both plant and human pathogens. In a previous study we showed that most of the protein-coding genes needed for pathogenicity were already present in an ancestor common to both pathogenic and non-pathogenic Ascomycetes. Based on this finding that at least some of the coding potential related to pathogenicity is common to both pathogens and non-pathogens, we studied whether alterations in expression behavior could be correlated with a pathogenic versus a non pathogenic lifestyle. Using coexpression networks derived from large scale expression compendia for the non-pathogen N. crassa and the pathogens M. grisea and F. graminearum, we compared the coexpression behavior of true orthologs from gene families related to pathogenicity and occurring in each of the species. We found that the direct neighbours, of the respective orthologs in the coexpression network of the pathogen was largely different from its neighbours in the non-pathogenic species, implying a considerable rewiring of the coexpression network of these common orthologs. In addition we also observed that the expression evolution of paralogs in gene families that are common to both pathogens and non pathogens is different. More specifically we detected that the expression behavior of paralogs belonging to gene families that are involved in pathogen-host interaction tend evolve faster in pathogens than in non-pathogens.

Poster F14

Incorporating protein structure into coevolution analysis reveals functional side chain interactions in ABC-C exporters

Attila Gulyas-Kovacs postdoc/Rockefeller Universiy

David C. Gadsby (Professor/Rockefeller University, Laboratory of Cardiac/Membrane Physiology);

Short Abstract: Prediction of coevolving amino acid positions from aligned sequences provides detailed insight into conserved molecular mechanisms in a protein family. However, the accuracy of prediction is limited by the non-independence of sequence homologs and the variability of the rate of amino acid substitution across positions. To mitigate these factors, a new approach is introduced, which takes advantage of protein structure to optimize the prediction of side chain contacts in three steps. One step tackles sequence non-independence by adjusting phylogenetic filtering of the alignment. Another addresses substitution rate heterogeneity by partitioning position pairs into rate classes followed by class weighting. Separately or in combination, these optimization steps improved three out of four tested coevolution prediction methods. A third step, which weights and combines methods, yielded further improvement relative to individual methods. These results were obtained using an alignment of ABC-C exporters, which underlie multidrug resistance of cancerous cells, insulin secretion, and cystic fibrosis disease. ABC-C proteins are characterized by highly asymmetric nucleotide binding domains, whose ATP-hydrolytic activity is signaled to the transport pathway gates via bundles of long helices that form intracellular linkers extending into the transmembrane domains. The new approach was harnessed to predict functional side chain interactions within and between the nucleotide binding domains and linker, or transmembrane, helices. Comparing outward- and inward-facing transporter conformations suggests that many interactions provide persistent stability to the helix bundles, whereas others occur transiently in a particular conformational state. Intriguingly, some identified positions are implicated in cystic fibrosis.
[NIH-DK51767]

Poster F15

Evolutionary meta-analysis reveals ancient constraints affecting missing heritability and reproducibility in disease association studies

Joel Dudley Stanford University

Atul Butte (Stanford University, Systems Medicine); Sudhir Kumar (Arizona State University, Center for Evolutionary Medicine and Informatics); Rong Chen (Stanford University, Systems Medicine);

Short Abstract: Genome-wide disease association studies contrast genetic variation between disease cohorts and healthy populations to discover single nucleotide polymorphisms (SNPs) and other genetic markers revealing underlying genetic architectures of human diseases. Despite many large efforts over the past decade, these studies are yet to identify many reproducible genetic variants that explain significant proportions of the heritable risk of common human diseases. Here, we report results from a multi-species comparative genomic meta-analysis of 6,720 risk variants for more than 420 disease phenotypes reported in 1,814 studies, which is aimed at investigating the role of evolutionary histories of genomic positions on the discovery, reproducibility, and missing heritability of disease associated SNPs (dSNPs) identified in association studies. Our findings indicate that association studies are biased towards discovering rare variants, because strongly conserved positions only permit minor alleles with lowest frequencies. Using published data from a large case-control study, we demonstrate that the use of a straightforward multi-species evolutionary prior improves the power of association statistics to discover SNPs with reproducible genetic disease associations.

Poster F16

Evolutionary meta-analysis reveals ancient constraints affecting missing heritability and reproducibility in disease association studies

Joel Dudley Stanford University

Rong Chen (Stanford University, Systems Medicine and Pediatrics); Atul Butte (Stanford University, Systems Medicine and Pediatrics); Sudhir Kumar (Arizona State University, Center for Evolutionary Medicine and Informatics);

Short Abstract: Genome-wide disease association studies contrast genetic variation between disease cohorts and healthy populations to discover single nucleotide polymorphisms (SNPs) and other genetic markers revealing underlying genetic architectures of human diseases. Despite many large efforts over the past decade, these studies are yet to identify many reproducible genetic variants that explain significant proportions of the heritable risk of common human diseases. Here, we report results from a multi-species comparative genomic meta-analysis of 6,720 risk variants for more than 420 disease phenotypes reported in 1,814 studies, which is aimed at investigating the role of evolutionary histories of genomic positions on the discovery, reproducibility, and missing heritability of disease associated SNPs (dSNPs) identified in association studies. We show that dSNPs are disproportionately discovered at conserved genomic loci in both coding and non-coding regions, as the effect size (odds ratio) of dSNPs relates strongly to the evolutionary conservation of their genomic positions. Our findings indicate that association studies are biased towards discovering rare variants, because strongly conserved positions only permit minor alleles with lowest frequencies. Using published data from a large case-control study, we demonstrate that the use of a straightforward multi-species evolutionary prior improves the power of association statistics to discover SNPs with reproducible genetic disease associations. Therefore, long-term evolutionary histories of genomic positions are poised to play a key role in reassessing data from existing disease association studies and in the design and analysis of future studies aimed at revealing the genetic basis of common human diseases.

Poster F17

A Phylogenetic Approach to Germline Methylome Reconstruction

Lars Feuerbach Max Planck Institut Informatik

Rune B. Lyngsoe (Oxford University, Department for Statistics); Thomas Lengauer (Max Planck Institute Informatik, Computational Biology and Applied Algorithmics); Jotun Hein (Oxford University, Department for Statistics);

Short Abstract: DNA methylation of cytosine (C) is a molecular mechanism that establishes an epigenetic imprint, which can be maintained across cell division. Furthermore, it introduces a mutational burden that facilitates the transition of 5-methylcytosine (5mC) to thymine (T). Methylation in mammals is mainly targeted to cytosines that are followed by a guanine (CpG) and is considered the default state of this dinucleotide as, for instance, 70-80% of all CpGs are methylated in human tissues. As spontaneous deamination of 5mC is a relatively frequent chemical event, over the course of evolution genomic CpGs are being significantly depleted. This CpG decay has resulted in a six fold underrepresentation of CpGs in the human genome. In fact, the CpG to TpG/CpA mutations appear about 48 times more frequently than normal transversions. It can be assumed that CpG containing sequences of biological relevance are preferentially unmethylated in the germline to escape this mutational burden. Conversely, genome regions that have been persistently unmethylated in the germline over several million years acquire a notable footprint in their dinucleotide composition. In consequence, in unmethylated regions phenomena such as CpG island formation are observable.
We here propose a phylogeny based approach, which exploits the substantial differences in the mutation rates of methylated and unmethylated DNA. In simulation studies, the underlying algorithm robustly reconstructs the methylation state of the applied evolutionary constraints from homologous sequences. One of the intended applications of the approach is an enhanced identification of epigenetically active candidate biomarkers in methylation-based genome-wide association studies.

Poster F18

Probabilistic reconstruction of protein interaction histories using a simple measure of interaction affinity.

Ryan Topping Imperial College London

John Pinney (Imperial College London, Molecular Biosciences);

Short Abstract: The increasing availability of sequence and structural data describing protein complexes across species has encouraged attempts to understand the evolutionary histories of these complexes. However, previous attempts have lacked a thorough phylogenetic underpinning, attempting to model the re-wiring of protein-protein interactions whilst neglecting our knowledge of sequence evolution. The inclusion of detailed phylogenetic information within a probabilistic model of interaction evolution will provide a better appreciation of uncertainties and permit the prediction of assembled complex structures in species for which we have no interaction data.

We present a probabilistic approach for the reconstruction of interaction histories in a complex of paralogous subunits across several species. The approach incorporates a maximum a posteriori reconciled gene tree and a model for interaction gain and loss (re-wiring events) during protein sequence evolution. In order to predict re-wiring events, we propose using a simple measure of binding affinity that takes into account the coevolution occurring at the protein-protein interface. Preliminary results describing an application of the method to reconstruct the history of the proteasome show improvements using this measure compared to previously used measures. Using a training set of protein x-ray crystal structures, we then fit a generalised model relating changes at a protein-protein interface to re-wiring events during evolution.

Poster F19

Next Generation Sequencing, Assembly, and Analysis of Influenza A (H1N1)

Simon Watson Wellcome Trust Sanger Institute

Eve Coulter (Wellcome Trust Sanger Institute, Viral Genomics); Gregory Baillie (Wellcome Trust Sanger Institute, Viral Genomics); Rachael Wash (Wellcome Trust Sanger Institute, Viral Genomics); Astrid Gall (Wellcome Trust Sanger Institute, Viral Genomics); Anne Palser (Wellcome Trust Sanger Institute, Viral Genomics); Monica Galiano (Health Protection Agency, Center for Infections); Maria Zambon (Health Protection Agency, Center for Infections); Oliver Pybus (University of Oxford, Department of Zoology); Andrew Rambaut (University of Edinburgh, Institute of Evolutionary Biology); Paul Kellam (Wellcome Trust Sanger Institute, Viral Genomics);

Short Abstract: Viral gene sequencing and phylogenetics can be used to study the epidemiological dynamics of rapidly-evolving viruses. With complete genome data, it's possible to identify and trace individual transmission chains of viruses, like influenza, during the course of an epidemic. We have developed methods for next generation sequencing of influenza A and sequenced 26 influenza A H1N1 genomes from the 2009 pandemic. Samples were sequenced on an Illumina GAII and the resultant reads quality controlled and assembled against a reference sequence using a pipeline developed here called QUASR.
By comparing to other influenza A H1N1 2009 genomes from the UK, we demonstrate that the UK epidemic was composed of many co-circulating lineages, of which at least 12 were exclusively or predominantly UK clusters. The estimated divergence times of two clusters predate the detection of H1N1pdm in the UK, suggesting H1N1pdm was circulating in the UK before the first clinical case. Crucially, three clusters contain isolates from the second wave of infections in the UK, two of which represent chains of transmission that have persisted within the UK between the first and second waves.

Poster F20

TimeTree2: Species divergence times from molecular data

Sudhir Kumar The Biodesign Institute, Arizona State University

Short Abstract: Scientists and non-scientists often need to know times of divergence between species but rarely can locate that information because it is buried in the scientific literature, usually in a format that is inaccessible to text search engines. We have developed a public knowledgebase that enables data-driven access to the collection of peer reviewed publications in molecular evolution and phylogenetics that have reported estimates of time of divergence between species. Users can query the TimeTree resource by providing two names of organisms (common or scientific) that can correspond to species or groups of species. The current TimeTree web resource (TimeTree2) contains timetrees reported from molecular clock analyses in 910 published studies and 17,341 species that span the diversity of life. TimeTree2 interprets complex and hierarchical data from these studies for each user query, which can be launched using a specially designed iPhone application, in addition to the website. Published time estimates are now readily accessible to the scientific community and general public, without requiring knowledge of evolutionary nomenclature.
TimeTree2 is accessible from the URL http://www.timetree.org, with an iPhone app available from iTunes (http://itunes.apple.com/us/app/timetree/id372842500?mt=8) and a YouTube tutorial (http://www.youtube.com/watch?v=CxmshZQciwo).

Poster F21

Identifying Patterns of Balancing Selection by Comparing the Bonobo, Chimpanzee and Human Genomes

Kay Prüfer Max-Planck-Institute for Evolutionary Anthropology

Aida Andres (Max Planck Institute for Evolutionary Anthropology, Evolutionary Genetics); Kasper Munch (Aarhus University, Bioinformatics Research Centre (BiRC) ); Thomas Mailund (Aarhus University, Bioinformatics Research Centre (BiRC) ); Asger Hobolth (Aarhus University, Bioinformatics Research Centre (BiRC) ); Mikkel Schierup (Aarhus University, Bioinformatics Research Centre (BiRC) ); The Bonobo Genome Consortium (Max Planck Institute for Evolutionary Anthropology, Evolutionary Genetics); Janet Kelso (Max Planck Institute for Evolutionary Anthropology, Evolutionary Genetics); Svante Pääbo (Max Planck Institute for Evolutionary Anthropology, Evolutionary Genetics);

Short Abstract: Balancing selection is a process by which advantageous diversity in populations is maintained. It is, for instance, known to play a vital role in maintaining variation in some immunity-related genes. Here, we describe a new method to identify regions with the genetic signatures of long-standing balancing selection acting in the chimpanzee and bonobo, and potentially the human, lineages.
Alleles evolving under long-standing balancing selection may trace their ancestry back to a common ancestor older than the common ancestor of bonobo, chimpanzee and human and can appear as regions of incomplete lineage sorting (i.e. the gene tree may differ from the species tree for this region). We therefore applied a coalescent hidden Markov model approach to the bonobo, chimpanzee and human genomes to identify regions of incomplete lineage sorting. To enrich for candidate regions with a high likelihood of being true targets of selection, we used Illumina resequencing data for 16 chimpanzee and three bonobo individuals. Within this data we investigated several other signatures that can derive from the action of balancing selection: high diversity in natural chimpanzee and bonobo populations, and a high fraction of shared SNPs between each of the three pairs of species. We identified several regions that constitute candidate targets of balancing selection.

Poster F22

The Darwinian Tree Of Life In Light Of Horizontal Gene Transfer (Is Still Sound)

Adam Sardar University of Bristol

Short Abstract: Horizontal gene transfer (HGT) is the process whereby nucleic acids are transmitted from the genome of one species to another without inheritance by descent (i.e parent to child). Some people believe that there is no prokaryotic tree of life, but in fact a ‘thicket’ of shared genetic material. It is therefore extremely important to understand HGT, as it is so divergent from assumptions made in some phylogenetic analysis , such as a direct ancestry of homologous proteins. Previous studies have focused on trying to predict precisely which genes have been horizontally transferred and where, based on multiple sequence alignments and searches for a ’footprint’ or biochemical signal peculiar to the parent genome. This is very difficult and the signal is quickly lost with evolutionary time. We propose avoiding to attempt prediction of every horizontal transfer event, but to take a global view. We look for proteins comprised of sets of structural-domains (domain architectures, as identified in the SUPERFAMILY database) that show disparity in their deletion rates across the tree of life, requiring an overwhelmingly improbable realisation of events in order for their current distribution in observed genomes. This does not provide concrete identification of individual HGT events, but creates an enriched set of possible contenders, acting as an indicator towards HGT across all all organisms.

Our research suggest an upper bound of 30% for the percentage of domain architectures in bacteria involved in HGT, suggesting that bacteria do indeed posses an underlying Darwinian tree.

Accepted Posters

Preparing your Poster - Information and Poster Size
Poster Schedule
Vienna Poster Printing Services
Poster Categories
Search for a Poster

Attention Poster Authors: The ideal poster size should be max. 1.30 m (130 cm) high x 0.90 m (90 cm) wide. Fasteners (Velcro / double sided tape) will be provided at the site, please DO NOT bring tape, tacks or pins. View a diagram of the the poster board here

Posters Display Schedule:

Odd Numbered posters:

Set-up timeframe: Sunday, July 17, 7:30 a.m. - 10:00 a.m.
Author poster presentations: Monday, July 18, 12:40 p.m. - 2:30 p.m.
Removal timeframe: Monday, July 18, 2:30 p.m. - 3:30 p.m.*

Even Numbered posters:

Set-up timeframe: Monday, July 18, 3:30 p.m. - 4:30 p.m.
Author poster presentations: Tuesday, July 19, 12:40 p.m. - 2:30 p.m.
Removal timeframe: Tuesday, July 19, 2:30 p.m. - 4:00 p.m.*

* Posters that are not removed by the designated time may be taken down by the organizers and discarded. Please be sure to remove your poster within the stated timeframe.

Delegate Posters Viewing Schedule

Odd Numbered posters:
On display Sunday, July 17, 10:00 a.m. through Monday, June 18, 2:30 p.m.
Author presentations will take place Monday, July 18: 12:40 p.m.-2:30 p.m.

Even Numbered posters:
On display Monday, July 18, 4:30 p.m. through Tuesday, June 19, 2:30 p.m.
Author presentations will take place Tuesday, July 19: 12:40 p.m.-2:30 p.m

Want to print a poster in Vienna - try these options:

Repacopy- next to the congress venue link [MAP]

Also at Karlsplatz is in the Ring Center, Kärntner Str. 42, link [MAP]

If you need your poster on a thicker material, you may also use a plotter service next to Karlsplatz: http://schiessling.at/portfolio/

View Posters By Category

Search Posters:

↑ TOP