Function COSI

Attention presenters: please review the Speaker Information Page.

Schedule subject to change
All times listed are in UTC

Wednesday, July 28th

11:00-11:20

Proceedings Presentation: DeepGraphGO: graph neural network for large-scale, multispecies protein function prediction

Format: Pre-recorded with live Q&A

Moderator(s): Mark Wass

Ronghui You, Fudan University, China
Shuwei Yao, Fudan University, China
Hiroshi Mamitsuka, Kyoto University / Aalto University, Japan
Shanfeng Zhu, Fudan University, China

11:20-11:40

BENZ WS annotates sequences of the human reference proteome with four level EC numbers

Format: Pre-recorded with live Q&A

Moderator(s): Mark Wass

Davide Baldazzi, University of Bologna - Biocomputing Group, Italy
Castrense Savojardo, University of Bologna - Biocomputing Group, Italy
Pier Luigi Martelli, University of Bologna - Biocomputing Group, Italy
Rita Casadio, University of Bologna - Biocomputing Group, Italy

11:40-12:00

Large-scale mining of differential expression data reveals insight into gene function

Format: Pre-recorded with live Q&A

Moderator(s): Mark Wass

Jordan Sicherman, Graduate Program in Bioinformatics, Canada
Nathaniel Lim, Graduate Program in Genome Science and Technology, Canada
Paul Pavlidis, Michael Smith Laboratories - Department of Psychiatry, Canada

12:00-12:20

Cellular composition variation drives coexpression-based gene function prediction

Format: Pre-recorded with live Q&A

Moderator(s): Mark Wass

Paul Pavlidis, Michael Smith Laboratories and Department of Psychiatry, University of British Columbia, Canada
Qinkai Wu, Graduate Program in Bioinformatics, University of British Columbia, Canada

12:40-13:00

Critical assessment of protein intrinsic disorder prediction

Format: Pre-recorded with live Q&A

Moderator(s): Kim Reynolds

Damiano Piovesan, University of Padova, Italy
Silvio C. E. Tosatto, University of Padova, Italy

13:00-13:20

Proposals to improve CAFA evaluation based on community participation

Format: Pre-recorded with live Q&A

Moderator(s): Kim Reynolds

Jeffrey Yunes, Yunes Foundation for Research on Aging, Portsmouth, NH, United States
Chengxin Zhang, Department of Molecular, Cellular, and Developmental Biology, Yale University, New Haven, CT 06511, United States
Petri Törönen, Institute of Biotechnology, University of Helsinki, Finland

13:20-14:00

Panel Discussion of CAFA

Format: Live-stream

Moderator(s): Kim Reynolds

Predrag Radivojac, Iddo Friedberg, Mark Wass

14:20-14:40

Sequence-based prediction of proteins associated with extracellular vesicles

Format: Pre-recorded with live Q&A

Moderator(s): Kim Reynolds

Sanne Abeln, Vrije Universiteit Amsterdam, Netherlands
Katharina Waury, Vrije Universiteit Amsterdam, Netherlands
Dea Gogishvili, Vrije Universiteit Amsterdam, Netherlands

14:40-15:20

UniProt and Gene Ontology: the need for functional annotation across the span of taxonomic biodiversity

Format: Live-stream

Moderator(s): Kim Reynolds

Sandra Orchard, EMBL-EBI

Thursday, July 29th

11:00-11:20

ChemBoost: A chemical language based approach for the prediction of protein-ligand binding affinity

Format: Pre-recorded with live Q&A

Moderator(s): Mark Wass

Rıza Özçelik, Boğaziçi University, Turkey
Hakime Öztürk, Boğaziçi University, Turkey
Arzucan Ozgur, Boğaziçi University, Turkey
Elif Ozkirimli, Boğaziçi University, Turkey

11:20-11:40

Ensemble learning for novel drug-target affinity prediction

Format: Pre-recorded with live Q&A

Moderator(s): Mark Wass

Rıza Özçelik, Boğaziçi University, Turkey
Alperen Bağ, Boğaziçi University, Turkey
Berk Atıl, Boğaziçi University, Turkey
Elif Ozkirimli, Boğaziçi University, Turkey
Arzucan Ozgur, Boğaziçi University, Turkey

11:40-12:00

Transformer-based Protein Function Annotation with Joint Sequence-Label Embedding

Format: Pre-recorded with live Q&A

Moderator(s): Mark Wass

Yue Cao, Texas A&M University, United States
Yang Shen, Texas A&M University, United States

12:00-12:20

Integrating multiple information sources for protein function prediction with end-to-end deep learning

Format: Pre-recorded with live Q&A

Moderator(s): Mark Wass

Gabriela Merino, IBB-CONICET-UNER, Argentina
Diego Milone, sinc(i)-CONICET-UNL, Argentina
Maria Martin, EBI-EMBL, United Kingdom
Georgina Stegmayer, sinc(i)-CONICET-UNL, Argentina
Rabie Saidi, EBI-EMBL, United Kingdom

12:40-13:00

seqSCAN: Unsupervised Classification of Proteins for New Function Discovery

Format: Pre-recorded with live Q&A

Moderator(s): Iddo Friedberg

Meet Barot, Center for Data Science, New York University, United States
Vladimir Gligorijevic, Center for Computational Biology, Flatiron Institute, United States
Kyunghyun Cho, Center for Data Science, New York University, United States
Richard Bonneau, Flatiron Institute, New York University, United States

13:00-13:20

Discovery of cellular gene functions using viral genomes

Format: Pre-recorded with live Q&A

Moderator(s): Iddo Friedberg

Dustin Hancks, UT Southwestern Medical Center, United States
Sruthi Chappidi, UT Southwestern Medical Center, United States
Mahsa Sorouri, UT Southwestern Medical Center, United States

13:20-13:40

dbCAN-PUL: a database of experimentally characterized CAZyme gene clusters and their substrates

Format: Pre-recorded with live Q&A

Moderator(s): Iddo Friedberg

Yanbin Yin, University of Nebraska - Lincoln, United States
Catie Ausland, Northern Illinois University, United States
Jinfang Zheng, University of Nebraska - Lincoln, United States

Presentation Overview:

PULs (polysaccharide utilization loci) are discrete gene clusters of CAZymes (Carbohydrate Active EnZymes) and other genes that work together to digest and utilize carbohydrate substrates. While PULs have been extensively characterized in Bacteroidetes, there exist PULs from other bacterial phyla, as well as archaea and metagenomes, that remain to be catalogued in a database for efficient retrieval. We have developed an online database, dbCAN-PUL (http://bcb.unl.edu/dbCAN_PUL/), to display experimentally verified CAZyme-containing PULs from the literature with pertinent metadata, sequences, and annotation. Compared to other online CAZyme and PUL resources, dbCAN-PUL has the following new features: (i) batch download of PUL data by target substrate, species/genome, genus, or experimental characterization method; (ii) annotation for each PUL that displays associated metadata such as substrate(s), experimental characterization method(s), and protein sequence information; (iii) links to external annotation pages for CAZymes (CAZy), transporters (UniProt), and other genes; (iv) display of homologous gene clusters in GenBank sequences via the integrated MultiGeneBlast tool; and (v) an integrated BLASTX service for users to query their sequences against PUL proteins in dbCAN-PUL. With these features, dbCAN-PUL will be an important repository for CAZyme and PUL research, complementing our other web servers and databases (dbCAN2, dbCAN-seq). We have further shown that PULs targeting the same or similar substrates tend to have similar gene composition (i.e., protein family/domain combinations). Therefore, the PUL-substrate associations in dbCAN-PUL can be used to classify computer-predicted CAZyme gene clusters (CGCs) into substrate groups (e.g., xylan, pectin, starch).
This will allow the prediction of the glycan substrates of CGCs given sequenced microbiome samples and contribute to addressing two fundamental personalized nutrition questions: (i) Is a gut microbe able to use a specific type of glycan? (ii) Can a person carrying certain gut microbes respond to an individualized diet?

Paper published at https://doi.org/10.1093/nar/gkaa742

14:20-14:30

Introduction to BOSC/Function joint session

Format: Live-stream

Moderator(s): Iddo Friedberg

Iddo Friedberg

14:30-14:40

Completing the functional human proteome together!

Format: Pre-recorded with live Q&A

Moderator(s): Iddo Friedberg

Monique Zahn, SIB Swiss Institute of Bioinformatics, Switzerland
Paula Duek, University of Geneva and SIB Swiss Institute of Bioinformatics, Switzerland
Camille Mary, University of Geneva, Switzerland
Amos Bairoch, University of Geneva and SIB Swiss Institute of Bioinformatics, Switzerland
Lydie Lane, University of Geneva and SIB Swiss Institute of Bioinformatics, Switzerland

14:40-15:20

BOSC/Function Keynote: Open approaches to advance data-intensive biomedicine

Format: Live-stream

Moderator(s): Iddo Friedberg

Lara Mangravite, Sage Bionetworks, United States
