Leveraging machine learning to predict antimicrobial resistance in ESKAPE pathogens
Confirmed Presenter: Ethan Wolfe, Department of Pathobiology and Diagnostic Investigation, Michigan State University, East Lansing, MI, USA, United States
Room: 518
Format: In Person
Moderator(s): Edward Braun
Authors List: Show
- Jacob Krol, Department of Biomedical Informatics, University of Colorado Anschutz Medical Campus, Aurora, CO, USA, United States
- Ethan Wolfe, Department of Pathobiology and Diagnostic Investigation, Michigan State University, East Lansing, MI, USA, United States
- Evan Brenner, Department of Biomedical Informatics, University of Colorado Anschutz Medical Campus, Aurora, CO, USA, United States
- Keenan Manpearl, Department of Biomedical Informatics, University of Colorado Anschutz Medical Campus, Aurora, CO, USA, United States
- Joseph Burke, Department of Pathobiology and Diagnostic Investigation, Michigan State University, East Lansing, MI, USA, United States
- Charmie Vang, Department of Biomedical Informatics, University of Colorado Anschutz Medical Campus, Aurora, CO, USA, United States
- Vignesh Sridhar, Department of Pathobiology and Diagnostic Investigation, Michigan State University, East Lansing, MI, USA, United States
- Jill Bilodeaux, Department of Biomedical Informatics, University of Colorado Anschutz Medical Campus, Aurora, CO, USA, United States
- Karn Jongnarangsin, Department of Pathobiology and Diagnostic Investigation, Michigan State University, East Lansing, MI, USA, United States
- Elliot Majlessi, Department of Pathobiology and Diagnostic Investigation, Michigan State University, East Lansing, MI, USA, United States
- Janani Ravi, Department of Biomedical Informatics, University of Colorado Anschutz Medical Campus, Aurora, CO, USA, United States
Presentation Overview: Show
Since the clinical introduction of antibiotics in the 1940s, antimicrobial resistance (AMR) has become an increasingly dire threat to global public health. Pathogens acquire AMR much faster than we discover new drugs, warranting new methods to better understand the molecular underpinnings of AMR. Traditional approaches for detecting AMR in novel bacterial strains require time-consuming, labor-intensive assays. Here, we introduce a machine learning approach to identify AMR-associated features. We focus on six highly drug-resistant bacterial pathogens responsible for most nosocomial infections: the “ESKAPE” pathogens. We use all NCBI-PGAP-annotated ESKAPE genomes with known AMR phenotype data from the Bacterial and Viral Bioinformatics Resource Center (BV-BRC). Then, for all complete and WGS genomes for each ESKAPE species, we cluster similar genes and construct pangenomes with Panaroo. To uncover the molecular mechanisms behind drug-/drug family-specific resistance and cross-resistance, we train logistic regression and random forest models on our pangenomes, which include antibiotic resistance/susceptibility labels per genome. The models are tested rigorously to yield ranked lists of AMR-associated genes and protein domains. In addition to recapitulating known AMR genes, our models have identified novel candidates for individual and cross-resistance mechanisms that await experimental validation. Our holistic approach promises thorough, reliable prediction of existing or developing resistance in newly identified pathogen genomes, along with mechanistic molecular contributors of resistance.
Predicting pathogen preferences and host adaptation by leveraging microbial genomics and machine learning
Confirmed Presenter: Evan Brenner, University of Colorado Anschutz Medical Campus, United States
Room: 518
Format: In Person
Moderator(s): Edward Braun
Authors List: Show
- Evan Brenner, University of Colorado Anschutz Medical Campus, United States
- Janani Ravi, University of Colorado Anschutz Medical Campus, United States
Presentation Overview: Show
Most emerging infectious diseases (EIDs) of humans originate in animals and are transmitted through zoonotic spillover events. However, the genetic determinants underlying host adaptation or host switching are often unclear. We hypothesize that genomic markers of pathogen adaptation to different hosts are detectable and can yield valuable insights into EID pathobiology. Utilizing publicly available databases, millions of bacterial and viral genomes with metadata, including their hosts of origin, are accessible for study. To leverage these, we are training machine learning models that associate pathogen genetic elements (e.g., genes, k-mers) with host labels. Our models are simple and interpretable (e.g., decision trees), run with reasonable computational requirements, and have been tested on a sampling of phylogenetically distinct bacterial and viral pathogens.
Our preliminary results have yielded high predictive performance for bacterial and viral pathogens, and top-ranked features in these models often pinpoint genomic elements that are 1) associated with horizontal gene transfer elements, and 2) demonstrated to play biologically relevant roles to host adaptation in prior literature. We will expand these models to new species, build more complex models that incorporate additional levels of genomic information (e.g., protein domains), and begin testing performance across species or genera rather than solely within. These advances offer promise in assessing threats to different host populations posed by new EIDs.