Title: TriGORank: A Gene Ontology Enriched Learning-to-Rank Framework for Trigenic Fitness Prediction

Conference · 2021 IEEE International Conference on Bioinformatics and Biomedicine (BIBM)
 [1];  [2];  [1];  [1];  [1];  [1]
  1. Univ. of Illinois at Urbana-Champaign, IL (United States)
  2. Univ. of Illinois at Urbana-Champaign, IL (United States); Virginia Polytechnic Inst. and State Univ. (Virginia Tech), Blacksburg, VA (United States)

Machine learning (ML) has been gaining interest in the metabolic engineering community as a means to automate prediction tasks. In this work, we introduce and study the task of using ML to recommend high-fitness triplet mutants as candidates for wet-lab experiments. We first use individual fitness and digenic fitness scores as features and train ML models that produce a list of triplet gene mutants of S. cerevisiae, ranked from high to low fitness score. We then incorporate prior metabolic knowledge from an existing gene ontology by designing a novel graph representation and deriving features that capture gene similarity and gene interactions. Experimental results show that our proposed gene ontology enriched model, termed TriGORank, improves both performance and explainability.
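The first step described in the abstract (features built from individual and digenic fitness, then a ranked list of triplets) can be illustrated with a minimal sketch. Everything below is an assumption made for illustration: the gene names, the fitness values, the helper names (`triplet_features`, `score`, `rank_triplets`, `ndcg_at_k`), and the scoring rule, which is a simple mean standing in for a trained model. It is not the TriGORank implementation, and the gene ontology features are not covered. NDCG is included only as one common way to evaluate a ranked list.

```python
import math
from itertools import combinations

# Synthetic, made-up fitness values for four hypothetical genes (not real data).
single_fitness = {"g1": 0.9, "g2": 0.8, "g3": 1.0, "g4": 0.7}
digenic_fitness = {
    frozenset(pair): value
    for pair, value in [
        (("g1", "g2"), 0.75), (("g1", "g3"), 0.92), (("g1", "g4"), 0.60),
        (("g2", "g3"), 0.85), (("g2", "g4"), 0.50), (("g3", "g4"), 0.72),
    ]
}


def triplet_features(triplet):
    """Return 3 single-gene scores followed by 3 pairwise scores.

    Each group is sorted so the vector does not depend on gene order.
    """
    singles = sorted(single_fitness[g] for g in triplet)
    pairs = sorted(digenic_fitness[frozenset(p)] for p in combinations(triplet, 2))
    return singles + pairs


def score(features):
    """Hypothetical stand-in for a trained model: the mean of the features."""
    return sum(features) / len(features)


def rank_triplets(triplets):
    """Order candidate triplets from highest to lowest predicted score."""
    return sorted(triplets, key=lambda t: score(triplet_features(t)), reverse=True)


def dcg(relevances, k):
    """Discounted cumulative gain over the first k positions."""
    return sum(rel / math.log2(i + 2) for i, rel in enumerate(relevances[:k]))


def ndcg_at_k(ranked_relevances, k):
    """DCG of the given order divided by the DCG of the ideal order."""
    ideal = dcg(sorted(ranked_relevances, reverse=True), k)
    return dcg(ranked_relevances, k) / ideal if ideal > 0 else 0.0


if __name__ == "__main__":
    candidates = list(combinations(sorted(single_fitness), 3))
    ranked = rank_triplets(candidates)
    # Hypothetical measured triplet fitness, used only to evaluate the ranking.
    measured = {
        ("g1", "g2", "g3"): 0.80, ("g1", "g2", "g4"): 0.40,
        ("g1", "g3", "g4"): 0.65, ("g2", "g3", "g4"): 0.70,
    }
    print(ranked)
    print(ndcg_at_k([measured[t] for t in ranked], k=3))
```

In practice the `score` function would be replaced by a model fitted to measured triplet fitness, and the top of the ranked list would be the candidates proposed for wet-lab testing.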

Research Organization:
Univ. of Illinois at Urbana-Champaign, IL (United States)
Sponsoring Organization:
USDOE
DOE Contract Number:
SC0018420
OSTI ID:
1902720
Journal Information:
Conference: 2021 IEEE International Conference on Bioinformatics and Biomedicine (BIBM), Houston, TX (United States), 9-12 Dec 2021
Country of Publication:
United States
Language:
English
