A Semi-Supervised Approach to GRN Inference Using Learning and Optimization

A Semi-Supervised Approach to GRN Inference Using Learning and Optimization

Meroua Daoudi, Souham Meshoul, Samia Boucherkha
Copyright: © 2021 |Volume: 12 |Issue: 4 |Pages: 22
ISSN: 1947-8283|EISSN: 1947-8291|EISBN13: 9781799861157|DOI: 10.4018/IJAMC.2021100109
Cite Article Cite Article

MLA

Daoudi, Meroua, et al. "A Semi-Supervised Approach to GRN Inference Using Learning and Optimization." IJAMC vol.12, no.4 2021: pp.155-176. http://doi.org/10.4018/IJAMC.2021100109

APA

Daoudi, M., Meshoul, S., & Boucherkha, S. (2021). A Semi-Supervised Approach to GRN Inference Using Learning and Optimization. International Journal of Applied Metaheuristic Computing (IJAMC), 12(4), 155-176. http://doi.org/10.4018/IJAMC.2021100109

Chicago

Daoudi, Meroua, Souham Meshoul, and Samia Boucherkha. "A Semi-Supervised Approach to GRN Inference Using Learning and Optimization," International Journal of Applied Metaheuristic Computing (IJAMC) 12, no.4: 155-176. http://doi.org/10.4018/IJAMC.2021100109

Export Reference

Mendeley
Favorite Full-Issue Download

Abstract

Gene regulatory network (GRN) inference is a challenging problem that lends itself to a learning task. Both positive and negative examples are needed to perform supervised and semi-supervised learning. However, GRN datasets include only positive examples and/or unlabeled ones. Recently a growing interest is being devoted to the generation of negative examples from unlabeled data. Within this context, the authors propose to generate potential negative examples from the set of unlabeled ones and keep those that lead to the best classification accuracy when used with positive examples. A new proposed genetic algorithm for fixed-size subset selection has been combined with a support vector machine model for this purpose. The authors assessed the performance of the proposed approach using simulated and experimental datasets. Using simulated datasets, the proposed approach outperforms the other methods in most cases and improves the performance metrics when using balanced data. Experimental datasets show that the proposed approach allows finding the optimal solution for each transcription factor in this study.

Request Access

You do not own this content. Please login to recommend this title to your institution's librarian or purchase it from the IGI Global bookstore.