LOTUS: A single- and multitask machine learning algorithm for the prediction of cancer driver genes

Abstract : Cancer driver genes, i.e., oncogenes and tumor suppressor genes, are involved in the acquisition of important functions in tumors, providing a selective growth advantage, allowing uncontrolled proliferation and avoiding apoptosis. It is therefore important to identify these driver genes, both for the fundamental understanding of cancer and to help finding new therapeutic targets or biomarkers. Although the most frequently mutated driver genes have been identified, it is believed that many more remain to be discovered, particularly for driver genes specific to some cancer types. In this paper, we propose a new computational method called LOTUS to predict new driver genes. LOTUS is a machine-learning based approach which allows to integrate various types of data in a versatile manner, including information about gene mutations and protein-protein interactions. In addition, LOTUS can predict cancer driver genes in a pan-cancer setting as well as for specific cancer types, using a multitask learning strategy to share information across cancer types. We empirically show that LOTUS outperforms five other state-of-the-art driver gene prediction methods, both in terms of intrinsic consistency and prediction accuracy, and provide predictions of new cancer genes across many cancer types.
Liste complète des métadonnées

Littérature citée [75 références]  Voir  Masquer  Télécharger

https://www.hal.inserm.fr/inserm-02325626
Contributeur : Myriam Bodescot <>
Soumis le : mardi 22 octobre 2019 - 12:04:41
Dernière modification le : jeudi 24 octobre 2019 - 01:38:08

Fichier

journal.pcbi.1007381.pdf
Publication financée par une institution

Identifiants

Citation

Olivier Collier, Véronique Stoven, Jean-Philippe Vert. LOTUS: A single- and multitask machine learning algorithm for the prediction of cancer driver genes. PLoS Computational Biology, Public Library of Science, 2019, 15 (9), pp.e1007381. ⟨10.1371/journal.pcbi.1007381⟩. ⟨inserm-02325626⟩

Partager

Métriques

Consultations de la notice

65

Téléchargements de fichiers

18