Efficient RNA Isoform Identification and Quantification from RNA-Seq Data with Network Flows - Mines Paris Accéder directement au contenu
Pré-Publication, Document De Travail Année : 2013

Efficient RNA Isoform Identification and Quantification from RNA-Seq Data with Network Flows

Résumé

Several state-of-the-art methods for isoform identification and quantification are based on sparse probabilistic models, such as Lasso regression. However, explicitly listing the -- possibly exponentially -- large set of candidate transcripts is intractable for genes with many exons. For this reason, existing approaches using sparse models are either restricted to genes with few exons, or only run the regression algorithm on a small set of pre-selected isoforms. We introduce in this paper a new technique, called FlipFlop, based on network flow optimization which can efficiently tackle the sparse estimation problem on the full set of candidate isoforms. By removing the need of preselection step, we obtain better isoform identification while keeping a low computational cost. Experiments with synthetic and real single-end RNA-Seq data confirm that our approach is more accurate than alternatives methods and one of the fastest available.
Fichier principal
Vignette du fichier
techreport.pdf (742.22 Ko) Télécharger le fichier
Origine : Fichiers produits par l'(les) auteur(s)

Dates et versions

hal-00803134 , version 1 (21-03-2013)
hal-00803134 , version 2 (10-09-2013)
hal-00803134 , version 3 (21-08-2014)

Identifiants

  • HAL Id : hal-00803134 , version 1

Citer

Elsa Bernard, Laurent Jacob, Julien Mairal, Jean-Philippe Vert. Efficient RNA Isoform Identification and Quantification from RNA-Seq Data with Network Flows. 2013. ⟨hal-00803134v1⟩
1874 Consultations
940 Téléchargements

Partager

Gmail Facebook X LinkedIn More