Predicting Intentions of Pedestrians from 2D Skeletal Pose Sequences with a Representation-Focused Multi-Branch Deep Learning Network - Mines Paris Accéder directement au contenu
Article Dans Une Revue Algorithms Année : 2020

Predicting Intentions of Pedestrians from 2D Skeletal Pose Sequences with a Representation-Focused Multi-Branch Deep Learning Network

Joseph Gesnouin
  • Fonction : Auteur
  • PersonId : 1085650
Steve Pechberti
  • Fonction : Auteur
  • PersonId : 1085651
Guillaume Bresson
  • Fonction : Auteur
  • PersonId : 1085652
Bogdan Stanciulescu
Fabien Moutarde

Résumé

Understanding the behaviors and intentions of humans is still one of the main challenges for vehicle autonomy. More specifically, inferring the intentions and actions of vulnerable actors, namely pedestrians, in complex situations such as urban traffic scenes remains a difficult task and a blocking point towards more automated vehicles. Answering the question "Is the pedestrian going to cross?" is a good starting point in order to advance in the quest to the fifth level of autonomous driving. In this paper, we address the problem of real-time discrete intention prediction of pedestrians in urban traffic environments by linking the dynamics of a pedestrian's skeleton to an intention. Hence, we propose SPI-Net (Skeleton-based Pedestrian Intention network): a representation-focused multi-branch network combining features from 2D pedestrian body poses for the prediction of pedestrians' discrete intentions. Experimental results show that SPI-Net achieved 94.4% accuracy in pedestrian crossing prediction on the JAAD data set while being efficient for real-time scenarios since SPI-Net can reach around one inference every 0.25 ms on one GPU (i.e., RTX 2080ti), or every 0.67 ms on one CPU (i.e., Intel Core i7 8700K).
Fichier principal
Vignette du fichier
algorithms-13-00331-v2.pdf (5.52 Mo) Télécharger le fichier
Origine : Fichiers éditeurs autorisés sur une archive ouverte

Dates et versions

hal-03059635 , version 1 (12-12-2020)

Identifiants

Citer

Joseph Gesnouin, Steve Pechberti, Guillaume Bresson, Bogdan Stanciulescu, Fabien Moutarde. Predicting Intentions of Pedestrians from 2D Skeletal Pose Sequences with a Representation-Focused Multi-Branch Deep Learning Network. Algorithms, 2020, 13 (12), pp.331. ⟨10.3390/a13120331⟩. ⟨hal-03059635⟩
241 Consultations
236 Téléchargements

Altmetric

Partager

Gmail Facebook X LinkedIn More