Parallel Interior-Point Solver for Block-Structured Nonlinear Programs on SIMD/GPU Architectures

François Pacaud; Michel Schanen; Sungho Shin; Daniel Adrian Maldonado; Mihai Anitescu

Pré-Publication, Document De Travail Année : 2023

Parallel Interior-Point Solver for Block-Structured Nonlinear Programs on SIMD/GPU Architectures

(1) , (2) , (2) , (2) , (2)

1
2

François Pacaud

Fonction : Auteur
PersonId : 1247781
IdHAL : francois-pacaud
ORCID : 0000-0002-9009-6573

Centre Automatique et Systèmes

Michel Schanen

Fonction : Auteur

Argonne National Laboratory [Lemont]

Sungho Shin

Fonction : Auteur

Argonne National Laboratory [Lemont]

Daniel Adrian Maldonado

Fonction : Auteur

Argonne National Laboratory [Lemont]

Mihai Anitescu

Fonction : Auteur

Argonne National Laboratory [Lemont]

Résumé

We investigate how to port the standard interior-point method to new exascale architectures for block-structured nonlinear programs with state equations. Computationally, we decompose the interior-point algorithm into two successive operations: the evaluation of the derivatives and the solution of the associated Karush-Kuhn-Tucker (KKT) linear system. Our method accelerates both operations using two levels of parallelism. First, we distribute the computations on multiple processes using coarse parallelism. Second, each process uses a SIMD/GPU accelerator locally to accelerate the operations using fine-grained parallelism. The KKT system is reduced by eliminating the inequalities and the state variables from the corresponding equations, to a dense matrix encoding the sensitivities of the problem's degrees of freedom, drastically minimizing the memory exchange. We demonstrate the method's capability on the supercomputer Polaris, a testbed for the future exascale Aurora system. Each node is equipped with four GPUs, a setup amenable to our two-level approach. Our experiments on the stochastic optimal power flow problem show that the method can achieve a 50x speed-up compared to the state-of-the-art method.

Domaines

Optimisation et contrôle [math.OC]

Fichier principal

main.pdf (532.72 Ko)

figures/callbacks_timings.pdf (15.86 Ko)

figures/gpu.png (11.49 Ko)

figures/kkt_timings.pdf (21.41 Ko)

figures/optim_8scens.pdf (18.92 Ko)

figures/optim_large_scale.pdf (16.01 Ko)

figures/optim_timings.pdf (16.3 Ko)

figures/reduction.png (192 Ko)

Origine : Fichiers produits par l'(les) auteur(s)

François Pacaud : Connectez-vous pour contacter le contributeur

https://minesparis-psl.hal.science/hal-04080717

Soumis le : mardi 25 avril 2023-10:53:53

Dernière modification le : samedi 1 juillet 2023-05:09:48

Dates et versions

hal-04080717 , version 1 (25-04-2023)

Identifiants

HAL Id : hal-04080717 , version 1

Citer

François Pacaud, Michel Schanen, Sungho Shin, Daniel Adrian Maldonado, Mihai Anitescu. Parallel Interior-Point Solver for Block-Structured Nonlinear Programs on SIMD/GPU Architectures. 2023. ⟨hal-04080717⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

ENSMP INSMI ENSMP_CAS PARISTECH TDS-MACS PSL ENSMP_DR

29 Consultations

101 Téléchargements

Parallel Interior-Point Solver for Block-Structured Nonlinear Programs on SIMD/GPU Architectures

Résumé

Domaines

Dates et versions

Identifiants

Citer

Exporter

Collections

Partager