. Xtensor, Multi-dimensional arrays with broadcasting and lazy computing. https://github, 2017.

M. Abadi and A. Agarwal, TensorFlow: Large- Scale Machine Learning on Heterogeneous Distributed Systems, 2015.

L. Bagnères, O. Zinenko, S. Huot, and C. Bastoul, Opening polyhedral compiler's black box, Proceedings of the 2016 International Symposium on Code Generation and Optimization, CGO 2016, pp.128-138, 2016.
DOI : 10.1109/VLHCC.2014.6883031

G. Baumgartner, A. Auer, D. E. Bernholdt, A. Bibireata, V. Choppella et al., Synthesis of High-Performance Parallel Programs for a Class of ab Initio Quantum Chemistry Models, Proc. IEEE 93, pp.276-292840311, 2004.
DOI : 10.1109/JPROC.2004.840311

J. Bergstra, O. Breuleux, F. Bastien, P. Lamblin, R. Pascanu et al., Theano: a CPU and GPU Math Expression Compiler, Proceedings of the Python for Scientific Computing Conference, 2010.

C. Chen, J. Chame, and M. Hall, CHiLL: A framework for composing high-level loop transformations, 2008.

A. Cohen, S. Donadio, M. Garzaran, C. Herrmann, O. Kiselyov et al., In search of a program generator to implement generic transformations for high-performance computing, 013 Special Issue on the First MetaOCaml Workshop, pp.25-46, 2004.
DOI : 10.1016/j.scico.2005.10.013

URL : https://hal.archives-ouvertes.fr/hal-01257287

A. Cohen, S. Girbal, and O. Temam, A Polyhedral Approach to Ease the Composition of Program Transformations, pp.292-303, 2004.
DOI : 10.1007/978-3-540-27866-5_38

URL : https://hal.archives-ouvertes.fr/hal-01257301

A. Cohen, M. Sigler, S. Girbal, O. Temam, D. Parello et al., Facilitating the search for compositions of program transformations, Proceedings of the 19th annual international conference on Supercomputing , ICS '05, pp.151-160, 2005.
DOI : 10.1145/1088149.1088169

URL : https://hal.archives-ouvertes.fr/hal-01257296

M. O. Deville, P. F. Fischer, and E. H. Mund, High-Order Methods for Incompressible Fluid Flow Jörg Stiller, and Jochen Fröhlich. 2016. Fast Static Condensation for the Helmholtz Equation in a Spectral-Element Discretization, pp.371-380, 2002.
DOI : 10.1115/1.1566402

I. Huismann, J. Stiller, and J. Fröhlich, Factorizing the factorization ??? a spectral-element solver for elliptic equations with linear operation count, Journal of Computational Physics, vol.346, pp.437-448, 2017.
DOI : 10.1016/j.jcp.2017.06.012

Z. Khaled, S. W. Ibrahim, E. Williams, A. I. Epifanovsky, and . Krylov, Analysis and tuning of libtensor framework on multicore architectures, 21st International Conference on High Performance Computing, pp.1-10, 2014.

F. Kjolstad, S. Kamil, S. Chou, D. Lugato, and S. Amarasinghe, The tensor algebra compiler, Proceedings of the ACM on Programming Languages, vol.1, issue.OOPSLA, 2017.
DOI : 10.1145/113446.113449

P. M. Knijnenburg, T. Kisuki, and M. F. O-'boyle, Embedded Processor Design Challenges, pp.171-187, 2002.

M. Puschel, J. M. Moura, J. R. Johnson, D. Padua, M. M. Veloso et al., SPIRAL: Code Generation for DSP Transforms, Proc. IEEE 93, pp.232-275840306, 2004.
DOI : 10.1109/JPROC.2004.840306

URL : http://spiral.ece.cmu.edu:8080/pub-spiral/pubfile/paper_1.pdf

Z. Qu, Static Condensation, pp.47-70, 2004.
DOI : 10.1007/978-1-4471-3827-3_4

J. Ragan-kelley, C. Barnes, A. Adams, S. Paris, F. Durand et al., Halide: A Language and Compiler for Optimizing Parallelism, Locality, and Recomputation in Image Processing Pipelines, Proceedings of the 34th ACM SIGPLAN Conference on Programming Language Design and Implementation (PLDI '13, pp.519-530, 2013.

F. Rathgeber, D. A. Ham, L. Mitchell, M. Lange, F. Luporini et al., Firedrake, ACM Transactions on Mathematical Software, vol.43, issue.3, pp.24-27, 2016.
DOI : 10.1137/10081962X

G. Rudy, M. M. Khan, M. Hall, C. Chen, and J. Chame, A Programming Language Interface to Describe Transformations and Code Generation, pp.136-150, 2011.
DOI : 10.1145/1809028.1806606

P. Springer, T. Su, and P. Bientinesi, HPTT: A Highperformance Tensor Transposition C++ Library, Proceedings of the 4th ACM SIGPLAN International Workshop on Libraries, Languages, and Compilers for Array Programming, pp.56-62, 2017.
DOI : 10.1145/3091966.3091968

M. Steuwer, T. Remmelg, and C. Dubach, LIFT: A functional data-parallel IR for high-performance GPU code generation, 2017 IEEE/ACM International Symposium on Code Generation and Optimization (CGO), pp.74-85, 2017.
DOI : 10.1109/CGO.2017.7863730

A. Susungi, A. Cohen, and C. Tadonki, More Data Locality for Static Control Programs on NUMA Architectures, Proceedings of the 7th International Workshop on Polyhedral Compilation Techniques (IMPACT '17), 2017.
URL : https://hal.archives-ouvertes.fr/hal-01529354