Multi-dimensional arrays with broadcasting and lazy computing. https://github, 2017. ,
TensorFlow: Large- Scale Machine Learning on Heterogeneous Distributed Systems, 2015. ,
Opening polyhedral compiler's black box, Proceedings of the 2016 International Symposium on Code Generation and Optimization, CGO 2016, pp.128-138, 2016. ,
DOI : 10.1109/VLHCC.2014.6883031
Synthesis of High-Performance Parallel Programs for a Class of ab Initio Quantum Chemistry Models, Proc. IEEE 93, pp.276-292840311, 2004. ,
DOI : 10.1109/JPROC.2004.840311
Theano: a CPU and GPU Math Expression Compiler, Proceedings of the Python for Scientific Computing Conference, 2010. ,
CHiLL: A framework for composing high-level loop transformations, 2008. ,
In search of a program generator to implement generic transformations for high-performance computing, 013 Special Issue on the First MetaOCaml Workshop, pp.25-46, 2004. ,
DOI : 10.1016/j.scico.2005.10.013
URL : https://hal.archives-ouvertes.fr/hal-01257287
A Polyhedral Approach to Ease the Composition of Program Transformations, pp.292-303, 2004. ,
DOI : 10.1007/978-3-540-27866-5_38
URL : https://hal.archives-ouvertes.fr/hal-01257301
Facilitating the search for compositions of program transformations, Proceedings of the 19th annual international conference on Supercomputing , ICS '05, pp.151-160, 2005. ,
DOI : 10.1145/1088149.1088169
URL : https://hal.archives-ouvertes.fr/hal-01257296
High-Order Methods for Incompressible Fluid Flow Jörg Stiller, and Jochen Fröhlich. 2016. Fast Static Condensation for the Helmholtz Equation in a Spectral-Element Discretization, pp.371-380, 2002. ,
DOI : 10.1115/1.1566402
Factorizing the factorization ??? a spectral-element solver for elliptic equations with linear operation count, Journal of Computational Physics, vol.346, pp.437-448, 2017. ,
DOI : 10.1016/j.jcp.2017.06.012
Analysis and tuning of libtensor framework on multicore architectures, 21st International Conference on High Performance Computing, pp.1-10, 2014. ,
The tensor algebra compiler, Proceedings of the ACM on Programming Languages, vol.1, issue.OOPSLA, 2017. ,
DOI : 10.1145/113446.113449
Embedded Processor Design Challenges, pp.171-187, 2002. ,
SPIRAL: Code Generation for DSP Transforms, Proc. IEEE 93, pp.232-275840306, 2004. ,
DOI : 10.1109/JPROC.2004.840306
URL : http://spiral.ece.cmu.edu:8080/pub-spiral/pubfile/paper_1.pdf
Static Condensation, pp.47-70, 2004. ,
DOI : 10.1007/978-1-4471-3827-3_4
Halide: A Language and Compiler for Optimizing Parallelism, Locality, and Recomputation in Image Processing Pipelines, Proceedings of the 34th ACM SIGPLAN Conference on Programming Language Design and Implementation (PLDI '13, pp.519-530, 2013. ,
Firedrake, ACM Transactions on Mathematical Software, vol.43, issue.3, pp.24-27, 2016. ,
DOI : 10.1137/10081962X
A Programming Language Interface to Describe Transformations and Code Generation, pp.136-150, 2011. ,
DOI : 10.1145/1809028.1806606
HPTT: A Highperformance Tensor Transposition C++ Library, Proceedings of the 4th ACM SIGPLAN International Workshop on Libraries, Languages, and Compilers for Array Programming, pp.56-62, 2017. ,
DOI : 10.1145/3091966.3091968
LIFT: A functional data-parallel IR for high-performance GPU code generation, 2017 IEEE/ACM International Symposium on Code Generation and Optimization (CGO), pp.74-85, 2017. ,
DOI : 10.1109/CGO.2017.7863730
More Data Locality for Static Control Programs on NUMA Architectures, Proceedings of the 7th International Workshop on Polyhedral Compilation Techniques (IMPACT '17), 2017. ,
URL : https://hal.archives-ouvertes.fr/hal-01529354