Loopy’s core idea is that a computation should be described simply and then
transformed into a version that gets high performance. This transformation
takes place under user control, from within Python.

It can capture the following types of optimizations:

Vector and multi-core parallelism in the OpenCL/CUDA model

Data layout transformations (structure of arrays to array of structures)