The cuBLAS binding provides an interface that accepts NumPy arrays and Numba’s
CUDA device arrays. The binding automatically transfers NumPy array arguments to
the device as required. This automatic transfer may generate some unnecessary
transfers, so optimal performance is likely to be obtained by the manual
transfer for NumPy arrays into device arrays and using the cuBLAS to manipulate
device arrays where possible.

No special naming convention is used to identify the data
type, unlike in the BLAS C and Fortran APIs. Arguments for array storage
information which are part of the cuBLAS C API are also not necessary since
NumPy arrays and device arrays contain this information.

All functions are accessed through the accelerate.cuda.blas.Blas class: