I haven't used MAGMA yet and would like to know whether MAGMA 1.1 supports out-of-core operations without my having to partition the input data myself. The input data is larger than the global memory available on the GPU. In that case, is it possible to simply allocate the memory on the CPU and let MAGMA figure out how to do the computation on the larger input?

The routines are magma_*getrf() for LU, magma_*potrf() for Cholesky, and magma_*geqrf() for QR. These automatically do out-of-core computation, so the regular testing_*trf sample code should work, given a large matrix.
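To get a rough idea of whether a given matrix is in the out-of-core regime at all, you can compare its storage size against the GPU memory. The sketch below is only an illustration: matrix_bytes() and needs_out_of_core() are hypothetical helper names, not MAGMA functions, and the estimate ignores any workspace the library allocates on the device.

```c
#include <assert.h>

/* Bytes needed to store an lda-by-n column-major matrix of doubles.
   Hypothetical helper for illustration; not part of MAGMA. */
unsigned long long matrix_bytes(unsigned long long lda, unsigned long long n)
{
    assert(lda > 0 && n > 0);
    return lda * n * sizeof(double);
}

/* Returns 1 if the matrix alone is larger than gpu_bytes of device memory,
   0 otherwise. Workspace needed by the library is not counted. */
int needs_out_of_core(unsigned long long lda, unsigned long long n,
                      unsigned long long gpu_bytes)
{
    return matrix_bytes(lda, n) > gpu_bytes;
}
```

For example, a 40000 x 40000 double-precision matrix takes 12.8 GB, so on a card with 6 GB of memory needs_out_of_core(40000, 40000, 6ULL << 30) returns 1, while a 10000 x 10000 matrix (0.8 GB) fits.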

An out-of-core triangular solve is not currently available in MAGMA, so you have to use LAPACK's getrs() or potrs() for the solve after the factorization. Similarly for QR, use LAPACK's unmqr or ungqr (ormqr or orgqr in real precision).
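To make the solve step concrete, here is a minimal in-core sketch of what a getrs-style solve does with the output of a getrf-style factorization: apply the row interchanges, then do a forward substitution with L and a back substitution with U. The name lu_solve() is hypothetical and this is not LAPACK's implementation. It assumes column-major storage, 1-based pivot indices as LAPACK returns them, and a single right-hand side. In practice you would call LAPACK's dgetrs() instead.

```c
#include <assert.h>
#include <stddef.h>

/* Solve A x = b given the LU factors of A (illustrative sketch only).
   lu   : n-by-n, column-major; unit lower-triangular L below the diagonal,
          U on and above the diagonal (the layout getrf produces)
   ipiv : 1-based pivots; row i was swapped with row ipiv[i] during factorization
   b    : right-hand side on entry, solution x on exit */
void lu_solve(int n, const double *lu, int lda, const int *ipiv, double *b)
{
    assert(n > 0 && lda >= n);

    /* Apply the row interchanges to b, in the order they were made. */
    for (int i = 0; i < n; ++i) {
        int p = ipiv[i] - 1;
        if (p != i) {
            double t = b[i];
            b[i] = b[p];
            b[p] = t;
        }
    }

    /* Forward substitution: solve L y = P b (L has an implicit unit diagonal). */
    for (int j = 0; j < n; ++j)
        for (int i = j + 1; i < n; ++i)
            b[i] -= lu[i + (size_t)j * lda] * b[j];

    /* Back substitution: solve U x = y. */
    for (int j = n - 1; j >= 0; --j) {
        b[j] /= lu[j + (size_t)j * lda];
        for (int i = 0; i < j; ++i)
            b[i] -= lu[i + (size_t)j * lda] * b[j];
    }
}
```

As a small check, for A = [4 3; 6 3] partial pivoting gives the factors {6, 2/3, 3, 1} in column-major order with ipiv = {2, 2}. Calling lu_solve() with b = {10, 12} overwrites b with approximately {1, 2}.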