CUDA Toolkit

The NVIDIA® CUDA® Toolkit provides a comprehensive development environment for C and C++ developers building GPU-accelerated applications. The CUDA Toolkit includes a compiler for NVIDIA GPUs, math libraries, and tools for debugging and optimizing the performance of your applications. You’ll also find programming guides, user manuals, API references, and other documentation to help you quickly get started accelerating your applications with GPUs.

cublasXT is a new BLAS GPU library that automatically scales performance across up to 8 GPUs in a single node and supports larger workloads. The redesigned FFT GPU library scales up to 2 GPUs in a single node, allowing larger transform sizes and higher throughput.

If you develop applications in languages other than C or C++, please review the Getting Started page for a language solution that meets your needs. The CUDA Toolkit complements and fully supports programming with OpenACC directives.