NVIDIA CUDA Toolkit versions

Provides a development environment for building high-performance GPU-accelerated applications
Dec 8, 2021
v9.0 [Dec 31, 2017]
v7.5 [Jun 22, 2015]
v7.0 [Mar 16, 2015]
v6.5 [Jul 24, 2014]
v6.0 [Mar 5, 2014]
v5.5 [Jun 2, 2013]
v5.0 [Sep 11, 2012]
v4.2 [Apr 21, 2012]
v4.1 [Dec 16, 2011]
v4.0 [Mar 29, 2011]
v3.2 [Nov 11, 2010]
v3.1 [Jun 27, 2010]

What's new

v4.0 [Mar 29, 2011]
Share GPUs across multiple threads
Use all GPUs in the system concurrently from a single host thread
No-copy pinning of system memory, a faster alternative to cudaMallocHost()
C++ new/delete and support for virtual functions
Support for inline PTX assembly
Thrust library of templated performance primitives such as sort, reduce, etc.
NVIDIA Performance Primitives (NPP) library for image/video processing
Layered Textures for working with same size/format textures at larger sizes and higher performance
Unified Virtual Addressing
GPUDirect v2.0 support for Peer-to-Peer Communication
Automated Performance Analysis in Visual Profiler
C++ debugging in CUDA-GDB for Linux and MacOS
GPU binary disassembler for Fermi architecture (cuobjdump)
Parallel Nsight 2.0 now available for Windows developers with new debugging and profiling features
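
As an illustration of the "no-copy pinning" item above, here is a minimal sketch that page-locks an existing host buffer in place with cudaHostRegister() instead of allocating a new one with cudaMallocHost(). The helper name scale_in_place_pinned and the scale kernel are hypothetical, chosen only for this example. The sketch assumes a reasonably recent toolkit; early releases required the registered pointer and size to be page-aligned, so treat this as a starting point rather than a definitive implementation.

```cuda
#include <cstddef>
#include <cuda_runtime.h>

// Multiply every element of `data` by `factor` on the GPU.
__global__ void scale(float *data, size_t n, float factor) {
    size_t i = blockIdx.x * (size_t)blockDim.x + threadIdx.x;
    if (i < n) data[i] *= factor;
}

// Hypothetical helper (not part of the toolkit): pins an existing host
// buffer in place, round-trips it through the GPU, then unpins it.
// Returns cudaSuccess or the first error encountered.
cudaError_t scale_in_place_pinned(float *host, size_t n, float factor) {
    if (n == 0) return cudaSuccess;  // nothing to do, avoid a 0-block launch

    size_t bytes = n * sizeof(float);
    float *dev = NULL;

    // Page-lock the caller's memory without copying it anywhere.
    cudaError_t err = cudaHostRegister(host, bytes, cudaHostRegisterDefault);
    if (err != cudaSuccess) return err;

    err = cudaMalloc((void **)&dev, bytes);
    if (err == cudaSuccess)
        err = cudaMemcpy(dev, host, bytes, cudaMemcpyHostToDevice);
    if (err == cudaSuccess) {
        unsigned threads = 256;
        unsigned blocks = (unsigned)((n + threads - 1) / threads);
        scale<<<blocks, threads>>>(dev, n, factor);
        err = cudaGetLastError();  // catches launch configuration errors
    }
    if (err == cudaSuccess)
        err = cudaMemcpy(host, dev, bytes, cudaMemcpyDeviceToHost);

    cudaFree(dev);              // safe even if dev is still NULL
    cudaHostUnregister(host);   // release the page lock
    return err;
}
```

Calling scale_in_place_pinned(data, n, 2.0f) on a machine with a CUDA-capable GPU should double each element of data in place. The buffer can come from malloc(), a std::vector, or a stack array, which is the point of registering memory rather than allocating it through the CUDA runtime.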

Alternative downloads

NVIDIA PhysX SDK

PhysX SDK is a proprietary, multi-platform, real-time physics engine middleware.

NVIDIA Design Garage
Free

Incredibly realistic images of the fastest and most exclusive vehicles on the road.

NVIDIA OpenGL SDK
Free

It teaches developers how to make the most of the latest GeForce GPUs.

NVIDIA® NVMixer
Free

Places all audio information and controls in one central application.

CUDAfy.NET

Lets you program NVIDIA CUDA GPUs and OpenCL devices from .NET.