日照网站开发建设seo网站自动发布外链工具
很多炼丹师不知道自己英伟达显卡支持哪些精度模式,本文整理了NVIDIA官网的数据,为你解开疑惑。
1. 首先了解CUDA计算能力及其支持的精度模式;
2. 查看自己显卡(或其它NVIDIA硬件)的计算能力值为多少。
表1 CUDA计算能力及其支持的精度模式
CUDA Compute Capability | TF32 | FP32 | FP16 | INT8 | FP16 Tensor Cores | INT8 Tensor Cores | DLA |
9 | Yes | Yes | Yes | Yes | Yes | Yes | No |
8.9 | Yes | Yes | Yes | Yes | Yes | Yes | No |
8.7 | Yes | Yes | Yes | Yes | Yes | Yes | Yes |
8.6 | Yes | Yes | Yes | Yes | Yes | Yes | No |
8 | Yes | Yes | Yes | Yes | Yes | Yes | No |
7.5 | No | Yes | Yes | Yes | Yes | Yes | No |
7.2 | No | Yes | Yes | Yes | Yes | Yes | Yes |
7 | No | Yes | Yes | Yes | Yes | No | No |
6.1 | No | Yes | Yes | Yes | No | No | No |
6 | No | Yes | Yes | No | No | No | No |
表2 NVIDIA 硬件(包含显卡、嵌入式板卡等)对应的计算能力
GPU | Compute Capability |
NVIDIA H100 | 9 |
NVIDIA L4 | 8.9 |
NVIDIA L40 | 8.9 |
RTX 6000 | 8.9 |
GeForce RTX 4090 | 8.9 |
GeForce RTX 4080 | 8.9 |
GeForce RTX 4070 Ti | 8.9 |
GeForce RTX 4070 | 8.9 |
GeForce RTX 4060 | 8.9 |
GeForce RTX 4050 | 8.9 |
Jetson AGX Orin | 8.7 |
Jetson Orin NX | 8.7 |
Jetson Orin Nano | 8.7 |
NVIDIA A40 | 8.6 |
NVIDIA A10 | 8.6 |
NVIDIA A16 | 8.6 |
NVIDIA A2 | 8.6 |
RTX A6000 | 8.6 |
RTX A5000 | 8.6 |
RTX A4000 | 8.6 |
RTX A3000 | 8.6 |
RTX A2000 | 8.6 |
GeForce RTX 3090 Ti | 8.6 |
GeForce RTX 3090 | 8.6 |
GeForce RTX 3080 Ti | 8.6 |
GeForce RTX 3080 | 8.6 |
GeForce RTX 3070 Ti | 8.6 |
GeForce RTX 3070 | 8.6 |
Geforce RTX 3060 Ti | 8.6 |
Geforce RTX 3060 | 8.6 |
GeForce RTX 3050 Ti | 8.6 |
GeForce RTX 3050 | 8.6 |
NVIDIA A100 | 8 |
NVIDIA A30 | 8 |
NVIDIA T4 | 7.5 |
Quadro RTX 8000 | 7.5 |
Quadro RTX 6000 | 7.5 |
Quadro RTX 5000 | 7.5 |
Quadro RTX 4000 | 7.5 |
RTX 5000 | 7.5 |
RTX 4000 | 7.5 |
RTX 3000 | 7.5 |
T2000 | 7.5 |
T1200 | 7.5 |
T1000 | 7.5 |
T600 | 7.5 |
T500 | 7.5 |
T400 | 7.5 |
GeForce GTX 1650 Ti | 7.5 |
NVIDIA TITAN RTX | 7.5 |
Geforce RTX 2080 Ti | 7.5 |
Geforce RTX 2080 | 7.5 |
Geforce RTX 2070 | 7.5 |
Geforce RTX 2060 | 7.5 |
Jetson AGX Xavier | 7.2 |
Jetson Xavier NX | 7.2 |
NVIDIA V100 | 7 |
Quadro GV100 | 7 |
NVIDIA TITAN V | 7 |
Jetson TX2 | 6.2 |
Tesla P40 | 6.1 |
Tesla P4 | 6.1 |
Quadro P6000 | 6.1 |
Quadro P5200 | 6.1 |
Quadro P5000 | 6.1 |
Quadro P4200 | 6.1 |
Quadro P4000 | 6.1 |
Quadro P3200 | 6.1 |
Quadro P3000 | 6.1 |
Quadro P2200 | 6.1 |
Quadro P2000 | 6.1 |
Quadro P1000 | 6.1 |
Quadro P620 | 6.1 |
Quadro P600 | 6.1 |
Quadro P500 | 6.1 |
Quadro P400 | 6.1 |
P620 | 6.1 |
P520 | 6.1 |
NVIDIA TITAN Xp | 6.1 |
NVIDIA TITAN X | 6.1 |
GeForce GTX 1080 Ti | 6.1 |
GeForce GTX 1080 | 6.1 |
GeForce GTX 1070 Ti | 6.1 |
GeForce GTX 1070 | 6.1 |
GeForce GTX 1060 | 6.1 |
GeForce GTX 1050 | 6.1 |
Tesla P100 | 6 |
Quadro GP100 | 6 |
Jetson Nano | 5.3 |
通过以上两表,可了解每个硬件支持的精度模式。
参考:
Support Matrix :: NVIDIA Deep Learning TensorRT Documentation
CUDA GPUs - Compute Capability | NVIDIA Developer