What is Tesla V100?
V100 is the most advanced data center GPU ever built. It accelerates AI, high-performance computing (HPC), data science, and graphics. It is powered by the NVIDIA Volta architecture, comes in 16 GB and 32 GB configurations, and offers the performance of up to 100 CPUs in a single GPU. Data scientists, researchers, and engineers can now spend less time optimizing memory usage and more time developing breakthrough AI innovations.
What is Tesla P100?
NVIDIA Tesla P100 is powered by the Pascal architecture. Tesla P100-based servers are well suited to 3D modeling and deep learning workloads.
Tesla P100 GPU accelerators, built on the breakthrough NVIDIA Pascal™ architecture, boost throughput and reduce computational cost for high-performance computing.
Fundamental & Architectural Differences
| Tesla Product | Tesla V100 | Tesla P100 |
| --- | --- | --- |
| Architecture | Volta | Pascal |
| Code name | GV100 | GP100 |
| Release year | 2017 | 2016 |
| Cores / GPU | 5120 | 3584 |
| GPU boost clock | 1530 MHz | 1480 MHz |
| Tensor Cores / GPU | 640 | N/A |
| Memory type | HBM2 | HBM2 |
| Maximum RAM amount | 32 GB | 16 GB |
| Memory clock speed | 1758 MHz | 1430 MHz |
| Memory bandwidth | 900.1 GB/s | 720.9 GB/s |
| CUDA support | From 7.0 | From 6.0 |
| Floating-point performance | 14,029 GFLOPS | 10,609 GFLOPS |
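(The 15.7 TFLOPS single-precision and 7.8 TFLOPS double-precision figures quoted in the sections below follow roughly from cores × 2 FLOPs per clock × boost clock: 5120 × 2 × 1.53 GHz ≈ 15.7 TFLOPS, with double precision running at half the single-precision rate on GV100.)

If you need to confirm which of these parts a given server actually exposes, the compute capability row can be checked at runtime with the standard CUDA runtime API. The snippet below is a minimal sketch (file name and build line are assumptions): it prints the compute capability (7.0 for Volta/V100, 6.0 for Pascal/P100), memory size, and SM count of each visible GPU.

```cuda
// device_query.cu -- minimal sketch; build (assumption): nvcc device_query.cu
#include <cstdio>
#include <cuda_runtime.h>

int main() {
    int deviceCount = 0;
    cudaGetDeviceCount(&deviceCount);

    for (int dev = 0; dev < deviceCount; ++dev) {
        cudaDeviceProp prop;
        cudaGetDeviceProperties(&prop, dev);

        // Compute capability 7.0 corresponds to Volta (V100), 6.0 to Pascal (P100).
        printf("Device %d: %s\n", dev, prop.name);
        printf("  Compute capability : %d.%d\n", prop.major, prop.minor);
        printf("  Global memory      : %.1f GB\n", prop.totalGlobalMem / 1e9);
        printf("  Multiprocessors    : %d\n", prop.multiProcessorCount);
        printf("  Memory clock       : %.0f MHz\n", prop.memoryClockRate / 1000.0);
        printf("  Memory bus width   : %d-bit\n", prop.memoryBusWidth);
    }
    return 0;
}
```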
Which GPU should be used when?
Tesla V100 FOR DEEP LEARNING TRAINING:
- Caffe, TensorFlow, and CNTK are up to 3x faster with Tesla V100 than with Tesla P100
- Almost all of the top deep learning frameworks are GPU-accelerated
- Up to 125 TFLOPS of Tensor operations per GPU (see the mixed-precision sketch after this list)
- Up to 32 GB of memory capacity per GPU
- Up to 900 GB/s memory bandwidth per GPU
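The Tensor Core figure above is reached through mixed-precision math (FP16 multiply, FP32 accumulate), which deep learning frameworks invoke through cuDNN and cuBLAS rather than anything you normally write by hand. Purely as an illustration of that pattern, here is a minimal cuBLAS sketch; file name, matrix size, and build line are assumptions, and on CUDA 9/10 toolkits the compute-type constant is spelled CUDA_R_32F rather than CUBLAS_COMPUTE_32F.

```cuda
// tensor_gemm.cu -- FP16 inputs, FP32 accumulation: eligible for Tensor Cores
// on compute capability 7.0+ (V100); on a P100 the same call falls back to
// ordinary CUDA-core math. Build (assumption): nvcc -arch=sm_70 tensor_gemm.cu -lcublas
#include <cstdio>
#include <vector>
#include <cuda_runtime.h>
#include <cuda_fp16.h>
#include <cublas_v2.h>

int main() {
    const int n = 1024;                                  // square n x n matrices
    std::vector<__half> hA(n * n), hB(n * n);
    std::vector<float>  hC(n * n, 0.0f);
    for (int i = 0; i < n * n; ++i) {
        hA[i] = __float2half(1.0f);
        hB[i] = __float2half(2.0f);
    }

    __half *dA, *dB; float *dC;
    cudaMalloc(&dA, n * n * sizeof(__half));
    cudaMalloc(&dB, n * n * sizeof(__half));
    cudaMalloc(&dC, n * n * sizeof(float));
    cudaMemcpy(dA, hA.data(), n * n * sizeof(__half), cudaMemcpyHostToDevice);
    cudaMemcpy(dB, hB.data(), n * n * sizeof(__half), cudaMemcpyHostToDevice);
    cudaMemcpy(dC, hC.data(), n * n * sizeof(float), cudaMemcpyHostToDevice);

    cublasHandle_t handle;
    cublasCreate(&handle);

    const float alpha = 1.0f, beta = 0.0f;
    // C = alpha * A * B + beta * C with FP16 A/B, FP32 C and FP32 accumulation.
    // CUBLAS_COMPUTE_32F is the CUDA 11+ name; older toolkits used CUDA_R_32F here.
    cublasGemmEx(handle, CUBLAS_OP_N, CUBLAS_OP_N,
                 n, n, n,
                 &alpha,
                 dA, CUDA_R_16F, n,
                 dB, CUDA_R_16F, n,
                 &beta,
                 dC, CUDA_R_32F, n,
                 CUBLAS_COMPUTE_32F,
                 CUBLAS_GEMM_DEFAULT_TENSOR_OP);
    cudaDeviceSynchronize();

    cudaMemcpy(hC.data(), dC, n * n * sizeof(float), cudaMemcpyDeviceToHost);
    printf("C[0] = %f (expected %f)\n", hC[0], 2.0f * n);  // all-ones x all-twos

    cublasDestroy(handle);
    cudaFree(dA); cudaFree(dB); cudaFree(dC);
    return 0;
}
```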
Tesla V100 FOR MOLECULAR DYNAMICS:
- Servers with V100 replace up to 202 CPU servers for applications such as Amber and HOOMD-blue
- 100% of the top MD applications are GPU-accelerated
- Key math libraries such as FFT and BLAS are GPU-accelerated (see the cuFFT sketch after this list)
- Up to 15.7 TFLOPS of single-precision performance per GPU
- Up to 32 GB of memory capacity per GPU
- Up to 900 GB/s of memory bandwidth per GPU
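"GPU-accelerated FFT and BLAS" in practice means NVIDIA's cuFFT and cuBLAS libraries, which MD and geoscience codes lean on for their heavy math. As a small, self-contained illustration (transform size and build line are assumptions, not anything specific to an MD package), here is a 1-D cuFFT transform:

```cuda
// fft_demo.cu -- minimal cuFFT sketch; build (assumption): nvcc fft_demo.cu -lcufft
#include <cstdio>
#include <vector>
#include <cuda_runtime.h>
#include <cufft.h>

int main() {
    const int N = 1 << 20;                      // 1M-point complex transform
    std::vector<cufftComplex> h(N);
    for (int i = 0; i < N; ++i) { h[i].x = 1.0f; h[i].y = 0.0f; }

    cufftComplex* d;
    cudaMalloc(&d, N * sizeof(cufftComplex));
    cudaMemcpy(d, h.data(), N * sizeof(cufftComplex), cudaMemcpyHostToDevice);

    cufftHandle plan;
    cufftPlan1d(&plan, N, CUFFT_C2C, 1);        // single 1-D complex-to-complex plan
    cufftExecC2C(plan, d, d, CUFFT_FORWARD);    // in-place forward transform on the GPU
    cudaDeviceSynchronize();

    cudaMemcpy(h.data(), d, N * sizeof(cufftComplex), cudaMemcpyDeviceToHost);
    printf("DC bin = %.0f (expected %d)\n", h[0].x, N);   // FFT of all-ones

    cufftDestroy(plan);
    cudaFree(d);
    return 0;
}
```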
Tesla V100 FOR ENGINEERING:
- Servers with Tesla V100 replace up to 10 CPU servers for applications such as SIMULIA Abaqus and ANSYS FLUENT
- The top engineering applications are GPU-accelerated
- Up to 7.8 TFLOPS of double-precision floating point performance per GPU
- Up to 32 GB of memory capacity per GPU
- Up to 900 GB/s memory bandwidth per GPU
Tesla V100 FOR GEOSCIENCE:
- Servers with V100 replace up to 54 CPU servers for applications such as RTM and SPECFEM3D
- Top Geoscience applications are GPU-accelerated
- Up to 15.7 TFLOPS of single-precision floating-point performance
- Up to 32 GB of memory capacity per GPU
- Up to 900 GB/s memory bandwidth per GPU
Tesla V100 FOR BENCHMARKING:
- Servers with Tesla V100 replace up to 23 CPU servers for benchmarks such as Cloverleaf, MiniFE, Linpack, and HPCG
- The top benchmarks are GPU-accelerated
- Up to 7.8 TFLOPS of double-precision floating point performance per GPU
- Up to 32 GB of memory capacity per GPU
- Up to 900 GB/s memory bandwidth per GPU
Tesla P100 can be used for all of the above applications as well, but it is packed with fewer resources than Tesla V100. For example, in molecular dynamics, V100 replaces up to 202 CPU servers, while P100 replaces only up to 40.
The P100's memory bandwidth is capped at 732 GB/s across all applications, which is low compared to the V100's 900 GB/s.
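If you want to see how close a given card gets to those peak figures, a timed device-to-device copy gives a rough estimate. The sketch below is an illustration only (buffer size and repetition count are arbitrary assumptions), not a rigorous benchmark like the bandwidthTest CUDA sample; measured numbers always land somewhat below the theoretical 900 GB/s and 732 GB/s peaks.

```cuda
// bandwidth_probe.cu -- rough effective-bandwidth estimate via device-to-device copies
#include <cstdio>
#include <cuda_runtime.h>

int main() {
    const size_t bytes = 256ull << 20;          // 256 MiB per buffer (assumption)
    const int reps = 20;

    char *src, *dst;
    cudaMalloc(&src, bytes);
    cudaMalloc(&dst, bytes);
    cudaMemset(src, 1, bytes);

    cudaEvent_t start, stop;
    cudaEventCreate(&start);
    cudaEventCreate(&stop);

    cudaEventRecord(start);
    for (int i = 0; i < reps; ++i)
        cudaMemcpy(dst, src, bytes, cudaMemcpyDeviceToDevice);
    cudaEventRecord(stop);
    cudaEventSynchronize(stop);

    float ms = 0.0f;
    cudaEventElapsedTime(&ms, start, stop);

    // Each copy reads and writes the buffer once, so count the bytes twice.
    double gbps = (2.0 * bytes * reps) / (ms / 1e3) / 1e9;
    printf("Effective device-memory bandwidth: %.1f GB/s\n", gbps);

    cudaEventDestroy(start);
    cudaEventDestroy(stop);
    cudaFree(src);
    cudaFree(dst);
    return 0;
}
```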
Cost & Strategic Advantages
Advantages Of NVIDIA Tesla V100:
- 640 Tensor Cores
- Single and double precision calculations can be carried out
- Suitable for a variety of scientific fields (financial calculations, climate and weather forecasting, CFD modeling, data analysis, etc.)
- Tesla V100 delivers superior absolute performance & great price/performance gains for AI
- If your application doesn't support multi-GPU, pick Tesla V100, the most powerful single GPU.
- If your budget covers a server with two Tesla P100s but not two Tesla V100s, and you need roughly 3x the performance of a P100, a single Tesla V100 is the better pick.
Advantages Of NVIDIA Tesla P100:
- Provides high performance
- Both single and double precision calculations can be carried out
- Suitable for a variety of scientific fields (financial calculations, climate and weather forecasting, CFD modeling, data analysis, etc.)
- Tesla P100 will still greatly outperform a CPU-only configuration
- If your application has shown diminishing returns from GPU performance improvements in the past, Tesla P100 may be the better price/performance choice
Conclusion
A critical question our customers ask is: which GPU should I choose? Which GPU cards can help me deliver results faster?
If you want maximum deep learning performance, Tesla V100 is a great choice. Its dedicated Tensor Cores offer huge performance potential for deep learning applications; NVIDIA has even coined a new term, the “TensorFLOP,” to measure this gain.
Tesla V100 is the fastest NVIDIA GPU available on the market, up to 3x faster than P100. If you primarily need a large amount of GPU memory for machine learning, either Tesla P100 or V100 will serve you well.