NVIDIA has announced their entire Quadro Pascal lineup today, at the SOLIDWORKS World annual trade show. The new lineup features a mix of Pascal based Quadro graphics cards along with the flagship Quadro GP100.
NVIDIA Announces Flagship Quadro GP100 Graphics Card With GP100 GPU and 16 GB HBM2 For Workstation Users
The Quadro GP100 is the closest thing we could get to a consumer-level graphics card with the GP100 GPU. It’s still cratering the compute heavy users and would come at a significant price over the GP102 GPU based Quadro P6000. The Quadro GP100 is also the first instance of many technologies coming close to the consumer front in the form of HBM2, NVLINK and over 20 TFLOPs of half-precision compute. The card will be aimed at users who demand the best performance in the most demanding rendering and GPGPU compute applications.
“Professional workflows are now infused with artificial intelligence, virtual reality and photorealism, creating new challenges for our most demanding users,” said Bob Pette, vice president of Professional Visualization at NVIDIA. “Our new Quadro lineup provides the graphics and compute performance required to address these challenges. And, by unifying compute and design, the Quadro GP100 transforms the average desktop workstation with the power of a supercomputer.” via NVIDIA
Benefits of Quadro Pascal Visual Computing Platform
The new generation of Quadro Pascal-based GPUs,the GP100, P4000, P2000, P1000, P600 and P400, enables millions of engineers, designers, researchers and artists to:
Unify simulation, HPC, rendering and design – The GP100 combines unprecedented double precision performance with 16GB of high-bandwidth memory (HBM2) so users can conduct simulations during the design process and gather realistic multiphysics simulations faster than ever before. Customers can combine two GP100 GPUs with NVLink technology and scale to 32GB of HBM2 to create a massive visual computing solution on a single workstation.
Explore deep learning – The GP100 provides more than 20 TFLOPS of 16-bit floating point precision computing, making it an ideal development platform to enable deep learning in Windows and Linux environments.
Incorporate VR into design and simulation workflows – The “VR Ready” Quadro GP100 and P4000 have the power to create detailed, lifelike, immersive environments. Larger, more complex designs can be experienced at scale.
Reap the benefits of photorealistic design – Pascal-based Quadro GPUs can render photorealistic images more than 18 times faster than a CPU.
Create expansive visual workspaces – Visualize data in high resolution and HDR color on up to four 5K displays.
Build massive digital signage configurations cost effectively – Up to 32 4K displays can be configured through a single chassis by combining up to eight P4000 GPUs and two Quadro Sync II cards.
The Mighty Pascal GP100 Specifications – The First and The Fastest FinFET GPU To Date
The GP100 GPU is composed of an array of Graphics Processing Clusters (GPCs), Streaming Multiprocessors (SMs), and memory controllers. The chip achieves its colossal throughput by providing six GPCs, up to 60 SMs, and eight 512-bit memory controllers (4096 bits total). The chip has been specifically designed for the compute market and it serves a great deal of performance in this sector.
The Pascal architecture’s computational prowess is more than just brute force: it increases performance not only by adding more SMs than previous GPUs, but by making each SM more efficient. Each SM has 64 CUDA cores and four texture units, for a total of 3840 CUDA cores and 240 texture units. These SM units have been arranged into 32 TPCs comprising of two SMs.
Because of the importance of high-precision computation for technical computing and HPC codes, a key design goal for Tesla P100 is high double-precision performance. Each GP100 SM has 32 FP64 units, providing a 2:1 ratio of single- to double-precision throughput. Compared to the 3:1 ratio in Kepler GK110 GPUs, this allows Tesla P100 to process FP64 workloads more efficiently. For more information and details on GP100 GPU, you can read our article here. You can also find performance benchmarks of GP100 GPU here.
The Quadro GP100 Can Combine Massive Memory Pools With NVLINK – Can Deliver Up To 32 GB HBM2 Based VRAM in Workloads
The GP100 GPU packs four stacks of HBM2 memory. The total VRAM featured on this chip is 16 GB which is currently the highest amount of HBM2 memory featured on a card. The GP100 GPU is the first chip to utilize the new HBM2 standard since its launch in Q2 2016. The chip features 720 GB/s bandwidth which is higher than the 432GB/s for Quadro P6000.
The most impressive thing for the Quadro GP100 is its new NVLINK interface. This is the first instance of the new technology on PCI-e graphics cards. Even the Tesla P100 PCI-e variants don’t feature this technology. It uses a new interface which is proprietorially developed by NVIDIA for their NVLINK enabled cards. Two large connectors can be found on the sides of the GPU’s PCB, longer in length than a regular SLI connector. These cards support 2-Way NVLINK connectivity which allows users to take advantage from memory pooling (16 GB Plus 16 GB) for a total of 32 GB HBM2 VRAM access and lower latency data transfer speeds.
In terms of display capabilities, the card features four DP 1.4 ports rated to operate at 4K @120 Hz and 5K @ 60 Hz. There’s also a single DVI port and of course, the card is VR Ready since its main focus is on tasks that are aimed at the VR market too.
NVIDIA Completes Pascal Quadro Lineup With 5 New Additions To The Family
NVIDIA has also announced five new products in the Quadro lineup. All new cards will utilize the latest Pascal architecture and replace older Maxwell based cards. They are based on a range of Pascal GPUs and will have various price and performance tiers.
Pascal Quadro P4000 Details and Specs
The Quadro P4000 replaces the older Quadro M4000 graphics card. It comes with the new 16nm GP104 GPU core that houses 1792 CUDA Cores and 8 GB of GDDR5 memory. The memory runs along a 256-bit memory bus interface and comes with 256 GB/s bandwidth. The total compute power for this card is rated at 5.3 TF (Teraflops) and comes in a single slot package. The TDP of the card is rated at 105W so power is fed through a 6-Pin connector. There are four DP 1.4 connectors on the card.
The card should theoretically deliver up to 2x the performance of its predecessor while consuming 15W less power than the Maxwell core.
Pascal Quadro P2000 Details and Specs
The Quadro P2000 is replacing the Quadro M2000 with the new GP106 graphics core. The card features 1024 CUDA Cores and 5 GB of GDDR5 memory. The card uses a 160-bit bus interface due to its unusual 5 Gigs of VRAM. There’s also 3 TFLOPs of computing power on this card and a rated TDP of 75W. Quadro P2000 also comes in the single-slot package with four DP1.4 connectors.
No power input is required since the card gets its power input directly from the PCI-e slot. Performance is rated up to 60% faster than its predecessor.
Pascal Quadro P1000, P600, P400 Details and Specs
Next up, we have the Quadro P1000, P600 and P400 products, all of which are based on the 14nm GP107 GPU core. The core configs for these variants are 640, 384 and 256 while the memory config is 4 GB, 2 GB and 2 GB GDDR5, respectively. The cards feature 1.8 TFLOPs, 1.1 TFLOPs and 0.6 TFLOPs of compute performance, respectively. All cards come in low profile form factors with TDP less than 50W. The Quadro P1000 and P600 feature four Mini DP1.4 connectors while the P400 features only three display ports.
Performance for these new products is rated at over 80% faster for the P1000 over its predecessor, over 40% faster for the P600 over its predecessor and 2x faster for the P400 over its predecessor. These incremental performance gains do bring a ton of productivity to workstation users. Specifications for the entire Pascal lineup can be seen below:
NVIDIA Pascal Quadro Family:
Graphics Card
NVIDIA Quadro P400
NVIDIA Quadro P600
NVIDIA Quadro P1000
NVIDIA Quadro P2000
NVIDIA Quadro P4000
NVIDIA Quadro P5000
NVIDIA Quadro P6000
NVIDIA Quadro GP100
Graphics Core
GP107
GP107
GP107
GP106
GP104
GP104
GP102
GP100
Process Node
14nm FinFET
14nm FinFET
14nm FinFET
16nm FinFET
16nm FinFET
16nm FinFET
16nm FinFET
16nm FinFET
Die Size
132mm2
132mm2
132mm2
200mm2
314mm2
314mm2
471mm2
610mm2
Transistors
3.3 Billion
3.3 Billion
3.3 Billion
4.4 Billion
7.2 Billion
7.2 Billion
12.0 Billion
15.3 Billion
CUDA Cores
256 CUDA
384 CUDA
640 CUDA
1024 CUDA
1792 CUDA
2560 CUDA
3840 CUDA
3584 CUDA
Base Clock
TBD
TBD
TBD
TBD
TBD
1620 MHz
1417 MHz
TBD
Boost Clock
1170 MHz
1430 MHz
1400 MHz
1470 MHz
1480 MHz
1733 MHz
1530 MHz
>1395 MHz
FP32 Compute
0.6 TFLOPs
1.1 TFLOPs
1.8 TFLOPs
3.0 TFLOPs
5.3 TFLOPs
9 TFLOPs
12 TFLOPs
10 TFLOPs
VRAM
2 GB GDDR5
2 GB GDDR5
4 GB GDDR5
5 GB GDDR5
8 GB GDDR5
16 GB GDDR5X
24 GB GDDR5X
16 GB HBM2
Bus Interface
128-bit
128-bit
128-bit
160-bit
256-bit
256-bit
384-bit
4096-bit
Bandwidth
TBD
TBD
TBD
TBD
256 GB/s
288 GB/s
432 GB/s
720 GB/s
Power Connector
None
None
None
None
Single 6-Pin
Single 8-Pin
Single 8-Pin
Single 8-Pin
TDP
30W
40W
47W
75W
105W
180W
250W
235W
Display Outputs
3x Mini Display Port 1.4
4x Mini Display Port 1.4
4x Mini Display Port 1.4
4x Display Port 1.4
4x Display Port 1.4
4x Display Port 1.4
1x DVI-D
4x Display Port 1.4
1x DVI-D
4x Display Port 1.4
1x DVI-D
Launch Date
March 2017
March 2017
March 2017
March 2017
March 2017
October 2016
October 2016
March 2017
Launch Price
TBD
TBD
TBD
TBD
TBD
$1900 US
$4800 US
TBD
Availability of Pascal Quadro Graphics Cards
The new Quadro products will be available starting in March from leading workstation OEMs, including Dell, HP, Lenovo and Fujitsu, and authorized distribution partners, including PNY Technologies in North America and Europe, ELSA/Ryoyo in Japan and Leadtek in Asia Pacific.
The post NVIDIA Quadro GP100 With 16 GB HBM2 and NVLINK Is The Ultimate Workstation Card – Full Pascal Quadro Lineup Announced by Hassan Mujtaba appeared first on Wccftech.