New features of the NVIDIA Turing GPU architecture have been revealed and detailed by the folks over at Videocardz. The new details show how the Turing GPUs are a huge departure from current GeForce graphics cards based on the Pascal GPU architecture and the techniques NVIDIA is using to deliver the best performance to end users and gamers.
NVIDIA Turing GPUs For GeForce RTX Graphics Cards Detailed - More Core Performance, Better Memory Compression, and New Features For Gamers
Starting with the most significant part of the Turing GPU architecture, the Turing SM, we are seeing an entirely new graphics core. The Turing SM is made up of a combination of INT32, FP32, and the new Tensor cores. Each SM has 96 KB of L1 cache which is shared across the entire GPU. There are four warp schedulers and dispatchers inside a Turing GPU and similarly, there are four register file units.
Coming to the new execution units or cores, Turing has both INT32 and FP32 units. Each SM has 64 each and 8 Tensor cores. This new architectural design allows Turing to execute floating point and non-floating point operations in parallel which allows for up to 36% higher throughput in standard floating point operations. The entire SM works in harmony by using different blocks to deliver high performance and better texture caching, enabling for up to 50% better CUDA core performance when compared to the previous generation.
Following is a shot of the Turing SM by Videocardz:
NVIDIA GeForce RTX/GTX "Turing" Family:
| Graphics Card Name | NVIDIA GeForce GTX 1650 | NVIDIA GeForce GTX 1650 D6 | NVIDIA GeForce GTX 1650 | NVIDIA GeForce GTX 1660 | NVIDIA GeForce GTX 1660 SUPER | NVIDIA GeForce GTX 1660 Ti | NVIDIA GeForce RTX 2060 | NVIDIA GeForce RTX 2070 | NVIDIA GeForce RTX 2080 | NVIDIA GeForce RTX 2080 Ti |
|---|---|---|---|---|---|---|---|---|---|---|
| GPU Architecture | Turing GPU (TU117) | Turing GPU (TU117) | Turing GPU (TU116) | Turing GPU (TU116) | Turing GPU (TU116) | Turing GPU (TU116) | Turing GPU (TU106) | Turing GPU (TU106) | Turing GPU (TU104) | Turing GPU (TU102) |
| Process | 12nm FNN | 12nm FNN | 12nm FNN | 12nm FNN | 12nm FNN | 12nm FNN | 12nm FNN | 12nm FNN | 12nm FNN | 12nm FNN |
| Die Size | 200mm2 | 200mm2 | 284mm2 | 284mm2 | 284mm2 | 284mm2 | 445mm2 | 445mm2 | 545mm2 | 754mm2 |
| Transistors | 4.7 Billion | 4.7 Billion | 6.6 Billion | 6.6 Billion | 6.6 Billion | 6.6 Billion | 10.6 Billion | 10.6 Billion | 13.6 Billion | 18.6 Billion |
| CUDA Cores | 896 Cores | 896 Cores | 1280 Cores | 1408 Cores | 1408 Cores | 1536 Cores | 1920 Cores | 2304 Cores | 2944 Cores | 4352 Cores |
| TMUs/ROPs | 56/32 | 56/32 | 80/32 | 88/48 | 88/48 | 96/48 | 120/48 | 144/64 | 192/64 | 288/96 |
| GigaRays | N/A | N/A | N/A | N/A | N/A | N/A | 5 Giga Rays/s | 6 Giga Rays/s | 8 Giga Rays/s | 10 Giga Rays/s |
| Cache | 1.5 MB L2 Cache | 1.5 MB L2 Cache | 1.5 MB L2 Cache | 1.5 MB L2 Cache | 1.5 MB L2 Cache | 1.5 MB L2 Cache | 4 MB L2 Cache | 4 MB L2 Cache | 4 MB L2 Cache | 6 MB L2 Cache |
| Base Clock | 1485 MHz | 1410 MHz | 1530 MHz | 1530 MHz | 1530 MHz | 1500 MHz | 1365 MHz | 1410 MHz | 1515 MHz | 1350 MHz |
| Boost Clock | 1665 MHz | 1590 MHz | 1725 MHz | 1785 MHz | 1785 MHz | 1770 MHz | 1680 MHz | 1620 MHz 1710 MHz OC | 1710 MHz 1800 MHz OC | 1545 MHz 1635 MHz OC |
| Compute | 3.0 TFLOPs | 3.0 TFLOPs | 4.4 TFLOPs | 5.0 TFLOPs | 5.0 TFLOPs | 5.5 TFLOPs | 6.5 TFLOPs | 7.5 TFLOPs | 10.1 TFLOPs | 13.4 TFLOPs |
| Memory | Up To 4 GB GDDR5 | Up To 4 GB GDDR6 | Up To 4 GB GDDR6 | Up To 6 GB GDDR5 | Up To 6 GB GDDR6 | Up To 6 GB GDDR6 | Up To 6 GB GDDR6 | Up To 8 GB GDDR6 | Up To 8 GB GDDR6 | Up To 11 GB GDDR6 |
| Memory Speed | 8.00 Gbps | 12.00 Gbps | 12.00 Gbps | 8.00 Gbps | 14.00 Gbps | 12.00 Gbps | 14.00 Gbps | 14.00 Gbps | 14.00 Gbps | 14.00 Gbps |
| Memory Interface | 128-bit | 128-bit | 128-bit | 192-bit | 192-bit | 192-bit | 192-bit | 256-bit | 256-bit | 352-bit |
| Memory Bandwidth | 128 GB/s | 192 GB/s | 192 GB/s | 192 GB/s | 336 GB/s | 288 GB/s | 336 GB/s | 448 GB/s | 448 GB/s | 616 GB/s |
| Power Connectors | N/A | N/A | 6 Pin | 8 Pin | 8 Pin | 8 Pin | 8 Pin | 8 Pin | 8+8 Pin | 8+8 Pin |
| TDP | 75W | 75W | 100W | 120W | 125W | 120W | 160W | 185W (Founders) 175W (Reference) | 225W (Founders) 215W (Reference) | 260W (Founders) 250W (Reference) |
| Starting Price | $149 US | $149 US | $159 US | $219 US | $229 US | $279 US | $349 US | $499 US | $699 US | $999 US |
| Price (Founders Edition) | $149 US | $149 US | $159 US | $219 US | $229 US | $279 US | $349 US | $599 US | $799 US | $1,199 US |
| Launch | April 2019 | April 2020 | November 2019 | March 2019 | October 2019 | February 2019 | January 2019 | October 2018 | September 2018 | September 2018 |









