We have a very very delicious rumor making the rounds today and it comes from a highly credible source. @kopite7kimi, the Twitter leaker responsible for pretty much all of the Ampere leaks, revealed a tidbit about the upcoming NVIDIA Ada Lovelace architecture (likely called NVIDIA ADA). Our colleagues over at 3DCenter extrapolated a lot of information from the die size, which Kopite appears to have more or less confirmed. While the source is highly reliable, we are still marking this post as a rumor because of the magnitude of the leak.
NVIDIA ADA GPU Leaked: Monster 64 TFLOPs GPU with 18432 CUDA Cores and 5nm process architecture
The Ada Lovelace architecture - which will likely only be referred to as NVIDIA ADA by the way - was recently leaked by Kopite (and confirmed by Videocardz) and we already seem to have the preliminary specifications of NVIDIA's upcoming GPU. As we mentioned in the original Ada article, Hopper appears to have been delayed for now (and along with it, NVIDIA's MCM ambitions). Thankfully, it seems that NVIDIA has kept its pedal to the metal and its Ada architecture, championed by the AD102 GPU will be an absolute beast. Given below is the original die size leak:
GA102 has a "7*6" structure.
Maybe AD102 will get a "12*6" structure.
— kopite7kimi (@kopite7kimi) December 28, 2020
The folks over at 3DCenter quickly extrapolated a ton of details (we have revised their TFLOP numbers to be a bit more conservative with a clock of 1.75 GHz) which Kopite confirmed:
And a larger cache. It looks like this.
— kopite7kimi (@kopite7kimi) December 28, 2020
For those that want all the information in one place, here is a table summarizing everything:
NVIDIA CUDA GPU (RUMORED) Preliminary:
| GPU | TU102 | GA102 | AD102 |
|---|---|---|---|
| Flagship SKU | RTX 2080 Ti | RTX 3090 Ti | RTX 4090? |
| Architecture | Turing | Ampere | Ada Lovelace |
| Process | TSMC 12nm NFF | Samsung 8nm | TSMC 4N? |
| Die Size | 754mm2 | 628mm2 | ~600mm2 |
| Graphics Processing Clusters (GPC) | 6 | 7 | 12 |
| Texture Processing Clusters (TPC) | 36 | 42 | 72 |
| Streaming Multiprocessors (SM) | 72 | 84 | 144 |
| CUDA Cores | 4608 | 10752 | 18432 |
| L2 Cache | 6 MB | 6 MB | 96 MB |
| Theoretical TFLOPs | 16 TFLOPs | 40 TFLOPs | ~90 TFLOPs? |
| Memory Type | GDDR6 | GDDR6X | GDDR6X |
| Memory Capacity | 11 GB (2080 Ti) | 24 GB (3090 Ti) | 24 GB (4090?) |
| Memory Speed | 14 Gbps | 21 Gbps | 24 Gbps? |
| Memory Bandwidth | 616 GB/s | 1.008 GB/s | 1152 GB/s? |
| Memory Bus | 384-bit | 384-bit | 384-bit |
| PCIe Interface | PCIe Gen 3.0 | PCIe Gen 4.0 | PCIe Gen 4.0 |
| TGP | 250W | 350W | 600W? |
| Release | Sep. 2018 | Sept. 20 | 2H 2022 (TBC) |









