NVIDIA ADA LOVELACE GPU Gets First Rumored Specs: Absolute Monster At 18432 CUDA Cores And 64 TFLOPs Of Graphics Horsepower

We have a very juicy rumor making the rounds today, and it comes from a highly credible source. @kopite7kimi, the Twitter leaker responsible for nearly all of the Ampere leaks, revealed a tidbit about the upcoming NVIDIA Ada Lovelace architecture (likely to be called NVIDIA ADA). Our colleagues over at 3DCenter extrapolated a lot of information from the leaked die structure, and Kopite appears to have more or less confirmed their numbers. While the source is highly reliable, we are still marking this post as a rumor because of the magnitude of the leak.

NVIDIA ADA GPU Leaked: Monster 64 TFLOPs GPU with 18432 CUDA Cores and 5nm Process

The Ada Lovelace architecture, which will likely only be referred to as NVIDIA ADA, was recently leaked by Kopite (and confirmed by Videocardz), and we already seem to have the preliminary specifications of NVIDIA's upcoming GPU. As we mentioned in the original Ada article, Hopper appears to have been delayed for now (and along with it, NVIDIA's MCM ambitions). Thankfully, it seems that NVIDIA has kept its pedal to the metal, and its Ada architecture, championed by the AD102 GPU, will be an absolute beast. The original die structure leak is given below:

GA102 has a "7*6" structure.

Maybe AD102 will get a "12*6" structure.

— kopite7kimi (@kopite7kimi) December 28, 2020

The folks over at 3DCenter quickly extrapolated a ton of details from this structure (we have revised their TFLOP numbers to be a bit more conservative, assuming a clock of 1.75 GHz), which Kopite confirmed:

And a larger cache. It looks like this.

— kopite7kimi (@kopite7kimi) December 28, 2020

For those that want all the information in one place, here is a table summarizing everything:

NVIDIA CUDA GPU (RUMORED) Preliminary:

GPU	TU102	GA102	AD102
Flagship SKU	RTX 2080 Ti	RTX 3090 Ti	RTX 4090?
Architecture	Turing	Ampere	Ada Lovelace
Process	TSMC 12nm FFN	Samsung 8nm	TSMC 4N?
Die Size	754 mm²	628 mm²	~600 mm²
Graphics Processing Clusters (GPC)	6	7	12
Texture Processing Clusters (TPC)	36	42	72
Streaming Multiprocessors (SM)	72	84	144
CUDA Cores	4608	10752	18432
L2 Cache	6 MB	6 MB	96 MB
Theoretical TFLOPs	16 TFLOPs	40 TFLOPs	~90 TFLOPs?
Memory Type	GDDR6	GDDR6X	GDDR6X
Memory Capacity	11 GB (2080 Ti)	24 GB (3090 Ti)	24 GB (4090?)
Memory Speed	14 Gbps	21 Gbps	24 Gbps?
Memory Bandwidth	616 GB/s	1008 GB/s	1152 GB/s?
Memory Bus	384-bit (352-bit on RTX 2080 Ti)	384-bit	384-bit
PCIe Interface	PCIe Gen 3.0	PCIe Gen 4.0	PCIe Gen 4.0
TGP	250W	450W	600W?
Release	Sep. 2018	Sep. 2020	2H 2022 (TBC)
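
The memory bandwidth row follows directly from the memory speed and bus width rows. As a quick sanity check, here is a small sketch using the standard relation: peak bandwidth in GB/s equals the per-pin data rate in Gbps times the bus width in bits, divided by 8. The function name is our own, and the AD102 inputs are the rumored values from the table, not confirmed specs.

```python
def memory_bandwidth_gbs(data_rate_gbps: float, bus_width_bits: int) -> float:
    """Peak memory bandwidth in GB/s from per-pin data rate and bus width."""
    return data_rate_gbps * bus_width_bits / 8


print(memory_bandwidth_gbs(21, 384))  # GA102 (RTX 3090 Ti): 1008.0
print(memory_bandwidth_gbs(24, 384))  # AD102 (rumored):     1152.0
print(memory_bandwidth_gbs(14, 352))  # RTX 2080 Ti:         616.0
```

Note that the 616 GB/s Turing figure corresponds to the RTX 2080 Ti's cut-down 352-bit bus, not the full 384-bit bus of the TU102 die.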