AMD debuted its new family of Radeon Instinct GPUs for AI and deep-learning earlier this week, including the powerful MI25 featuring Vega. The company's most advanced graphics architecture to date. Onequite peculiar contraption was showcased on stage by Radeon Technologies Group's head, Raja Koduri,whichhas caught our attention.
A small, 4x3 inch, quad GPU cube with one hundred TERAFLOPS of FP16 compute and half that inFP32. Toput that figureinto perspective, Nvidia's liquid cooled Drive PX2 AI supercomputer for autonomous, driverless, cars delivers 16 TFLOPS of FP32 compute and 8 TFLOPS of FP16 compute. That'sless than one sixth ofwhat the Vega cube is capable of. And while the Drive PX2 is a large box meant to goinside the trunk of a car, the Vega cube fits into the palm your hand.
Meet The Mini-Nuke Of Supercomputers, AMD's VegaCube

AMD Vega Cube, held by RTG's Chief Architect Raja Koduri
The prototype that Koduri showcased on stage is made up of four individual Vega 10 graphics processors, each residing on its own little circuit board. Vega cubes aredesigned to be stacked upverticallyone on-top the other via a unique interface. This ,in theory at least, would enable the creation of extraordinarily powerful supercomputers orders of magnitude smaller than what we see today.
We can only imagine the challenge associated with cooling such a dense device. Especially considering that each Radeon Instinct MI25 accelerator, powered by a single Vega 10 chip, is rated at 300W of power. Nvidia's own Telsa P100 deep-learning accelerator is rated at 300W However, it doesn't come in a configuration that allows anywhere near the same computing or thermal densityas the Vega Cube.
Liquid cooling would be anobvious route. One that Nvidia has already taken with its Drive PX2 box, in fact. Although it's not the only one. We've seen AMD take its liquid cooled 275W R9 Fury X, powered by Vega's older sibling Fiji,down to 175W in the form of the R9 Nano. And do it in exchange for less than 15% of the performance. A similarly power-optimized variant of Vega 10 with lower clock speeds is unquestionalyin the pipeline.
Replicating The Power Of The Human Brain, We're Getting Close
Back in 2001, prolific futuristand one of the world's biggest proponentsof the technological singularity hypothesis, Ray Kurzweil, predicted that by 2019 a typical $1000 computer will match the processing power of the human brain. As things stand today, AMD's Vega cube will be able to hit that performance marknext year, all be it atmore than $1000.

As Vega is introduced into the larger consumer gaming-focused market next year and as costs come down over the next couple of years, it's entirely feasible that by 2019 we will actually have $1000 computers that match theprocessing power of the human brain. It's important to note though that machinescanonly be assmart as the software running on them. Advancements in AI will be the key to converting that processing power into actual intelligence. Whether that's going to happen within the next three years or not is yet to be seen. One thing's for sure though, we're getting incredibly close.
AMD Vega Lineup
| Graphics Card | Radeon R9 Fury X | Radeon RX 480 | Radeon RX Vega Frontier Edition | Radeon Vega Pro | Radeon RX Vega (Gaming) | Radeon RX Vega Pro Duo | 
|---|---|---|---|---|---|---|
| GPU | Fiji XT | Polaris 10 | Vega 10 | Vega 10 | Vega 10 | 2x Vega 10 | 
| Process Node | 28nm | 14nm FinFET | FinFET | FinFET | FinFET | FinFET | 
| Stream Processors | 4096 | 2304 | 4096 | 3584 | 4096 (?) | Up to 8192 | 
| Performance | 8.6 TFLOPS 8.6 (FP16) TFLOPS  | 5.8 TFLOPS 5.8 (FP16) TFLOPS  | ~13 TFLOLPS ~25 (FP16) TFLOPS  | 11 TFLOLPS 22 (FP16) TFLOPS  | >13 TFLOLPS >25 (FP16) TFLOPS  | TBA TBA  | 
| Memory | 4GB HBM | 8GB GDDR5 | 16GB HBM2 | TBA | TBA | TBA | 
| Memory Bus | 4096-bit | 256-bit | 2048-bit | 2048-bit | 2048-bit | 4096-bit | 
| Bandwidth | 512GB/s | 256GB/S | 480GB/S | 400GB/S | TBA | TBA | 
| TDP | 275W | 150W | TBA | TBA | TBA | TBA | 
| Launch | 2015 | 2016 | June 2017 | June 2017 | July 2017 | TBA | 









