AMD's most powerful GPU yet code-named Vega 10 is set to debut at the end of the year with Vega 11 followingearly next year. The company also has a new board code-named "Magnum" that will be showcased at SC 2016this upcoming November. It's not clear yet whether Vega 10 will make its debut alongside "Magnum" next month but several of our sources have confirmed that we'll definitely see Vega 10 before year's end.
Magnum is a unique chip, it features a matrix of logic blocks that can be configured and programmed individually for any desired application or program. In other words, it's the company's first ever FPGA and its greatest attempt yet to expand its penetration into the high performance embedded market.
AMD's Vega 10 Debuting Before Year's End - Features New V9 GCN Architecture
Moving on to Vega 10, the GPU that most of you reading this are probably interested in. Vega 10 will be the very first GPU based on the company's brand new GCN graphics architecture and 9th generation collection of visual IP.Vega 10 has been long in the making and we've heard so much about it over the past several months.
The specifications for AMD's upcoming flagship Radeon graphics card, Fury X successor, based on Vega 10 have been leaked on one of the company's leading engineers' LinkedIn page. Packing an impressive 4096 next generation V9 cores and an unbelievable 12 teraflops of compute performance. Putting it ahead of Nvidia's top dog the $1200 GTX TItan X.
Since then we have actually spotted the card on AMD's & Sapphire's official websites. The card was touted as a 4K gaming capable high-end offering with a 2016 release date. Not long after that Raja Koduri announced that the Radeon Technologies Group was celebrating a major milestone with regards to Vega 10's development. Subsequently, the head of PR at the Radeon Technologies Group teased the launch venue for Vega 10 on his facebook page.
Completely Redesigned V9 GCN Architecture With Significant Advances
The V9 nomenclature refers to AMD's 9th graphics architecture. Which represents a significant departure from the previous 8th generation GCN architecture of Polaris ,RX 400 series, and AMD's "Islands" family of R9 200 & HD 7000graphics cards.
The structure and configuration of SIMD units is entirely new. Each Vega NCUis now capable of simultaneously processing variable length wavefronts. To understand why this is such a bigdeal we have to look at AMD's current GCN implementation.

In AMD's current GCNimplementation, each compute unit has four 16-wide vector SIMD units, capable of executing four 16-wide wavefronts ( a group of threads ) over four cycles. In addition to one scalar unit, capable of executing one instructionper cycle. This unit is delegated time-critical tasks, where the four-cycle turnaround of the SIMD units isn't sufficient.
Unfortunately, these 16-wide SIMD units work exactly the same no matter how small of a wavefront they're fed. Executing a 16-wide wavefront would take just as long as executing a4-wide wavefront, rendering the other 12 ALUs inside the SIMD completely useless. And as graphicsworkloads are inherently non uniform it's effectively impossible to find any scenario whereall 16-wide SIMD units are fully occupied at any given time.
Variable Width SIMDs, Getting More Performance Out Of Fewer Cycles
This is no longer the case in AMD's new GCN implementation inside Vega. The V9 architectureincludes new incredibly clever schedulers and coherency subsystemsthat allow several smaller wavefronts tobe executed simultaneously inside any SIMD that's able to accommodate the workload. This in effect allows each NCUto finish considerably more work in the same amount of time compared to its predecessor. In addition to freeing up valuable cache and memory resources forother compute units.

It's very hard to predict how much of a difference this big of animprovement in resource utilization and CU occupancywill yield given how unpredictable and inherently fluctuantgraphics workloads are. Which brings us neatly to Vega's rumored specs.
Vega 10 will come in several configurations for gaming & professional computing applications. All configurations will feature second generation high bandwidth memory but in various capacities. Radeon Pro Vega 10 will feature the highest configuration of 4096 stream processors and 16GB of memory. The board has a TDP of ~230W and is powered by eight and six pin PCIe power connectors. Cooling wise the Radeon Pro will come in active and passivelycooled configurations for servers.
AMD Vega Lineup
| Graphics Card | Radeon R9 Fury X | Radeon RX 480 | Radeon RX Vega Frontier Edition | Radeon Vega Pro | Radeon RX Vega (Gaming) | Radeon RX Vega Pro Duo |
|---|---|---|---|---|---|---|
| GPU | Fiji XT | Polaris 10 | Vega 10 | Vega 10 | Vega 10 | 2x Vega 10 |
| Process Node | 28nm | 14nm FinFET | FinFET | FinFET | FinFET | FinFET |
| Stream Processors | 4096 | 2304 | 4096 | 3584 | 4096 (?) | Up to 8192 |
| Performance | 8.6 TFLOPS 8.6 (FP16) TFLOPS | 5.8 TFLOPS 5.8 (FP16) TFLOPS | ~13 TFLOLPS ~25 (FP16) TFLOPS | 11 TFLOLPS 22 (FP16) TFLOPS | >13 TFLOLPS >25 (FP16) TFLOPS | TBA TBA |
| Memory | 4GB HBM | 8GB GDDR5 | 16GB HBM2 | TBA | TBA | TBA |
| Memory Bus | 4096-bit | 256-bit | 2048-bit | 2048-bit | 2048-bit | 4096-bit |
| Bandwidth | 512GB/s | 256GB/S | 480GB/S | 400GB/S | TBA | TBA |
| TDP | 275W | 150W | TBA | TBA | TBA | TBA |
| Launch | 2015 | 2016 | June 2017 | June 2017 | July 2017 | TBA |









