Home / Component / Graphics / Nvidia: ‘Pascal’ architecture’s NVLink to enable 8-way multi-GPU capability

Nvidia: ‘Pascal’ architecture’s NVLink to enable 8-way multi-GPU capability

Compute performance of modern graphics processing units (GPUs) is tremendous, but so are the needs of modern applications that use such chips to display beautiful images or perform complex scientific calculations. Nowadays it is rather impossible to install more than four GPUs into a computer box and get adequate performance scaling. But brace yourself as Nvidia is working on eight-way multi-GPU technology.

The vast majority of personal computers today have only one graphics processor, but many gaming PCs used to play games integrate two graphics cards for increased framerate. Enthusiasts, who want to have unbeatable performance in select games and benchmarks opt for three-way or four-way multi-GPU setups, but these are pretty rare because scaling beyond two GPUs is not really high. Professionals, who need high-performance GPUs for simulations, deep learning and other applications also benefit from four graphics processors and could use even more GPUs per box. Unfortunately, that is virtually impossible because of limitations imposed by today’s PCI Express and SLI technologies. However, Nvidia hopes that with the emergence of the code-named “Pascal” GPUs and NVLink bus, it will be considerably easier to build multi-GPU machines.

nvidia_battlebox_4K_uhd_sli_geforce_gtx_titan

Today even the top-of-the-range Intel Core i7-5960X processor has only 40 PCI Express 3.0 lanes (up to 40GB/s of bandwidth), thus, can connect up to two graphics cards using PCIe 3.0 x16 or up to four cards using PCIe 3.0 x8 bus. In both cases, maximum bandwidth available for GPU-to-GPU communications will be limited to 16GB/s or 8GB/s (useful bandwidth will be around 12GB/s and 6GB/s) in the best case scenarios since GPUs need to communicate with the CPU too.

nvidia_pascal_nvlink_0

In a bid to considerably improve communication speed between GPUs, Nvidia will implement support of proprietary NVLink bus into its next-generation “Pascal” GPUs. Each NVLink point-to-point connection will support 20GB/s of bandwidth in both directions simultaneously (16GB/s effective bandwidth in both directions) and each “Pascal” high-end GPU will support at least four of such links. In case a of a system with NVLink, two GPUs would get a total peak bandwidth of 80GB/s (64GB/s effective) per direction between them. Moreover, PCI Express bandwidth would be preserved for CPU-to-GPU communications. In case of four-GPU sub-system, graphics processors would get up to 40GB/s bandwidth to communicate with each other.

nvidia_pascal_nvlink

According to Nvidia, NVLink is projected to deliver up to two times higher performance in many applications simply by replacing the PCIe interconnect for communication among peer GPUs. It should be noted that in an NVLink-enabled system, CPU-initiated transactions such as control and configuration are still directed over a PCIe connection, while any GPU-initiated transactions use NVLink, which allows to preserve the PCIe programming model.

nvidia_pascal_nvlink_1

Additional bandwidth provided by NVLink could allow one to build a personal computer with up to eight GPUs. However, to make it useful in applications beyond technical computing, Nvidia will have to find a way to efficiently use eight graphics cards for rendering. Since performance scaling beyond two GPUs is generally low, it is unlikely that eight-way multi-GPU technology will actually make it to the market. However, if Nvidia manages to improve efficiency of current multi-GPU technologies in general by replacing SLI [scalable link interface] with NVLink, that could further boost popularity of the company’s graphics cards among gamers.

Performance improvement could be even more significant in systems that completely rely on NVLink instead of PCI Express. IBM plans to add NVLink to select Power microprocessors for supercomputers and the technology will be extremely useful for high-performance servers powered by Nvidia Tesla accelerators.

Discuss on our Facebook page, HERE.

KitGuru Says: While actual video games will hardly be able to use eight GPUs efficiently, we are pretty sure that 3DMark benchmarks will set records on extreme many-GPU graphics sub-systems.

Become a Patron!

Check Also

Leaker claims Nvidia RTX 5070 Ti will pack 8,960 CUDA cores

Leaker Kopite7kimi, known for accurate Nvidia leaks, claims that a GeForce RTX 5070 Ti is in the works and could launch alongside the RTX 5080 at CES.

8 comments

  1. Good explanation, i am just a single video card user, but good of you to show where the industry is going

  2. I use a single graphics card in the £150-200 region (it’s a GTX 760). If I wanted to upgrade, would it be more effective to double up or sell the one I’ve got and put that money towards a more powerful single card solution?

  3. If they keep this up, potentially getting several 4 $250 NVidia cards would replace getting a $1,000 card by a large amount. Assuming all games support them. Specially if the scaling is much more amazing than it is now.

    This is kind of the case now, but it would be more worthwhile in the end. I’m more of a single card user myself. Just less effort and less opportunity for component failure. But, if they keep these power usages down and temps, this won’t be a problem.

  4. Ólafur Hrafn Halldórsson

    Single card solution imo

  5. In your situation (gtx 760) I would definitely stick to a single card upgrade. Sell your GTX 760 (you should be able to get maybe $100 – 120 out of it on like e-bay, perhaps more. (GTX 780s are going for $300-350 on ebay, and GTX 770s for like $200-225ish, so you could probably sell a 760 for $140-150, and with e-bay fees you’d make out with ~$120 or so) Then if you could afford an extra $150 – 200 or so you could easily get yourself a GTX 970. You can find used GTX 970s on amazon or ebay for under $300, and even brand new they only go for $320ish for ones like the EVGA Superclocked etc..
    You would get a little bit more power out of two GTX 760s vs a single 970, but not really a whole lot (a GTX 760 is equal to a GTX 670, and a GTX 680 is exactly HALF the power of the new GTX 980. So if you consider that the GTX 670 is roughly 12 – 15% weaker than the GTX 680, thus the GTX 760 is also about 12 – 15% weaker than a 680; you can come to the conclusion that a GTX 980 is roughly 60 – 65% faster than your GTX 760. Then we have to include that the GTX 970 is about 15% weaker than the GTX 980.
    So overall this means that a GTX 970 would be 45 – 50% faster than your GTX 760, plus it runs on very low power/wattage considering how powerful it is (~150 watts TDP), has very low temperatures, stays low noise especially ones like the ASUS Strix or the EVGA ones which have options for the fans to turn themselves off until it gets too hot. The GTX 970 also has all of the Maxwell architecture improvements and special technologies like DSR (allows 1440p and 4k rendering on a 1080p screen), VXGI (special lighting features), Full DX12 support, MFAA (anti-aliasing that gives MSAA quality for half the fps cost) etc..

  6. Charles J Marsh

    One of the best comments I’ve ever read. Well written sir!

  7. Thanks, I always try to be as informative as possible.

  8. So will this new card NVLINK with my 980Ti?