Intel's Aurora Supercomputer Now Expected to Exceed 2 ExaFLOPS Performance
by Ryan Smith on October 27, 2021 3:25 PM EST- Posted in
- Supercomputing
- Intel
- HPC
- Aurora
- Exascale
- Sapphire Rapids
- Ponte Vecchio
As part of Intel’s 2021 Innovation event, the company offered a brief update on the Aurora supercomputer, which Intel is building for Argonne National Laboratory. The first of the US’s two under-construction exascale supercomputers, Aurora and its critical processors are finally coming together, allowing Intel to finally narrow its performance projections. As it turns out, the 1-and-change exaFLOPS system is going to be more like a 2 exaFLOPS system – Aurora’s performance is coming in high enough that Intel now expects the system to exceed 2 exaFLOPS of double precision compute performance.
Planned to be the first of the US’s two public exascale systems, the Aurora supercomputer has been through a tumultuous development process. The contract was initially awarded to Intel and Cray back in 2015 for a pre-exascale system based on Intel’s Xeon Phi accelerators, a plan that went out the window when Intel discontinued Xeon Phi development. In its place, the Aurora contract was renegotiated to become an exascale system based on a combination of Intel’s Xeon CPUs and what became their Ponte Vecchio Xe-HPC GPUs. Since then, Intel has been working down to the wire on getting the necessary silicon built in order to make a delivery window that’s already shifted from 2020 to 2021 to 2022(ish), going as far as fabbing parts of Ponte Vecchio on rival TSMC’s 5nm process.
But there is finally light at the end of the tunnel, it would seem. As Intel pushes to complete the system, its performance is coming in ahead of expectations. According to the chip company, they now expect that the assembled supercomputer will be able to deliver over 2 exaFLOPS of double precision (FP64) performance. The system previously didn’t have a specific performance figure attached to it, beyond the fact that it would be over 1 exaFLOPS in FP64 throughput.
This higher performance figure for Aurora comes courtesy of Ponte Vecchio, which according to CEO Pat Gelsinger is overdelivering on performance. Gelsinger hasn’t gone into additional detail in how Ponte Vecchio is overdelivering, but given that IPC and overall efficiency tends to be relatively easy to nail down during simulations, the most likely candidate here is that Ponte Vecchio’s is clocking higher than Intel’s previous projections. Ponte Vecchio is one of the first HPC chips (and the first Intel GPU) built on TSMC’s N5 process, so there have been a lot of unknowns going into this project.
For Intel, this is no doubt a welcome bit of good luck for a project that has seen many hurdles. The repeated delays have already allowed rival AMD to get the honors of delivering the first exascale system with Frontier, which is currently being installed and is expected to offer 1.5 exaFLOPS in performance. So while Intel no longer gets to be first, once Aurora does come online next year, it will be the faster of the two systems.
Source: Intel
14 Comments
View All Comments
whatthe123 - Wednesday, October 27, 2021 - link
Looks both good and bad for intel. Good because I doubt they would intentionally push the target up when they've been missing the original target for half a decade, so it's likely real performance gains and an already impressive chip, bad because moving to TSMC is probably a big reason they've hit an unexpected improvement in performance. Could mean strong competition from 5nm tsmc parts next year.shabby - Wednesday, October 27, 2021 - link
I wonder who made the suggestion to use tsmc... and then i wonder if they said to themselves "uh that was a joke guys" lololwhatthe123 - Wednesday, October 27, 2021 - link
Rumor was Keller suggested it, though I wouldn't be surprised if Raja also suggested it since they kept missing 10nm deadlines and hes done designs on TSMC before. I'm sure intel will eventually get their fabs back on their feet but it was just bad business not to outsource more parts in the interim, especially on chiplet designs like this. Effectively upping capacity substantially by ordering TSMC is $$$$ in the bank in this market.drothgery - Wednesday, October 27, 2021 - link
Ponte Vecchio is a multi-chip module. A lot of it is built on Intel processes.Zzzoom - Wednesday, October 27, 2021 - link
A lot of a Ryzen or EPYC is built on Globalfoundries processes too, it's still compute that matters.drothgery - Thursday, October 28, 2021 - link
In this case the Xe HPC cores are on TMSC N5, but the Golden Cove cores are Intel 7 (and a bunch of stuff on various other processes).Spunjji - Thursday, October 28, 2021 - link
Wondering if this is a big change (1.7 to 2.1, for example) or a small one (1.9 to 2.01).Hardware Geek - Thursday, October 28, 2021 - link
I'll believe it when it is up and running. 3 years late and how much over budget?Spunjji - Wednesday, November 10, 2021 - link
Also wondering what the power usage will look like compared with AMD's 1.5 EFLOPS super computer...lightningz71 - Thursday, October 28, 2021 - link
Given the multiple times it's been pushed back, and the fact that the foundry industry isn't standing still, nor is the competition, it is only fitting, and honestly to be expected, that the final delivered performance of the system be above the originally expected numbers when the contract was signed many years ago!