Nvidia is the market leader in artificial intelligence computing hardware. The green corporation makes billions, and new devices ship by the ton. At the same time, both AMD and Intel offer alternatives. The latter has a powerful compute accelerator called Gaudi 2, which rarely makes the news but has high potential. Stability AI, the developer of the famous Stable Diffusion model, has shared its tests comparing Intel's Gaudi 2 with Nvidia's H100. Surprisingly, the Intel accelerator delivered better results.
When running the new Stable Diffusion 3 model, the Intel Gaudi 2 accelerator showed exceptional results. For the test, a model with 2 billion parameters was run on two nodes with 16 accelerators each. The Gaudi 2 configuration processed images 56% faster than the H100 setup, and compared with the older Nvidia A100, the gap grows to 2.4 times.
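As a back-of-the-envelope sketch, the two reported speedups can be combined to see what they imply about the H100 versus the A100 in this particular test. The factors below come straight from the figures above; this is illustrative arithmetic, not a new benchmark.

```python
# Relative throughput factors taken from the reported results:
gaudi2_vs_h100 = 1.56   # "56% faster" than the H100 setup
gaudi2_vs_a100 = 2.4    # "2.4 times" the A100 setup

# Implied H100 advantage over the A100 in this same test:
h100_vs_a100 = gaudi2_vs_a100 / gaudi2_vs_h100
print(f"Implied H100 vs A100: about {h100_vs_a100:.2f}x")
```

In other words, these numbers imply the H100 was roughly 1.5x the A100 here, with Gaudi 2 ahead of both.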
Stability AI also presented results for its Stable Beluga 2.5 model with 70 billion parameters, which is based on LLaMA 2. Running in plain PyTorch, without extra optimizations, a configuration of 256 Gaudi 2 accelerators delivered an average throughput of 116,777 tokens per second, 28% faster than the comparable A100 configuration.
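The 28% figure also lets us estimate the A100 configuration's implied throughput. Again, this is simple arithmetic on the reported numbers, not additional test data.

```python
# Reported Gaudi 2 throughput and its stated advantage over the A100:
gaudi2_tokens_per_sec = 116_777
speedup_vs_a100 = 1.28  # "28% faster"

# Implied throughput of the equivalent A100 configuration:
a100_tokens_per_sec = gaudi2_tokens_per_sec / speedup_vs_a100
print(f"Implied A100 throughput: ~{a100_tokens_per_sec:,.0f} tokens/s")
```

That works out to roughly 91,000 tokens per second for the A100 setup in the same test.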
The Intel Gaudi 2 is built on a powerful chip of Intel's own architecture. It is designed for heterogeneous computing and is equipped with 24 tensor processor cores, 48 MB of SRAM, 96 gigabytes of HBM2e memory, and 24 integrated 100 Gigabit Ethernet ports. The large memory capacity may be one of the factors behind Gaudi 2's success in these tests: the standard Nvidia H100 accelerator carries only 80 gigabytes, which can be limiting for large AI models. But remember that the green giant has already announced the H200 with 141 GB of HBM3e memory.
Source:
Wccftech