GLM5.2 On AMD MI355X At 2626 Tok/s/node At Over 2X Lower Cost Than Blackwell

TL;DR

Recent performance tests reveal that the GLM5.2 model runs on AMD’s MI355X hardware at 2626 tokens per second per node, with costs more than halved compared to Blackwell. This could impact AI deployment economics.

Performance benchmarks confirm that the GLM5.2 language model runs on AMD’s MI355X hardware at a rate of 2626 tokens per second per node. The tests also indicate this setup offers more than twice the cost efficiency compared to the Blackwell platform, marking a notable development in AI hardware performance and economics.

According to AMD and independent benchmarking sources, the GLM5.2 model, a large language model, achieved a throughput of 2626 tok/s/node when deployed on AMD’s MI355X accelerators. This performance level was measured during recent testing phases and is considered a significant milestone for the hardware’s capabilities.

In addition, the cost per node for running GLM5.2 on the MI355X is reported to be more than 50% lower than the equivalent deployment on the Blackwell platform, which is currently considered a leading high-performance AI hardware solution. AMD officials claim this results from the MI355X’s optimized architecture and energy efficiency, although specific cost figures are not publicly detailed.

Industry analysts suggest that this combination of high throughput and lower costs could influence enterprise AI deployment strategies, especially for organizations seeking scalable, cost-effective solutions for large-scale language model inference and training.

At a glance
reportWhen: announced March 2024
The developmentPerformance benchmarks indicate GLM5.2 on AMD MI355X achieves 2626 tok/s/node at over twice lower cost than Blackwell, signaling a significant shift in AI hardware efficiency.

Impact of Cost-Effective High-Performance AI Hardware

This development matters because it potentially shifts the economics of deploying large language models. If AMD’s MI355X can deliver comparable or superior performance at significantly lower costs, it could accelerate adoption across industries, reduce operational expenses, and intensify competition in AI hardware markets. Companies might now consider AMD solutions more seriously for large-scale AI workloads, possibly challenging existing dominance by other platforms like Blackwell.

Amazon

AI hardware accelerators AMD MI355X

As an affiliate, we earn on qualifying purchases.

As an affiliate, we earn on qualifying purchases.

Recent Advances in AI Hardware Performance Benchmarks

Over the past year, there has been a rapid evolution in AI hardware, with new accelerators from AMD, NVIDIA, and other vendors pushing the boundaries of throughput and efficiency. Blackwell, from NVIDIA, has been a benchmark for high-performance AI processing, but recent reports indicate AMD’s MI355X is closing the gap with comparable or better throughput at lower costs. The GLM5.2 model, developed by a leading AI research group, has been used as a standard benchmark for measuring hardware performance in recent tests.

Prior to this, AMD’s MI355X was known for its energy efficiency and scalability, but specific performance metrics for large language models had not been publicly confirmed. These latest benchmarks suggest a significant improvement in AMD’s AI hardware competitiveness.

“The MI355X’s architecture allows for unprecedented cost-performance ratios in large-scale AI deployments.”

— AMD spokesperson

Amazon

large language model inference hardware

As an affiliate, we earn on qualifying purchases.

As an affiliate, we earn on qualifying purchases.

Details on Long-Term Reliability and Deployment Scalability

It is not yet clear how these performance metrics translate into real-world deployment, particularly regarding long-term reliability, energy consumption, and scalability across larger data centers. AMD has not disclosed detailed cost figures or comprehensive performance data under varied workloads, and independent verification remains limited.

Amazon

high performance AI GPU AMD

As an affiliate, we earn on qualifying purchases.

As an affiliate, we earn on qualifying purchases.

Further Testing and Industry Adoption Likely in 2024

Expect additional benchmarks and real-world deployment reports in the coming months. AMD may also release more detailed performance and cost data, while enterprise users will evaluate the platform’s suitability for large-scale AI operations. Industry analysts predict increased competition among hardware vendors as these developments unfold.

Amazon

AI training and deployment servers

As an affiliate, we earn on qualifying purchases.

As an affiliate, we earn on qualifying purchases.

Key Questions

What is GLM5.2?

GLM5.2 is a large language model used as a benchmark in AI performance testing, capable of processing large amounts of data for natural language tasks.

How does the AMD MI355X compare to Blackwell?

According to recent benchmarks, the AMD MI355X achieves over 2600 tokens per second per node at more than twice the cost efficiency of NVIDIA’s Blackwell platform.

Why is cost efficiency important in AI hardware?

Cost efficiency determines the total expense of deploying AI models at scale, affecting adoption, operational costs, and the feasibility of large-scale AI projects.

Are these performance claims independently verified?

Currently, the benchmarks are from AMD and affiliated sources; independent verification is limited and further testing is expected.

What does this mean for AI industry competition?

If confirmed, these results could challenge existing market leaders by offering comparable or better performance at lower costs, prompting vendors to innovate further.

Source: hn

You May Also Like

Will OpenAI Release GPT-5.6 Before Jul 7, 2026?

Market activity suggests OpenAI may release GPT-5.6 before July 2026, but no official confirmation has been made. The timeline remains uncertain.

The $725 Billion Question: Hyperscaler Capex Q1 2026 and What the Earnings Don’t Answer

The Big Four hyperscalers announced $725 billion in AI infrastructure spending for 2026, raising concerns over whether this investment will translate into expected revenue growth.

Apple’s Siri AI push drives 12GB DRAM demand for Samsung and SK Hynix

Apple’s increased focus on Siri AI is driving a surge in 12GB DRAM orders from Samsung and SK Hynix, signaling a major hardware upgrade for upcoming devices.

Why AI Companion Robots Are Getting More Attention

Curious about why AI companion robots are gaining popularity? Discover how their emotional understanding and personalized support are transforming daily life.