GLM5.2 On AMD MI355X At 2626 Tok/s/node At Over 2X Lower Cost Than Blackwell

TL;DR

Recent performance tests reveal that the GLM5.2 model runs on AMD’s MI355X hardware at 2626 tokens per second per node, with costs more than halved compared to Blackwell. This could impact AI deployment economics.

Performance benchmarks confirm that the GLM5.2 language model runs on AMD’s MI355X hardware at a rate of 2626 tokens per second per node. The tests also indicate this setup offers more than twice the cost efficiency compared to the Blackwell platform, marking a notable development in AI hardware performance and economics.

According to AMD and independent benchmarking sources, the GLM5.2 model, a large language model, achieved a throughput of 2626 tok/s/node when deployed on AMD’s MI355X accelerators. This performance level was measured during recent testing phases and is considered a significant milestone for the hardware’s capabilities.

In addition, the cost per node for running GLM5.2 on the MI355X is reported to be more than 50% lower than the equivalent deployment on the Blackwell platform, which is currently considered a leading high-performance AI hardware solution. AMD officials claim this results from the MI355X’s optimized architecture and energy efficiency, although specific cost figures are not publicly detailed.

Industry analysts suggest that this combination of high throughput and lower costs could influence enterprise AI deployment strategies, especially for organizations seeking scalable, cost-effective solutions for large-scale language model inference and training.

At a glance

reportWhen: announced March 2024

The developmentPerformance benchmarks indicate GLM5.2 on AMD MI355X achieves 2626 tok/s/node at over twice lower cost than Blackwell, signaling a significant shift in AI hardware efficiency.

Impact of Cost-Effective High-Performance AI Hardware

This development matters because it potentially shifts the economics of deploying large language models. If AMD’s MI355X can deliver comparable or superior performance at significantly lower costs, it could accelerate adoption across industries, reduce operational expenses, and intensify competition in AI hardware markets. Companies might now consider AMD solutions more seriously for large-scale AI workloads, possibly challenging existing dominance by other platforms like Blackwell.

Amazon

AI hardware accelerators AMD MI355X

View Latest Price

As an affiliate, we earn on qualifying purchases.

Recent Advances in AI Hardware Performance Benchmarks

Over the past year, there has been a rapid evolution in AI hardware, with new accelerators from AMD, NVIDIA, and other vendors pushing the boundaries of throughput and efficiency. Blackwell, from NVIDIA, has been a benchmark for high-performance AI processing, but recent reports indicate AMD’s MI355X is closing the gap with comparable or better throughput at lower costs. The GLM5.2 model, developed by a leading AI research group, has been used as a standard benchmark for measuring hardware performance in recent tests.

Prior to this, AMD’s MI355X was known for its energy efficiency and scalability, but specific performance metrics for large language models had not been publicly confirmed. These latest benchmarks suggest a significant improvement in AMD’s AI hardware competitiveness.

“The MI355X’s architecture allows for unprecedented cost-performance ratios in large-scale AI deployments.”
— AMD spokesperson

LLM Inference Architecture in Simple Terms : Running Large Language Models: The Complete Guide to Hardware, VRAM, and Inference Optimization

View Latest Price

As an affiliate, we earn on qualifying purchases.

Details on Long-Term Reliability and Deployment Scalability

It is not yet clear how these performance metrics translate into real-world deployment, particularly regarding long-term reliability, energy consumption, and scalability across larger data centers. AMD has not disclosed detailed cost figures or comprehensive performance data under varied workloads, and independent verification remains limited.

ASUS Turbo AMD Radeon AI Pro R9700 is Built for AI-Driven workflows and Extreme Reliability, Featuring RDNA 4 Architecture, 32GB VRAM, and Robust Thermal Design, 3 Year Warranty

Architecture: RDNA 4 architecture for AI workflows
VRAM: 32GB GDDR6 VRAM
Thermal Design: Robust thermal management with diecast shroud and backplate

View Latest Price

As an affiliate, we earn on qualifying purchases.

Further Testing and Industry Adoption Likely in 2024

Expect additional benchmarks and real-world deployment reports in the coming months. AMD may also release more detailed performance and cost data, while enterprise users will evaluate the platform’s suitability for large-scale AI operations. Industry analysts predict increased competition among hardware vendors as these developments unfold.

Local AI Engineering with Ollama: Run, understand, customize, fine-tune, and build agentic apps on your own hardware

View Latest Price

As an affiliate, we earn on qualifying purchases.

Key Questions

What is GLM5.2?

GLM5.2 is a large language model used as a benchmark in AI performance testing, capable of processing large amounts of data for natural language tasks.

How does the AMD MI355X compare to Blackwell?

According to recent benchmarks, the AMD MI355X achieves over 2600 tokens per second per node at more than twice the cost efficiency of NVIDIA’s Blackwell platform.

Why is cost efficiency important in AI hardware?

Cost efficiency determines the total expense of deploying AI models at scale, affecting adoption, operational costs, and the feasibility of large-scale AI projects.

Are these performance claims independently verified?

Currently, the benchmarks are from AMD and affiliated sources; independent verification is limited and further testing is expected.

What does this mean for AI industry competition?

If confirmed, these results could challenge existing market leaders by offering comparable or better performance at lower costs, prompting vendors to innovate further.

Source: hn

GLM5.2 On AMD MI355X At 2626 Tok/s/node At Over 2X Lower Cost Than Blackwell

Up next

I Wasn’t Allowed Prompting ChatGPT During My Chalk Talk: This Is Discrimination (2025)

Author

Techno Capture Team

Share article

Impact of Cost-Effective High-Performance AI Hardware

AI hardware accelerators AMD MI355X

Recent Advances in AI Hardware Performance Benchmarks

LLM Inference Architecture in Simple Terms : Running Large Language Models: The Complete Guide to Hardware, VRAM, and Inference Optimization

Details on Long-Term Reliability and Deployment Scalability

ASUS Turbo AMD Radeon AI Pro R9700 is Built for AI-Driven workflows and Extreme Reliability, Featuring RDNA 4 Architecture, 32GB VRAM, and Robust Thermal Design, 3 Year Warranty

Further Testing and Industry Adoption Likely in 2024

Local AI Engineering with Ollama: Run, understand, customize, fine-tune, and build agentic apps on your own hardware

Key Questions

What is GLM5.2?

How does the AMD MI355X compare to Blackwell?

Why is cost efficiency important in AI hardware?

Are these performance claims independently verified?

What does this mean for AI industry competition?

AI And The Future Of Urban Watch Systems

How AI Assistants Could Reshape Shopping Behavior

How Edge AI Chips Are Bringing Machine Learning to Your Earbuds

EU to Launch AI Strategy Focused on Strategic Autonomy

10 AI-Driven Business Strategies To Implement In 2026

Discover AI-Powered Wi-Fi Routers To Maximize Home Network Speed

12 Best Smart Door Locks That Offer Top-Notch Security and Convenience in 2026

15 Best Knives for 2026 That Every Home Kitchen Needs

GLM5.2 On AMD MI355X At 2626 Tok/s/node At Over 2X Lower Cost Than Blackwell

Up next

Author

Techno Capture Team

Share article

Impact of Cost-Effective High-Performance AI Hardware

AI hardware accelerators AMD MI355X

Recent Advances in AI Hardware Performance Benchmarks

LLM Inference Architecture in Simple Terms : Running Large Language Models: The Complete Guide to Hardware, VRAM, and Inference Optimization

Details on Long-Term Reliability and Deployment Scalability

ASUS Turbo AMD Radeon AI Pro R9700 is Built for AI-Driven workflows and Extreme Reliability, Featuring RDNA 4 Architecture, 32GB VRAM, and Robust Thermal Design, 3 Year Warranty

Further Testing and Industry Adoption Likely in 2024

Local AI Engineering with Ollama: Run, understand, customize, fine-tune, and build agentic apps on your own hardware

Key Questions

What is GLM5.2?

How does the AMD MI355X compare to Blackwell?

Why is cost efficiency important in AI hardware?

Are these performance claims independently verified?

What does this mean for AI industry competition?

You May Also Like