Name: Furiosa NXT RNGD Server
Brand: FuriosaAI
Availability: InStock
Rating: 5 (1 reviews)

A data center aisle with black server racks on both sides under bright linear ceiling lights.

"Furiosa RNGD provides a compelling combination of benefits: excellent real-world performance, a dramatic reduction in our total cost of ownership, and a surprisingly straightforward integration.”

Kijeong Jeon, Product Unit Lead

RNGD enables 4x more inference capacity

Enterprise AI scale is constrained by data center power density, with most infrastructure today limited to 15kW per rack.

Max. # of servers per rack

5x

2x

Server power consumption

3 kW

7.5 kW

Tokens/s per rack

26,400 tokens/s

6,600 tokens/s

Max. # of users per rack

880

220

Rack-scale comparison between Furiosa’s NXT RNGD AI infrastructure and an RTX PRO 6000-based deployment, showcasing high-density AI inference systems designed to maximize performance, efficiency, and data center utilization.

Built for what's NXT

Furiosa NXT RNGD (pronounced “renegade”) Server delivers exceptional performance with cost-efficient scalability for inference with advanced LLM and agentic AI applications. Designed for air-cooled data centers, the NXT RNGD Server can be deployed on-premises, in managed environments, or colocation facilities.

8× RNGD

Cards

4 petaFLOPS

512 TFLOPS × 8 cards

384 GB

HBM3 capacity

12 TB/s

Memory bandwidth

3 kW

Power consumption

Multi-batch LLM generation demo

Multi-batch LLM generation interface showing zero TPS and 128 batches processed on EXAONE-4.0-32B-FP8.

This demo showcases the stability and performance of a single server powered by eight RNGD accelerators. Using LG AI Research’s EXAONE 4.0 32B FP8 model, the system scales from a single generation to 512 concurrent generations, delivering approximately 12,000 tokens per second of aggregate throughput while maintaining around 20 tokens per second per user. The results demonstrate consistent performance and responsiveness under high-concurrency workloads.

Start testing with Furiosa Access

Data center deployment of a Furiosa RNGD AI inference server with multiple accelerator cards installed in a rack-mounted system, delivering scalable performance, high density, and energy-efficient execution for large-scale AI and LLM workloads.

Interested in evaluating NXT RNGD Server? The Furiosa Access Program provides a structured path for customers and partners to evaluate, integrate, qualify, and deploy Furiosa accelerators through both online and offline access. Available worldwide.

Furiosa Access Locations

Seoul

KOR

Bay Area

USA

Lisbon

PRT

Johor Bahru

MYS