Furiosa NXT RNGD AI server system featuring multiple red accelerator modules installed in a high-performance rack chassis.
Furiosa NXT RNGD Server

Enterprise-ready 3kW inference appliance for agentic systems

"Furiosa RNGD provides a compelling combination of benefits: excellent real-world performance, a dramatic reduction in our total cost of ownership, and a surprisingly straightforward integration.”
Kijeong Jeon, Product Unit Lead

RNGD enables 4x more inference capacity

Enterprise AI scale is constrained by data center power density, with most infrastructure today limited to 15kW per rack.

Max. # of servers per rack

5x

2x

Server power consumption

3 kW

7.5 kW

Tokens/s per rack

26,400 tokens/s

6,600 tokens/s

Max. # of users per rack

880

220

Rack-scale comparison between Furiosa’s NXT RNGD AI infrastructure and an RTX PRO 6000-based deployment, showcasing high-density AI inference systems designed to maximize performance, efficiency, and data center utilization.

Built for what's NXT

Product image of the RNGD server
Furiosa NXT RNGD (pronounced “renegade”) Server delivers exceptional performance with cost-efficient scalability for inference with advanced LLM and agentic AI applications. Designed for air-cooled data centers, the NXT RNGD Server can be deployed on-premises, in managed environments, or colocation facilities.
8× RNGD
Cards
4 petaFLOPS
512 TFLOPS × 8 cards
384 GB
HBM3 capacity
12 TB/s
Memory bandwidth
3 kW
Power consumption

Multi-batch LLM generation demo

Multi-batch LLM generation interface showing zero TPS and 128 batches processed on EXAONE-4.0-32B-FP8.
This demo showcases the stability and performance of a single server powered by eight RNGD accelerators. Using LG AI Research’s EXAONE 4.0 32B FP8 model, the system scales from a single generation to 512 concurrent generations, delivering approximately 12,000 tokens per second of aggregate throughput while maintaining around 20 tokens per second per user. The results demonstrate consistent performance and responsiveness under high-concurrency workloads.

Start testing with Furiosa Access

Data center deployment of a Furiosa RNGD AI inference server with multiple accelerator cards installed in a rack-mounted system, delivering scalable performance, high density, and energy-efficient execution for large-scale AI and LLM workloads.
Interested in evaluating NXT RNGD Server? The Furiosa Access Program provides a structured path for customers and partners to evaluate, integrate, qualify, and deploy Furiosa accelerators through both online and offline access. Available worldwide.
Furiosa Access Locations
Seoul
KOR
Bay Area
USA
Lisbon
PRT
Johor Bahru
MYS
Logo of DCP AI Cloud Center
Logo of Equinix

See the specs

Download the NXT RNGD data sheet

Blog

LG U+ and FuriosaAI unveil the Sovereign AI Appliance

News
LG U+ and FuriosaAI unveil the Sovereign AI Appliance

Secure, Production-Ready Agentic AI: The Furiosa and Helikai Partnership

News
Secure, Production-Ready Agentic AI: The Furiosa and Helikai Partnership

RNGD Enters Mass Production: 4,000 High-Performance AI Accelerators Shipped by TSMC

News
RNGD Enters Mass Production: 4,000 High-Performance AI Accelerators Shipped by TSMC