Furiosa Algorithms Research

We work at the intersection of AI and hardware to make AI computing sustainable

Transformers in the Dark: Navigating Unknown Search Spaces via Bandit Feedback
Since day 1, our hardware and software team have worked together as a single team toward a shared vision.
Transformers in the Dark: Navigating Unknown Search Spaces via Bandit Feedback
TABED: Test-Time Adaptive Ensemble Drafting for Robust Speculative Decoding in LVLMs
Since day 1, our hardware and software team have worked together as a single team toward a shared vision.
TABED: Test-Time Adaptive Ensemble Drafting for Robust Speculative Decoding in LVLMs
Draft-based Approximate Inference for LLMs
Since day 1, our hardware and software team have worked together as a single team toward a shared vision.
Draft-based Approximate Inference for LLMs

Publications

Transformers in the Dark: Navigating Unknown Search Spaces via Bandit Feedback

TMLR
2026
search
transformer
View Job

TABED: Test-Time Adaptive Ensemble Drafting for Robust Speculative Decoding in LVLMs

EACL
2026
speculative-decoding
vision-lanuage
View Job

Draft-based Approximate Inference for LLMs

ICLR
2026
speculative-decoding
kv-cache
View Job

ParallelBench: Understanding the Tradeoffs of Parallel Decoding in Diffusion LLMs

ICLR
2026
parallel-decoding
benchmark
View Job

Inference-Aligned SFT for Diffusion LLMs via Group-based Trajectory Sampling

ICLR
2026
diffusion-llm
discrete-diffusion
sft
View Job

XQuant: Breaking the Memory Wall for LLM Inference with KV Cache Rematerialization

2026
kv-cache
quantization
View Job

Counting Guidance for High Fidelity Text-to-Image Synthesis

WACV
2025
text-to-image
View Job

VersaPRM: Multi-Domain Process Reward Model via Synthetic Reasoning Data

ICML
2025
reward-model
reasoning
View Job

Parameter-Efficient Fine-Tuning of State Space Models

ICML
2025
ssm
peft
View Job

State-offset Tuning: State-based Parameter-Efficient Fine-Tuning for State Space Models

ACL
2025
ssm
peft
View Job

Eta Inversion: Designing an Optimal Eta Function for Diffusion-based Real Image Editing

ECCV
2024
diffusion
image-editing
View Job

Can MLLMs Perform Multimodal In-Context Learning for Text-to-Image Generation?

COLM
2024
text-to-image
in-context-learning
View Job
Long-term collaboration partners on LLM efficiency, quantization, parallel decoding, and other advanced research areas for efficient inference.

Join our team

See Open Roles