Qualcomm to Acquire Modular. Read More →

April 16, 2026

How Frontier Coding Agents Built a Video Diffusion Pipeline on MAX

Rajan Agarwal

Evan Chu

Tim Davis

Eric Johnson

Wan 2.1 inference pipeline as implemented on MAX. Agents had to rebuild all three stages in MAX/Mojo with no PyTorch, vLLM, transformers, or diffusers in the final submission. — Wan 2.1 inference pipeline as implemented on MAX. Agents had to rebuild all three stages in MAX/Mojo with no PyTorch, vLLM, transformers, or diffusers in the final submission.

Diffusion denoising over 8 steps: the pipeline refines random noise into coherent video frames. The verifier compares final output against PyTorch reference frames using per-frame PSNR. — Wan 2.1 inference pipeline as implemented on MAX. Agents had to rebuild all three stages in MAX/Mojo with no PyTorch, vLLM, transformers, or diffusers in the final submission.

Read more from Modular

No items found.

Build the future of AI with Modular

Get started - FREE

Sign up today
Signup to our Cloud Platform today to get started easily.
Sign Up
Browse open models
Browse our model catalog, or deploy your own custom model
Browse models

No items found.