Blog

Democratizing AI Compute Series

Go behind the scenes of the AI industry with Chris Lattner

Latest

Product

MAX 25.2: Unleash the power of your H200's–without CUDA!

We’re excited to announce MAX 25.2, a major update that unlocks industry-leading performance on the largest language models, built from the ground up without CUDA.

March 25, 2025

/

Modular Team

Industry

What about TVM, XLA, and AI compilers? (Democratizing AI Compute, Part 6)

March 12, 2025

/

Chris Lattner

Industry

What about OpenCL and CUDA C++ alternatives? (Democratizing AI Compute, Part 5)

March 5, 2025

/

Chris Lattner

Community

Modverse #46: MAX 25.1, MAX Builds, and Democratizing AI Compute

We recently introduced MAX 25.1, a major leap forward in AI development. This release enhances agentic and LLM workflows, introduces MAX Builds as a central hub for GenAI models and application recipes, and debuts a new GPU programming interface. Developers can now take advantage of GPU-accelerated embeddings, OpenAI-compatible function calling, structured output generation, and high-performance LLM optimizations like paged attention and prefix caching for improved efficiency.

February 27, 2025

/

Caroline Frasca

Industry

CUDA is the incumbent, but is it any good? (Democratizing AI Compute, Part 4)

Answering the question of whether CUDA is “good” is much trickier than it sounds.

February 20, 2025

/

Chris Lattner

Product

MAX 25.1 - Introducing MAX Builds

February 18, 2025

/

Modular Team

Industry

How did CUDA succeed? (Democratizing AI Compute, Part 3)

If we as an ecosystem hope to make progress, we need to understand how the CUDA software empire became so dominant.

February 12, 2025

/

Chris Lattner

Product

Paged Attention & Prefix Caching Now Available in MAX Serve

February 6, 2025

/

Ehsan M. Kermani

Industry

What exactly is “CUDA”? (Democratizing AI Compute, Part 2)

February 5, 2025

/

Chris Lattner

Industry

DeepSeek's Impact on AI (Democratizing AI Compute, Part 1)

Part 1 of a series that explores the future of hardware acceleration for AI beyond CUDA, framed in the context of the release of DeepSeek.

January 30, 2025

/

Chris Lattner
