Blog

New

Qualcomm to Acquire Modular

June 24, 2026

New

ModCon 2026: Modular’s Developer Conference

June 17, 2026

New

Modular 26.4: SOTA MoE Serving, Model Bringup via Agent Skills, Mojo 1.0 Beta 2 and More

June 18, 2026

Latest

News

Series

Modverse #46: MAX 25.1, MAX Builds, and Democratizing AI Compute

We recently introduced MAX 25.1, a major leap forward in AI development. This release enhances agentic and LLM workflows, introduces MAX Builds as a central hub for GenAI models and application recipes, and debuts a new GPU programming interface. Developers can now take advantage of GPU-accelerated embeddings, OpenAI-compatible function calling, structured output generation, and high-performance LLM optimizations like paged attention and prefix caching for improved efficiency.

February 27, 2025

Caroline Frasca

News

Series

CUDA is the incumbent, but is it any good? (Democratizing AI Compute, Part 4)

Answering the question of whether CUDA is “good” is much trickier than it sounds.

February 20, 2025

Chris Lattner

News

Product

MAX 25.1 - Introducing MAX Builds

February 18, 2025

Modular Team

News

Series

How did CUDA succeed? (Democratizing AI Compute, Part 3)

If we as an ecosystem hope to make progress, we need to understand how the CUDA software empire became so dominant.

February 12, 2025

Chris Lattner

News

Product

Paged Attention & Prefix Caching Now Available in MAX Serve

February 6, 2025

Ehsan M. Kermani

News

Series

What exactly is “CUDA”? (Democratizing AI Compute, Part 2)

February 5, 2025

Chris Lattner

News

Engineering

Agentic Building Blocks: Creating AI Agents with MAX Serve and OpenAI Function Calling

January 30, 2025

Ehsan M. Kermani

News

Series

DeepSeek's Impact on AI (Democratizing AI Compute, Part 1)

Part 1 of a series that explores the future of hardware acceleration for AI beyond CUDA, framed in the context of the release of DeepSeek.

January 30, 2025

Chris Lattner

News

Engineering

Use MAX with Open WebUI for RAG and Web Search

Learn how quickly MAX and Open WebUI get you up and running with RAG, web search, and Llama 3.1 on GPU.

January 23, 2025

Bill Welense

News

Engineering

Hands-on with Mojo 24.6

Mojo 24.6 introduces key improvements in argument conventions, memory management, and reference tracking, enhancing code clarity and safety with features like 'mut' for mutable arguments, 'origins' for references, and new collection types.

January 21, 2025

Ehsan M. Kermani
