Join our newsletter

Get all our latest news, announcements and updates delivered directly to your inbox. Unsubscribe at anytime.

Email*

First Name

Last Name

Thanks for signing up to our newsletter! 🚀

Oops! Something went wrong while submitting the form.

Return to page

Product

MODULAR PLATFORM

MAX Framework

GenAI serving framework

Mojo Language

The best GPU & CPU performance

Mammoth

Scale intelligently to any cluster

DEPLOYMENT OPTIONS

Editions

All the ways you can use Modular

AI Agents

Build agent workflows

RAG & CAG

AI retrieval and controlled generation

Chatbots

Conversations and interactions

Code Generation

Work with top open code gen models

Batch processing

Improve resource utilization

AI Inference

Fast, Scalable AI Inference

Research

Model & kernel development

Resources

Docs

Get up and running. Fast.

Models

500+ supported open models

Tutorials

Build amazing things

Recipes

Step-by-step guides

GPU Puzzles

Learn GPU Programming

Community

Build the future of AI together

About

Build AI for anyone, anywhere.

Careers

We’re currently hiring!

Culture

What we believe

Request a demo

Request Demo

Get Started

Get started

All Articles (X)

Topics

Topic

Popular

🔥

Community

Engineering

Product

Company

Authors

Abdul Dakkak

Alex Kirchhoff

Alexandr Nikitin

Ali Taha

Andrew Luo

Arjun Surendran

Arthur Evans

Ash Vardanian

Austin Doolittle

Bill Welense

Billy Zhu

Blake Huang

Brendan Duke

Brendan Hansknecht

Chad Jarvis

Chris Hoge

Chris Lattner

Dan Moldovan

Deep Dhillon

Denali Lumma

Ehsan M. Kermani

Eric Johnson

Evan Ovadia

Fabian Tschopp

Feras Boulala

Goldie Gadde

Hengjie Wang

Ian Tramble

Jack Clayton

Jakub Tucholski

Jeff Niu

Joe Loser

Joe Williams

Kalor Lewis

Kate Caldwell

Konstantinos Krommydas

Laszlo Kindrat

Liam Stewart

Liina Lind

Matthew Brookhart

Max Hutchinson

Mike Edwards

Mikhail Zolotukhin

Mostafa Hagog

Navroop Bath

Paige Bedwell

Patrick Beck

Rashid Kaleem

Robert Webb

Ryan Guo

Scott Main

Sean Paradiso

Shashank Prasanna

Shashank Sharma

Stef Lindall

Steffi Stumpos

Stephen McGroarty

Swetha Muniraju

Tatiana Shpeisman

Tim Davis

Tracy Sharpe

Tristan Konolige

Tyler Kenney

Walter Erquinigo

Weiwei Chen

William Hatch

Yihua Lou

Zac Bowling

Clear

Thank you! Your submission has been received!

Oops! Something went wrong while submitting the form.

🚨

NEW

Company

Why do HW companies struggle to build AI software? (Democratizing AI Compute, Part 9)

April 22, 2025

Chris Lattner

Read

🚨

NEW

Community

Modverse #47: MAX 25.2 and an evening of GPU programming at Modular HQ

MAX 25.2 is turning heads — and for good reason. This powerful update delivers industry-leading performance for large language models on NVIDIA GPUs, all without CUDA. MAX 25.2 builds on the momentum of 25.1 and introduces major upgrades to help you build GenAI systems that are faster, leaner, and easier to scale.

April 17, 2025

Caroline Frasca

Read

🚨

NEW

Company

What about the MLIR compiler infrastructure? (Democratizing AI Compute, Part 8)

April 8, 2025

Chris Lattner

Read

🚨

NEW

Company

What about Triton and Python eDSLs? (Democratizing AI Compute, Part 7)

In this post, we’ll break down how Python eDSLs work, their strengths and weaknesses, and take a close look at Triton.

March 26, 2025

Chris Lattner

Read

🚨

NEW

Product

MAX 25.2: Unleash the power of your H200's–without CUDA!

We’re excited to announce MAX 25.2, a major update that unlocks industry-leading performance on the largest language models–built from the ground up without CUDA.

March 25, 2025

Modular Team

Read

🚨

NEW

Company

What about TVM, XLA, and AI compilers? (Democratizing AI Compute, Part 6)

March 12, 2025

Chris Lattner

Read

🚨

NEW

Company

What about OpenCL and CUDA C++ alternatives? (Democratizing AI Compute, Part 5)

March 5, 2025

Chris Lattner

Read

🚨

NEW

Community

Modverse #46: MAX 25.1, MAX Builds, and Democratizing AI Compute

We recently introduced MAX 25.1, a major leap forward in AI development. This release enhances agentic and LLM workflows, introduces MAX Builds as a central hub for GenAI models and application recipes, and debuts a new GPU programming interface. Developers can now take advantage of GPU-accelerated embeddings, OpenAI-compatible function calling, structured output generation, and high-performance LLM optimizations like paged attention and prefix caching for improved efficiency.

February 27, 2025

Caroline Frasca

Read

🚨

NEW

Company

CUDA is the incumbent, but is it any good? (Democratizing AI Compute, Part 4)

Answering the question of whether CUDA is “good” is much trickier than it sounds.

February 20, 2025

Chris Lattner

Read

🚨

NEW

Product

MAX 25.1 - Introducing MAX Builds

February 18, 2025

Modular Team

Read

+ Load 10 more

4 / 13

🤔

No results for this query

Reset all filters

Start building with Modular

Get started - Docs

Quick start resources

Get started guide
With just a few commands, you can install MAX as a conda package and deploy a GenAI model on a local endpoint.
Read Guide
Browse open source models
500+ supported models, most of which have been optimized for lightning fast speed on the Modular platform.
Browse Models
Find examples
Follow step by step recipes to build Agents, chatbots, and more with MAX.
View Recipes

Latest from our blog:

New

🔥 Modular 2025 Year in Review

Get the latest news,
announcements & updates:

Join our Newsletter

Product
Quick Start
Solutions
Batch inference
AI Agents
AI Inference
Chatbots
Code Generation
RAG & CAG
Research
Developers
Connect
Company

Terms, Privacy & Acceptable Use