Product Articles

Product

Modular 25.4: One Container, AMD and NVIDIA GPUs, No Lock-In

We're excited to announce Modular Platform 25.4, a major release that brings the full power of AMD GPUs to our entire platform. This release marks a major leap toward democratizing access to high-performance AI by enabling seamless portability to AMD GPUs.

June 18, 2025

Modular Team

Product

Introducing Mammoth: Enterprise-Scale GenAI Deployments Made Simple

Introducing Mammoth, a distributed AI serving tool built specifically for the realities of enterprise AI deployment.

June 10, 2025

Modular Team

Product

Modular Platform 25.3: 450K+ Lines of Open Source Code and pip Packaging

Announcing Modular Platform 25.3: our largest open source release, with 450k+ lines of high-performance AI kernels, plus pip install modular.

May 6, 2025

Modular Team

Product

A New, Simpler License for MAX and Mojo

New licensing terms for MAX and Mojo that allow unlimited non-commercial usage.

April 23, 2025

Modular Team

Product

MAX 25.2: Unleash the power of your H200's–without CUDA!

We’re excited to announce MAX 25.2, a major update that unlocks industry-leading performance on the largest language models–built from the ground up without CUDA.

March 25, 2025

Modular Team

Product

MAX 25.1 - Introducing MAX Builds

February 18, 2025

Modular Team

Product

Paged Attention & Prefix Caching Now Available in MAX Serve

PagedAttention & Prefix Caching Now Available in MAX Serve

February 6, 2025

Ehsan M. Kermani

Product

Introducing MAX 24.6: A GPU Native Generative AI Platform

MAX 24.6 release blog featuring MAX GPU

December 17, 2024

Modular Team

Product

MAX 24.5 - With SOTA CPU Performance for Llama 3.1

We’re excited to announce the release of MAX 24.5, which ships with significant improvements to Llama 3.1 CPU performance, new Python graph API bindings, our biggest update to Mojo ever, industry-standard packaging, and a clarified license.

September 13, 2024

Modular Team

Product

Develop locally, deploy globally

The recent surge in AI application development can be attributed to several factors: (1) advancements in machine learning algorithms that unlock previously intractable use cases, (2) the exponential growth in computational power enabling the training of ever-more complex models, and (3) the ubiquitous availability of vast datasets required to fuel these algorithms. However, as AI projects become increasingly pervasive, effective development paradigms, like those commonly found in traditional software development, remain elusive.

July 9, 2024

Modular Team
