Hippocratic AI + Modular to power real-time patient conversations. Read More →

Blog

Thank you! Your submission has been received!
Oops! Something went wrong while submitting the form.
Illustration of a smiling astronaut and a cheerful orange flame character floating in front of a neon-lit triangular background.

Democratizing AI Compute Series

Go behind the scenes of the AI industry with Chris Lattner

🚨

News

Product

AI Agents for AWS Marketplace

Modular Inc. announces MAX High-Performance GenAI Serving and MAX Code Repo Agent now available in AWS Marketplace's new AI Agents and Tools category, delivering 10x performance improvements and streamlined AI deployment for enterprises.

July 16, 2025

/

Modular Team

,  

🚨

News

Product

Modular 25.4: One Container, AMD and NVIDIA GPUs, No Lock-In

We're excited to announce Modular Platform 25.4, a major release that brings the full power of AMD GPUs to our entire platform. This release marks a major leap toward democratizing access to high-performance AI by enabling seamless portability to AMD GPUs.

June 18, 2025

/

Modular Team

,  

🚨

News

Product

Introducing Mammoth: Enterprise-Scale GenAI Deployments Made Simple

Introducing Mammoth, a distributed AI serving tool built specifically for the realities of enterprise AI deployment.

June 10, 2025

/

Modular Team

,  

🚨

News

Product

Modular Platform 25.3: 450K+ Lines of Open Source Code and pip Packaging

Announcing Modular Platform 25.3: our largest open source release, with 450k+ lines of high-performance AI kernels, plus pip install modular.

May 6, 2025

/

Modular Team

,  

🚨

News

Product

A New, Simpler License for MAX and Mojo

New licensing terms for MAX and Mojo that allows for unlimited non-commercial usage

April 23, 2025

/

Modular Team

,  

🚨

News

Product

MAX 25.2: Unleash the power of your H200's–without CUDA!

We’re excited to announce MAX 25.2, a major update that unlocks industry-leading performance on the largest language models–built from the ground up without CUDA.

March 25, 2025

/

Modular Team

,  

🚨

News

Product

MAX 25.1 - Introducing MAX Builds

February 18, 2025

/

Modular Team

,  

🚨

News

Product

Paged Attention & Prefix Caching Now Available in MAX Serve

PagedAttention & Prefix Caching Now Available in MAX Serve

February 6, 2025

/

Ehsan M. Kermani

,  

🚨

News

Product

Introducing MAX 24.6: A GPU Native Generative AI Platform

MAX 24.6 release bog featuring MAX GPU

December 17, 2024

/

Modular Team

,  

🚨

News

Product

MAX 24.5 - With SOTA CPU Performance for Llama 3.1

We’re excited to announce the release of MAX 24.5, which ships with significant improvements to Llama 3.1 CPU performance, new Python graph API bindings, our biggest update to Mojo ever, industry-standard packaging, and a clarified license.

September 13, 2024

/

Modular Team

,  

  • Series

    Democratizing Compute

    Go behind the scenes of the AI industry in this blog series by Chris Lattner. Trace the evolution of AI compute, dissect its current challenges, and discover how Modular is raising the bar with the world’s most open inference stack.

    11 part series

  • Series

    Matrix Multiplication on Blackwell

    Learn how to write a high-performance GPU kernel on Blackwell that offers performance competitive to that of NVIDIA's cuBLAS implementation while leveraging Mojo's special features to make the kernel as simple as possible.

    4 part series

  • Series

    Structured Mojo Kernels

    Learn how Mojo simplifies GPU programming with modular kernel architecture, compile-time abstractions, and zero-cost performance across modern GPU hardware.

    3 part series

  • Series

    Software Pipelining for GPU Kernels

    Explore software pipelining for GPU kernels from first principles. We formalize dependencies as a graph, solve for the optimal schedule with a constraint solver, and show how it all integrates into MAX via pure Mojo.

    1 part series

No items found within this category

We couldn’t find anything. Try changing or resetting your filters.

Build the future of AI with Modular

View Editions
  • Person with blonde hair using a laptop with an Apple logo.

    Sign up today

    Signup to our Cloud Platform today to get started easily.

    Sign Up
  • Magnifying glass emoji with black handle and round clear lens.

    Browse open models

    Browse our model catalog, or deploy your own custom model

    Browse models