Modular acquires BentoML to deliver production AI in the cloud!

Blog

Democratizing AI Compute Series

Go behind the scenes of the AI industry with Chris Lattner

Latest

News

Series

Matrix Multiplication on Blackwell: Part 1 - Introduction

This series of blog posts shows how to: 1. Write a high-performance GPU kernel on Blackwell whose performance is competitive with NVIDIA's cuBLAS implementation. 2. Leverage Mojo's special features to keep the kernel as simple as possible.

August 28, 2025 / Ali Taha, Jiexiang Liu, Hengjie Wang

News

Community

Modverse #50: Modular Platform 25.5, Community Meetups, and Mojo's Debut in the Stack Overflow Developer Survey

This past month brought a wave of community projects and milestones across the Modular ecosystem! Modular Platform 25.5 landed with Large Scale Batch Inference, leaner packages, and new integrations that make scaling AI easier than ever. It’s already powering production deployments like SF Compute’s Large Scale Inference Batch API, cutting costs by up to 80% while supporting more than 15 leading models.

August 21, 2025 / Caroline Frasca

News

Product

Modular Platform 25.5: Introducing Large Scale Batch Inference

Modular Platform 25.5 is here, and introduces Large Scale Batch Inference: a highly asynchronous, at-scale batch API built on open standards and powered by Mammoth. We're launching this new capability through our partner SF Compute, enabling high-volume AI performance with a fast, accurate, and efficient platform that seamlessly scales workloads across any hardware.

August 5, 2025 / Modular Team

News

Company

SF Compute and Modular Partner to Revolutionize AI Inference Economics

Modular has partnered with SF Compute to address a fundamental asymmetry in the AI ecosystem: while model capabilities advance exponentially, the economic structures governing compute costs remain anchored in legacy paradigms. 

July 31, 2025 / Modular Team, SF Compute Team

News

Product

AI Agents for AWS Marketplace

Modular Inc. announces MAX High-Performance GenAI Serving and MAX Code Repo Agent now available in AWS Marketplace's new AI Agents and Tools category, delivering 10x performance improvements and streamlined AI deployment for enterprises.

July 16, 2025 / Modular Team

News

Community

Modverse #49: Modular Platform 25.4, Modular 🤝 AMD, and Modular Hack Weekend

Between a global hackathon, a major release, and standout community projects, last month was full of progress across the Modular ecosystem! Modular Platform 25.4 launched on June 18th, alongside the announcement of our official partnership with AMD, bringing full support for AMD Instinct™ MI300X and MI325X GPUs. You can now deploy the same container across both AMD and NVIDIA hardware with no code changes, no vendor lock-in, and no additional configuration!

July 9, 2025 / Caroline Frasca

News

Community

Inside Modular Hack Weekend: Top Projects and Community Highlights

July 3, 2025 / Modular Team

News

Series

How is Modular Democratizing AI Compute? (Democratizing AI Compute, Part 11)

Given time, budget, and expertise from a team of veterans who’ve built this stack before, Modular set out to solve one of the defining challenges of our era: how to Democratize AI Compute. But what does that really mean—and how does it all add up?

June 20, 2025 / Chris Lattner

News

Product

Modular 25.4: One Container, AMD and NVIDIA GPUs, No Lock-In

We're excited to announce Modular Platform 25.4, a major release that brings the full power of AMD GPUs to our entire platform. This release marks a major leap toward democratizing access to high-performance AI by enabling seamless portability to AMD GPUs.

June 18, 2025 / Modular Team

News

Product

Introducing Mammoth: Enterprise-Scale GenAI Deployments Made Simple

Introducing Mammoth, a distributed AI serving tool built specifically for the realities of enterprise AI deployment.

June 10, 2025 / Modular Team
