Product Articles

Modular Platform 25.5: Introducing Large Scale Batch Inference

Modular Platform 25.5 is here and introduces Large Scale Batch Inference: a highly asynchronous, at-scale batch API built on open standards and powered by Mammoth. We're launching this new capability through our partner SF Compute, enabling high-volume AI performance with a fast, accurate, and efficient platform that seamlessly scales workloads across any hardware.

August 5, 2025 / Modular Team

AI Agents for AWS Marketplace

Modular Inc. announces MAX High-Performance GenAI Serving and MAX Code Repo Agent now available in AWS Marketplace's new AI Agents and Tools category, delivering 10x performance improvements and streamlined AI deployment for enterprises.

July 16, 2025 / Modular Team

Modular 25.4: One Container, AMD and NVIDIA GPUs, No Lock-In

We're excited to announce Modular Platform 25.4, a major release that brings the full power of AMD GPUs to our entire platform. This release marks a major leap toward democratizing access to high-performance AI by enabling seamless portability to AMD GPUs.

June 18, 2025 / Modular Team

Introducing Mammoth: Enterprise-Scale GenAI Deployments Made Simple

Introducing Mammoth, a distributed AI serving tool built specifically for the realities of enterprise AI deployment.

June 10, 2025 / Modular Team

Modular Platform 25.3: 450K+ Lines of Open Source Code and pip Packaging

Announcing Modular Platform 25.3: our largest open source release, with 450k+ lines of high-performance AI kernels, plus pip install modular.

May 6, 2025 / Modular Team

A New, Simpler License for MAX and Mojo

New licensing terms for MAX and Mojo that allow unlimited non-commercial usage.

April 23, 2025 / Modular Team

MAX 25.2: Unleash the power of your H200's–without CUDA!

We’re excited to announce MAX 25.2, a major update that unlocks industry-leading performance on the largest language models, built from the ground up without CUDA.

March 25, 2025 / Modular Team

MAX 25.1 - Introducing MAX Builds

February 18, 2025 / Modular Team

Paged Attention & Prefix Caching Now Available in MAX Serve

February 6, 2025 / Ehsan M. Kermani

Introducing MAX 24.6: A GPU Native Generative AI Platform

MAX 24.6 release blog featuring MAX GPU

December 17, 2024 / Modular Team
