Modular: All Articles

🚨

NEW

Community

How I Beat Unsloth's CUDA Kernel Using Mojo—With Zero GPU Experience

GPU programming has a steep learning curve. The performance gains are massive, but the path to get there (CUDA, PTX, memory hierarchies, occupancy tuning) stops most developers before they start. Mojo claims to flatten that curve: Python-like syntax, systems-level performance, no interop gymnastics.

January 12, 2026

David Robertson

Read

🚨

NEW

Community

🔥 Modular 2025 Year in Review

Our four-part series documenting the path to record-breaking matrix multiplication performance became essential reading for anyone serious about LLM optimization. The series walks through every optimization step—from baseline implementations to advanced techniques like warp specialization and async copies—showing you exactly how to extract maximum performance from cutting-edge hardware.

December 19, 2025

Michael Dunn-OConnor

Read

🚨

NEW

Product

The path to Mojo 1.0

While we are excited about this milestone, this of course won’t be the end of Mojo development! Some commonly requested capabilities for more general systems programming won’t be completed for 1.0, such as a robust async programming model and support for private members. Read below for more information on that!

December 5, 2025

Modular Team

Read

🚨

NEW

Community

Modverse #52: Advancing AI Together — Community Projects & Platform Milestones

The Modular universe is buzzing! From next-level community projects to recognition across the AI and developer space, here’s the latest from our growing ecosystem.

December 3, 2025

Inaara Walji

Read

🚨

NEW

Product

Modular 25.7: Faster Inference, Safer GPU Programming, and a More Unified Developer Experience

Today, we’re excited to release Modular Platform 25.7, an update that deepens our vision of a unified, high-performance compute layer for AI. With a fully open MAX Python API, an experimental next-generation modeling API, expanded hardware support for NVIDIA Grace superchips, and a safer, more capable Mojo GPU programming experience, this release moves us closer to an ecosystem where developers spend less time fighting infrastructure and more time advancing what AI can do.

November 20, 2025

Modular Team

Read

🚨

NEW

Company

"TTS 1 Max" (powered by Modular Platform) Ranked #1 Speech Model on Artificial Analysis

November 7, 2025

Modular Team

Read

🚨

NEW

Community

PyTorch and LLVM in 2025 — Keeping up With AI Innovation

November 6, 2025

Michael Dunn-OConnor

Read

🚨

NEW

Engineering

Achieving State-of-the-Art Performance on AMD MI355 — in Just 14 Days

October 17, 2025

Tracy Sharpe

Anand Pratap Singh

Prince Jain

Abdul Dakkak

Read

🚨

NEW

Company

Modular Raises $250M to scale AI's Unified Compute Layer

Modular Raises $250M in Third Round to Unify AI Compute

September 24, 2025

Modular Team

Read

🚨

NEW

Product

Modular 25.6: Unifying the latest GPUs from NVIDIA, AMD, and Apple

We’re excited to announce Modular Platform 25.6 – a major milestone in our mission to build AI’s unified compute layer. With 25.6, we’re delivering the clearest proof yet of our mission: a unified compute layer that spans from laptops to the world’s most powerful datacenter GPUs. The platform now delivers:

September 22, 2025

Modular Team

Read

All Articles (X)

How I Beat Unsloth's CUDA Kernel Using Mojo—With Zero GPU Experience

🔥 Modular 2025 Year in Review

The path to Mojo 1.0

Modverse #52: Advancing AI Together — Community Projects & Platform Milestones

Modular 25.7: Faster Inference, Safer GPU Programming, and a More Unified Developer Experience

"TTS 1 Max" (powered by Modular Platform) Ranked #1 Speech Model on Artificial Analysis

PyTorch and LLVM in 2025 — Keeping up With AI Innovation

Achieving State-of-the-Art Performance on AMD MI355 — in Just 14 Days

Modular Raises $250M to scale AI's Unified Compute Layer

Modular 25.6: Unifying the latest GPUs from NVIDIA, AMD, and Apple

Quick start resources