Modular: All Articles

🚨

NEW

Industry

Do LLMs eliminate the need for programming languages?

We’re very excited about the positive reception of Mojo since its launch as well as the community of people building around it. Given new Large Language Model (LLM) powered developer tools like Copilot and Ghostwriter, many developers are wondering about the future of programming – do programming languages still matter when AI writes the code?

June 8, 2023

Chris Lattner

Read

🚨

NEW

Product

Accelerating AI model serving with the Modular AI Engine

A few weeks ago, we announced the world’s fastest unified AI inference engine. The Modular AI Engine provides significant usability, portability, and performance gains for the leading AI frameworks — PyTorch and TensorFlow — and delivers world-leading execution performance for all cloud-available CPU architectures.

June 1, 2023

Alexandr Nikitin

Eric Johnson

Read

🚨

NEW

Company

Our launch & what's next

Last week, we launched Modular to the world after more than 16 months in stealth. We started Modular with a deep conviction — after 6+ years of building and scaling AI infrastructure to billions of users and 20+ years of building foundational compute infrastructure — it was clear the world needed a better path forward. Everyone wants less complexity, better access to compute and hardware, and the ability to develop and deploy AI faster.

May 11, 2023

Tim Davis

Read

🚨

NEW

Product

A unified, extensible platform to superpower your AI

We’re excited to finally share what we’ve been building at Modular. This announcement begins Modular’s journey to radically change the nature of AI programmability, usability, scalability, and compute.

May 2, 2023

Chris Lattner

Tim Davis

Eric Johnson

Read

🚨

NEW

Engineering

The world's fastest unified matrix multiplication

In this post, we describe Modular’s approach to solving this problem and its game-changing benefits, including a new standard in state-of-the-art (SOTA) performance on CPU as compared to existing solutions.

April 20, 2023

Abdul Dakkak

Chad Jarvis

Eric Johnson

Hengjie Wang

Ian Tramble

Read

🚨

NEW

Engineering

AI’s compute fragmentation: what matrix multiplication teaches us

AI is powered by a virtuous circle of data, algorithms (“models”), and compute. Growth in one pushes needs in the others and can grossly affect the developer experience on aspects like usability and performance. Today, we have more data and more AI model research than ever before, but compute isn’t scaling at the same speed due to … well, physics.

March 23, 2023

Eric Johnson

Abdul Dakkak

Chad Jarvis

Read

🚨

NEW

Company

We want to hear from you

At Modular, we are rebuilding AI infrastructure for the world. Our goal is to move past AI tools that are themselves research projects and into a future where AI development and deployment are orders of magnitude more efficient for everyone. You should be able to do this without trading off performance or having to rewrite your entire code base.

December 15, 2022

Eric Johnson

Read

🚨

NEW

Engineering

If AI serving tech can’t solve today’s problems, how do we scale into the future?

The technological progress that has been made in AI over the last ten years is breathtaking — from AlexNet in 2012 to the recent release of ChatGPT, which has taken large foundational models and conversational AI to another level.

December 8, 2022

Eric Johnson

Tim Davis

Read

🚨

NEW

Engineering

Part 2: Increasing development velocity of giant AI models

The first four requirements address one fundamental problem with how we've been using MLIR: weights are constant data, but shouldn't be managed like other MLIR attributes. Until now, we've been trying to place a square peg into a round hole, creating a lot of wasted space that's costing us development velocity (and, therefore, money for users of the tools).

November 10, 2022

Abdul Dakkak

Eric Johnson

Read

🚨

NEW

Company

Modular is rebuilding AI in the face of a new economy

Here in November 2022, we see a continuing onslaught of bad news: significant layoffs of incredible people as companies tighten their belts; companies that raised too much money, too fast, without core fundamentals are dying; and a changing climate where over-tightening rather than under-tightening is seemingly the new normal.

November 8, 2022

Chris Lattner

Tim Davis

Read

All Articles (X)

Do LLMs eliminate the need for programming languages?

Accelerating AI model serving with the Modular AI Engine

Our launch & what's next

A unified, extensible platform to superpower your AI

The world's fastest unified matrix multiplication

AI’s compute fragmentation: what matrix multiplication teaches us

We want to hear from you

If AI serving tech can’t solve today’s problems, how do we scale into the future?

Part 2: Increasing development velocity of giant AI models

Modular is rebuilding AI in the face of a new economy

Quick start resources