Hippocratic AI + Modular to power real-time patient conversations. Read More →

LTX-2.3 logo

LTX-2.3 NVFP4 Audio-Video Generation on Modular

LTX-2.3 NVFP4 is a quantized 22B DiT-based audio-video diffusion model from Lightricks. It generates synchronized video and audio, with support for text, image, video, and audio-conditioned generation workflows.

Example Usage

Audio output
0:00
Code to use
Python

  from openai import OpenAI
  
  client = OpenAI(
      base_url="https://model.api.modular.com",
      api_key="<your_api_token>",
  )
  
  response = client.audio.speech.create(
      model="Lightricks/LTX-2.3-nvfp4",
      voice="alloy",
      input="Welcome to Modular Cloud. What can I help you with?"
  )
  
  response.stream_to_file("output.mp3")
  
Video output
Code to use
Python

  import base64
  from openai import OpenAI
  
  client = OpenAI(
      base_url="https://model.api.modular.com",
      api_key="<your_api_token>",
  )
  
  prompt = """A 2D pop art comic book animation. Thick black ink outlines,
  flat color fills, Ben-Day halftone dots covering every surface,
  limited palette of pink, cyan, red, and black. A rainy back alley
  in Shinjuku at night, rendered as a comic panel: kanji neon signs
  drawn as flat graphic shapes with halftone glow, puddles drawn as
  flat cyan shapes with white highlight marks, rain shown as diagonal
  ink lines. A helmeted astronaut figure in a black raincoat walks
  away from the viewer down the alley, never turning. Steam from a
  ramen vent drawn as cartoon curls. The viewpoint pushes slowly
  forward through the alley. Hand-drawn cel animation feel, no
  photorealism, no 3D shading, no lens effects."""
  
  response = client.responses.create(
      model="Lightricks/LTX-2.3-nvfp4",
      input=prompt,
      extra_body={
          "provider_options": {
              "video": {
                  "height": 512,
                  "width": 512,
                  "steps": 28,
                  "num_frames": 81,
                  "frames_per_second": 16,
                  "response_format": "b64_json",
              }
          }
      },
  )
  
  video_data = response.output[0].content[0].video_data
  
  with open("output.mp4", "wb") as f:
      f.write(base64.b64decode(video_data))
Model Details
  • Developed by
    Lightricks
  • Model family
    Lightricks/LTX-2.3-nvfp4
  • Modality
    Video,
    Audio,
  • Total Params
    22B
  • Precision
    NVFP4
  • Deployment options
    Shared, Dedicated, Self-hosted

Why choose LTX-2.3 on Modular?

  • High performance, out of the box

    Run leading open models with strong default performance and the ability to optimize down to the kernel — extracting more from every GPU.

  • Lower Infrastructure Costs

    Deploy efficiently across NVIDIA and AMD hardware to reduce GPU count, increase throughput, and avoid expensive closed-model licensing.

  • Easy Integration

    Integrate through an OpenAI-compatible endpoint, swap models freely, and scale across clouds or hardware without redesigning your application stack.

LTX-2.3
Want to self-host this model with our open source infrastructure?
Read How

🔥 Trending models

Similar models

Get started with Modular

  • Request a demo

    Schedule a demo of Modular and explore a custom end-to-end deployment built around your models, hardware, and performance goals.

    • Distributed, large-scale online inference endpoints

    • Highest-performance to maximize ROI and latency

    • Deploy in Modular cloud or your cloud

    • View all features with a custom demo

    Book a demo

    Talk with our sales lead Jay!

    30min demo.  Evaluate with your workloads.  Ask us anything.

  • Talk to us!

    Book a demo for a personalized walkthrough of Modular in your environment. Learn how teams use it to simplify systems and tune performance at scale.

    • Custom 30 min walkthrough of our platform

    • Cover specific model or deployment needs

    • Flexible pricing to fit your specific needs

    Book a demo

    Talk with our sales lead Jay!

  • Start using MAX

    ( FREE )

    Run any open source model in 5 minutes, then benchmark it. Scale it to millions yourself (for free!).

  • Start using Mojo

    ( FREE )

    Install Mojo and get up and running in minutes. A simple install, familiar tooling, and clear docs make it easy to start writing code immediately.