The production
AI platform built for developers

Fireworks partners with the world's leading generative AI researchers to serve the best models, at the fastest speeds.

Get Started for Free Contact Sales

Companies of all sizes trust Fireworks to power their production AI use-cases

Models curated and optimized by Fireworks

Image generation

Stable Diffusion XL

Image generation model, produced by stability.ai.

Try now

Image generation

Playground v2 1024

Playground v2 is a diffusion-based text-to-image generative model. The model was trained from scratch by the research team at playground.com.

Try now

Image generation

Segmind Stable Diffusion 1B (SSD-1B)

Image generation model. Distilled from Stable Diffusion XL 1.0 and 50% smaller.

Try now

Image generation

Japanese Stable Diffusion XL

Japanese Stable Diffusion XL (JSDXL) is a Japanese-specific SDXL model that is capable of inputting prompts in Japanese and generating Japanese-style images.

Try now

See all 4 models

The fastest and most uncompromising AI platform!

Fireworks AI

tokens / second

Next provider

tokens / second

average provider

tokens / second

Industry Leading Performance

Independently benchmarked to have the top speed of all inference providers

Enterprise Scale Throughput

Our proprietary stack blows open source options out of the water (see blog)

FireLLaVA: the first commercially permissive OSS LLaVA model

State-of-the-art Models

Use powerful models curated by Fireworks or our in-house trained multi-modal and function-calling models

0 Billion+

tokens served in a day

Battle Tested for Reliability

Fireworks is the 2nd most used open-source model provider and also generates over 1M images/day

fetch("https://api.fireworks.ai/inference/v1/chat/completions", {
  method: "POST",
  headers: {
    "Content-Type": "application
    "Authorization: "Bearer <API KEY>",
  },
  body: JSON.stringify({
    model: "accounts/fireworks/mixtral-8x7b",
    prompt: "Say this is a test",
    max_tokens: 700,
  }),
})
  

Built for Developers

Our OpenAI-compatible API makes it easy to start building with Fireworks!

Level up with Fireworks AI Enterprise

Get dedicated deployments for your models to ensure uptime and speed

Fireworks is proudly compliant with HIPAA and SOC2 and offers secure VPC and VPN connectivity

Meet your needs with data privacy - own your data and your models

The production AI platform built for developers

Models curated and optimized by Fireworks

The fastest and most uncompromising AI platform!

Level up with Fireworks AI Enterprise

The production
AI platform built for developers