Pricing

Our most powerful models and APIs.

Mistral Medium 3.5

State-of-the-art performance. Simplified enterprise deployments. Cost-efficient.

Input (/M tokens)

$1.5

Output (/M tokens)

$7.5

OCR 4

The world's best document extraction and understanding model.

OCR

$4

/ 1000 pages

Batch-API

$2

/ 1000 pages

Document AI

$5

/ 1000 pages

Voxtral TTS

State-of-the-art text-to-speech generation and voice cloning.

Audio generation

$0.016

per 1k characters

Available on

/v1/audio/speech

Enterprise APIs

Introducing our Enterprise APIs, including regional data processing controls, system-level SLAs, increased rate limits, and premium support.

Browse our APIs.

Cutting-edge AI models across text, reasoning, vision, and more—ready to integrate into your workflows with minimal coding.

Capabilities

32 results

Licenses

Mistral Medium 3.5

Open

State-of-the-art performance. Simplified enterprise deployments. Cost-efficient.

Text-to-text

Reasoning

Coding

Agentic

Multimodal

Input (/M tokens)

$1.5

Output (/M tokens)

$7.5

Mistral Small 4

Open

SOTA. Multimodal. Multilingual. Apache 2.0.

Text-to-text

Agentic

Multimodal

Lightweight

Input (/M tokens)

$0.15

Output (/M tokens)

$0.6

OCR 4

Premier

The world's best document extraction and understanding model.

OCR

Multimodal

Text-to-text

OCR

$4

/ 1000 pages

Batch-API

$2

/ 1000 pages

Document AI

$5

/ 1000 pages

Mistral Large 3

Open

Open-weight, general-purpose, flagship multimodal and multilingual model.

Text-to-text

Multimodal

Input (/M tokens)

$0.5

Output (/M tokens)

$1.5

Voxtral TTS

Open

State-of-the-art text-to-speech generation and voice cloning.

Voice

Audio generation

$0.016

per 1k characters

Available on

/v1/audio/speech

Voxtral Mini Transcribe 2

Premier

State-of-the-art transcription model

Transcription

Lightweight

Audio Input/min

$0.003

Available on

/v1/audio/transcriptions

Voxtral Mini Transcribe Realtime

Open

Open model fine-tuned and optimized for transcription.

Audio Input/min

$0.006

Available on

/v1/audio/transcriptions

46930ac2-3c2f-488c-901f-ab54ec24ea7a

Voxtral Small

Open

State-of-the-art performance on speech and audio understanding.

Transcription

Text-to-text

Audio Input (per min / per M tok)

$0.004

Text Input (per min / per M tok)

$0.1

Output (/M tokens)

$0.4

Available on

/v1/chat/completions

Devstral 2

Open

Open-weights agentic coding model for autonomous software engineering.

Coding

Agentic

Text-to-text

Input (/M tokens)

$0.4

Output (/M tokens)

$2

c0b2c725-8e59-4c3c-bfb1-b10088f8a86c

Devstral Small 2

Labs

The best lightweight, open model for coding agents.

Coding

Agentic

Text-to-text

Lightweight

Multimodal

Input (/M tokens)

$0.1

Output (/M tokens)

$0.3

Codestral

Premier

Low-latency coding model optimized for high-frequency completion, fill-in-the-middle, and code generation tasks.

Coding

Text-to-text

Input (/M tokens)

$0.3

Output (/M tokens)

$0.9

Leanstral

Labs

First open-source code agent for Lean 4.

Coding

API endpoint

Free

Magistral Medium

Premier

Thinking model excelling in domain-specific, transparent, and multilingual reasoning.

Text-to-text

Reasoning

Multimodal

Input (/M tokens)

$2

Output (/M tokens)

$5

Magistral Small

Premier

Thinking model excelling in domain-specific, transparent, and multilingual reasoning.

Text-to-text

Reasoning

Multimodal

Lightweight

Input (/M tokens)

$0.5

Output (/M tokens)

$1.5

Ministral 3 - 3B

Open

Best-in-class frontier AI to the edge.

Text-to-text

Agentic

Lightweight

Input (/M tokens)

$0.1

Output (/M tokens)

$0.1

Ministral 3 - 8B

Open

Best-in-class frontier AI to the edge.

Text-to-text

Agentic

Lightweight

Input (/M tokens)

$0.15

Output (/M tokens)

$0.15

Ministral 3 - 14B

Open

Best-in-class frontier AI to the edge.

Text-to-text

Agentic

Lightweight

Input (/M tokens)

$0.2

Output (/M tokens)

$0.2

Classifier API model 3B

Fine-tune Ministral 3B for classification tasks, like moderation, sentiment analysis, fraud detection, and more.

Classifier APIs

Training cost (/M tokens)

$1

Storage cost (per month per model)

$2

Input (/M tokens)

$0.1

Output (/M tokens)

$0.1

Classifier API model 8B

Fine-tune Ministral 8B for classification tasks, like moderation, sentiment analysis, fraud detection, and more.

Classifier APIs

Training cost (/M tokens)

$1

Storage cost (per month per model)

$2

Input (/M tokens)

$0.04

Output (/M tokens)

$0.04

Mistral Moderation

A classifier service for text content moderation.

Classifier APIs

Input (/M tokens)

$0.1

Codestral Embed

Premier

State-of-the-art embeddings for code and natural language queries.

Embedding

Coding

Input (/M tokens)

$0.15

Mistral Embed

State-of-the-art semantic model for extracting representation of text extracts.

Embedding

Text-to-text

Input (/M tokens)

$0.1

Agent API

Enhances AI with built-in tools for code execution, web search, image generation, persistent memory, and agentic orchestration.

Tools

Price

Model cost per M token + tool call

Libraries

Upload and manage documents, enabling agents to access your external data.

Tools

OCR (per 1K pages)

$3

Indexing (per M tokens)

$1

Call (per call)

$0.01

Code execution

Execute and interpret code snippets within the chat interface.

Tools

Price (per 1K calls)

$30

Web search

Enhance your work, research, and learning with web search, complete with citations for accurate and up-to-date information.

Tools

Price (per 1K calls)

$30

Images

Generate images based on user prompts and preferences.

Tools

Price (per 1K images)

$100

Premium news

Access to news articles via integrated news provider verification for enhanced information retrieval.

Tools

Price (per 1K calls)

$50

Data capture

Easily record and access API call data for debugging and continuous optimization.

Tools

Price (per M tokens)

$0.04

44e33e3e-0fd8-43d7-957e-d2689d4c9313

Mistral NeMo

Open

State-of-the-art Mistral model trained specifically for code tasks.

Coding

Lightweight

Input (/M tokens)

$0.15

Output (/M tokens)

$0.15

bf49f0e3-8ed4-4b75-9b65-fd8349e8450a

Mixtral 8x7B

Open

A 7B sparse Mixture-of-Experts (SMoE). Uses 12.9B active parameters out of 45B total.

Text-to-text

Lightweight

Input (/M tokens)

$0.7

Output (/M tokens)

$0.7

fbb679fe-e5df-44e2-adc4-c589c931e16f

Mixtral 8x22B

Open

Mixtral 8x22B is currently the most performant open model. A performant, 22B sparse Mixture-of-Experts (SMoE). Uses only 39B active parameters out of 141B.

Text-to-text

Input (/M tokens)

$2

Output (/M tokens)

$6