Pricing

Batch processing: -50%
High-volume processing at half price for maximum efficiency

Cached input tokens: -90% on input tokens
Ideal for repeated prompts, cutting input costs by 90%
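
To see how these discounts affect a bill, here is a minimal sketch in Python. It assumes that cached input tokens are billed at 10% of the normal input price, that the batch discount halves the whole request cost, and that the two discounts can be combined. The page does not state whether they stack, so treat that as an assumption. The function name `estimate_cost` and its parameters are hypothetical and not part of any Mistral SDK.

```python
def estimate_cost(input_tokens, output_tokens, input_price, output_price,
                  cached_input_tokens=0, batch=False):
    """Estimate a request cost in USD; prices are per million tokens."""
    if not 0 <= cached_input_tokens <= input_tokens:
        raise ValueError("cached_input_tokens must be between 0 and input_tokens")
    uncached = input_tokens - cached_input_tokens
    # Assumption: cached input tokens cost 10% of the normal input price (-90%).
    input_cost = (uncached + 0.10 * cached_input_tokens) * input_price / 1_000_000
    output_cost = output_tokens * output_price / 1_000_000
    total = input_cost + output_cost
    # Assumption: batch processing halves the whole bill (-50%).
    return total * 0.5 if batch else total


# Example with the Mistral Medium 3.5 prices listed on this page ($1.5 in, $7.5 out).
standard = estimate_cost(2_000_000, 500_000, 1.5, 7.5)
batched = estimate_cost(2_000_000, 500_000, 1.5, 7.5, batch=True)
cached = estimate_cost(2_000_000, 500_000, 1.5, 7.5, cached_input_tokens=1_500_000)
print(f"standard: ${standard:.2f}, batch: ${batched:.2f}, cached: ${cached:.2f}")
```

Passing prices as arguments keeps the helper independent of any one model, so the same function can be reused with the other per-token prices on this page.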

Our most powerful models and APIs.

Regional Inference Endpoints are now live!

Enterprise APIs

Introducing our Enterprise APIs, including regional data processing controls, system-level SLAs, increased rate limits, and premium support.

Browse our APIs.

Cutting-edge AI models across text, reasoning, vision, and more—ready to integrate into your workflows with minimal coding.

Capabilities: Text-to-text, Reasoning, Coding, Agentic, Multimodal, Lightweight, OCR, Voice, Transcription, Classifier APIs, Embedding, Tools

Licenses: Open, Premier, Labs

Mistral Medium 3.5

License: Open

State-of-the-art performance. Simplified enterprise deployments. Cost-efficient.

Capabilities: Text-to-text, Reasoning, Coding, Agentic, Multimodal

Input (/M tokens): $1.5
Output (/M tokens): $7.5

mistral-medium-latest

Mistral Small 4

License: Open

SOTA. Multimodal. Multilingual. Apache 2.0.

Capabilities: Text-to-text, Agentic, Multimodal, Lightweight

Input (/M tokens): $0.15
Output (/M tokens): $0.6

mistral-small-latest

OCR 4

License: Premier

The world's best document extraction and understanding model.

Capabilities: OCR, Multimodal, Text-to-text

OCR (/1000 pages):
Document AI (/1000 pages):

mistral-ocr-4-0

Mistral Large 3

License: Open

Open-weight, general-purpose, flagship multimodal and multilingual model.

Capabilities: Text-to-text, Multimodal

Input (/M tokens): $0.5
Output (/M tokens): $1.5

mistral-large-latest

Voxtral TTS

License: Open

State-of-the-art text-to-speech generation and voice cloning.

Capabilities: Voice

Audio generation: $0.016 per 1k characters
Available on: /v1/audio/speech

voxtral-mini-tts-latest

Voxtral Mini Transcribe 2

License: Premier

State-of-the-art transcription model.

Capabilities: Transcription, Voice, Lightweight

Audio input (/min): $0.003
Available on: /v1/audio/transcriptions

voxtral-mini-latest

Voxtral Mini Transcribe Realtime

License: Open

Open model fine-tuned and optimized for transcription.

Capabilities: Transcription, Voice

Audio input (/min): $0.006
Available on: /v1/audio/transcriptions

voxtral-mini-transcribe-realtime-2602

Voxtral Small

License: Open

State-of-the-art performance on speech and audio understanding.

Capabilities: Transcription, Voice, Text-to-text

Audio input (/min): $0.004
Text input (/M tokens): $0.1
Output (/M tokens): $0.4
Available on: /v1/chat/completions

voxtral-small-latest
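
The audio prices listed on this page use different units (per minute of audio, per 1k characters of text), so a small helper can make them easier to compare. The sketch below is illustrative only: the function names are hypothetical, the rates are copied from this page, and it assumes billing is proportional to usage with no rounding or minimum charge, which the page does not confirm.

```python
# Rates copied from this page (USD).
TTS_PRICE_PER_1K_CHARS = 0.016  # Voxtral TTS, audio generation
TRANSCRIBE_PRICE_PER_MIN = {
    "voxtral-mini-latest": 0.003,                    # Voxtral Mini Transcribe 2
    "voxtral-mini-transcribe-realtime-2602": 0.006,  # Voxtral Mini Transcribe Realtime
}


def tts_cost(text: str) -> float:
    """Cost of synthesizing `text`, assuming linear per-character billing."""
    return len(text) / 1000 * TTS_PRICE_PER_1K_CHARS


def transcription_cost(model: str, audio_seconds: float) -> float:
    """Cost of transcribing audio, assuming linear per-minute billing."""
    return audio_seconds / 60 * TRANSCRIBE_PRICE_PER_MIN[model]


one_hour = transcription_cost("voxtral-mini-latest", 3600)
speech = tts_cost("x" * 5000)
print(f"1 h transcription: ${one_hour:.3f}; 5,000 characters of speech: ${speech:.3f}")
```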

Devstral 2

License: Open

Open-weights agentic coding model for autonomous software engineering.

Capabilities: Coding, Agentic, Text-to-text

Input (/M tokens): $0.4
Output (/M tokens):

devstral-medium-latest

Devstral Small 2

License: Labs

The best lightweight, open model for coding agents.

Capabilities: Coding, Agentic, Text-to-text, Lightweight, Multimodal

Input (/M tokens): $0.1
Output (/M tokens): $0.3

devstral-small-latest

Codestral

License: Premier

Low-latency coding model optimized for high-frequency completion, fill-in-the-middle, and code generation tasks.

Capabilities: Coding, Text-to-text

Input (/M tokens): $0.3
Output (/M tokens): $0.9

codestral-latest

Leanstral

License: Labs

First open-source code agent for Lean 4.

Capabilities: Coding

API endpoint: Free
We are keeping this endpoint highly accessible for a limited period to gather realistic feedback and observability data to fuel the next generation of verified code models.

labs-leanstral-2603

Magistral Medium

License: Premier

Thinking model excelling in domain-specific, transparent, and multilingual reasoning.

Capabilities: Text-to-text, Reasoning, Multimodal

Input (/M tokens):
Output (/M tokens):

magistral-medium-latest

Magistral Small

License: Premier

Thinking model excelling in domain-specific, transparent, and multilingual reasoning.

Capabilities: Text-to-text, Reasoning, Multimodal, Lightweight

Input (/M tokens): $0.5
Output (/M tokens): $1.5

magistral-small-latest

Ministral 3 - 3B

License: Open

Best-in-class frontier AI to the edge.

Capabilities: Text-to-text, Agentic, Lightweight

Input (/M tokens): $0.1
Output (/M tokens): $0.1

ministral-3b-latest

Ministral 3 - 8B

License: Open

Best-in-class frontier AI to the edge.

Capabilities: Text-to-text, Agentic, Lightweight

Input (/M tokens): $0.15
Output (/M tokens): $0.15

ministral-8b-latest

Ministral 3 - 14B

License: Open

Best-in-class frontier AI to the edge.

Capabilities: Text-to-text, Agentic, Lightweight

Input (/M tokens): $0.2
Output (/M tokens): $0.2

ministral-14b-latest

Classifier API model 3B

Fine-tune Ministral 3B for classification tasks, like moderation, sentiment analysis, fraud detection, and more.

Capabilities: Classifier APIs

Training cost (/M tokens)
One-off training: price per token on the data you want to fine-tune our standard models on; minimum fee of $4 per fine-tuning job.

Storage cost (per month per model)
Price per month per model for storage (irrespective of model usage; models can be deleted at any time).

Input (/M tokens): $0.1
Output (/M tokens): $0.1

Classifier API model 8B

Fine-tune Ministral 8B for classification tasks, like moderation, sentiment analysis, fraud detection, and more.

Capabilities: Classifier APIs

Training cost (/M tokens)
One-off training: price per token on the data you want to fine-tune our standard models on; minimum fee of $4 per fine-tuning job.

Storage cost (per month per model)
Price per month per model for storage (irrespective of model usage; models can be deleted at any time).

Input (/M tokens): $0.04
Output (/M tokens): $0.04

Mistral Moderation

A classifier service for text content moderation.

Capabilities: Classifier APIs

Input (/M tokens): $0.1

mistral-moderation-2603

Codestral Embed

License: Premier

State-of-the-art embeddings for code and natural language queries.

Capabilities: Embedding, Coding

Input (/M tokens): $0.15

codestral-embed

Mistral Embed

State-of-the-art semantic model for extracting representations of text extracts.

Capabilities: Embedding, Text-to-text

Input (/M tokens): $0.1

mistral-embed

Agent API

Enhances AI with built-in tools for code execution, web search, image generation, persistent memory, and agentic orchestration.

Capabilities: Tools

Price: model cost per M tokens + tool call

Libraries

Upload and manage documents, enabling agents to access your external data.

Capabilities: Tools

OCR (per 1K pages):
Indexing (per M tokens):
Call (per call): $0.01

Code execution

Execute and interpret code snippets within the chat interface.

Capabilities: Tools

Price (per 1K calls): $30

Web search

Enhance your work, research, and learning with web search, complete with citations for accurate and up-to-date information.

Capabilities: Tools

Price (per 1K calls): $30

Images

Generate images based on user prompts and preferences.

Capabilities: Tools

Price (per 1K images): $100

Premium news

Access to news articles via integrated news provider verification for enhanced information retrieval.

Capabilities: Tools

Price (per 1K calls): $50

Data capture

Easily record and access API call data for debugging and continuous optimization.

Capabilities: Tools

Price (per M tokens): $0.04
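
Because the tool prices above are quoted per 1K calls, per 1K images, or per call, a per-unit conversion helps when budgeting an agent. The following sketch is a rough estimate under stated assumptions: the dictionary keys and the function name are hypothetical labels, the rates are copied from this page, and billing is assumed to be linear in the number of calls. Model token costs are charged in addition and are not included here.

```python
# Tool prices copied from this page, normalized to USD per single unit.
TOOL_PRICE_PER_UNIT = {
    "code_execution": 30 / 1000,  # $30 per 1K calls
    "web_search": 30 / 1000,      # $30 per 1K calls
    "images": 100 / 1000,         # $100 per 1K images
    "premium_news": 50 / 1000,    # $50 per 1K calls
    "library_call": 0.01,         # $0.01 per call
}


def tool_cost(usage: dict[str, int]) -> float:
    """Sum tool charges for a usage map of {tool name: number of units}."""
    unknown = set(usage) - set(TOOL_PRICE_PER_UNIT)
    if unknown:
        raise KeyError(f"no price listed for: {sorted(unknown)}")
    return sum(TOOL_PRICE_PER_UNIT[name] * count for name, count in usage.items())


monthly = tool_cost({"web_search": 2_000, "code_execution": 500, "images": 40})
print(f"estimated tool charges: ${monthly:.2f}")
```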

Mistral NeMo

License: Open

State-of-the-art Mistral model trained specifically for code tasks.

Capabilities: Coding, Lightweight

Input (/M tokens): $0.15
Output (/M tokens): $0.15

open-mistral-nemo

Mixtral 8x7B

License: Open

A 7B sparse Mixture-of-Experts (SMoE). Uses 12.9B active parameters out of 45B total.

Capabilities: Text-to-text, Lightweight

Input (/M tokens): $0.7
Output (/M tokens): $0.7

open-mixtral-8x7b

Mixtral 8x22B

License: Open

Mixtral 8x22B is currently the most performant open model. A 22B sparse Mixture-of-Experts (SMoE) that uses only 39B active parameters out of 141B.

Capabilities: Text-to-text

Input (/M tokens):
Output (/M tokens):

open-mixtral-8x22b
