Pricing
Our most powerful models and APIs.
Mistral Medium 3.5
State-of-the-art performance. Simplified enterprise deployments. Cost-efficient.
Input (/M tokens)
$1.5
Output (/M tokens)
$7.5
OCR 4
The world's best document extraction and understanding model.
OCR
$4
/ 1000 pages
Batch-API
$2
/ 1000 pages
Document AI
$5
/ 1000 pages
Voxtral TTS
State-of-the-art text-to-speech generation and voice cloning.
Audio generation
$0.016
per 1k characters
Available on
/v1/audio/speech
Enterprise APIs
Introducing our Enterprise APIs, including regional data processing controls, system-level SLAs, increased rate limits, and premium support.
Browse our APIs.
Cutting-edge AI models across text, reasoning, vision, and more—ready to integrate into your workflows with minimal coding.
Mistral Medium 3.5
Open
State-of-the-art performance. Simplified enterprise deployments. Cost-efficient.
Text-to-text
Reasoning
Coding
Agentic
Multimodal
Input (/M tokens)
$1.5
Output (/M tokens)
$7.5
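As a rough illustration of how per-million-token prices translate into a bill, the sketch below multiplies token volumes by the listed input and output prices. The function name `token_cost` and the example volumes are hypothetical, and the calculation assumes simple linear billing with no discounts, caching, or minimums, none of which this page specifies.

```python
from decimal import Decimal

TOKENS_PER_MILLION = Decimal(1_000_000)


def token_cost(input_tokens: int, output_tokens: int,
               input_price_per_m: str, output_price_per_m: str) -> Decimal:
    """Return the USD cost of a token volume at per-million-token prices.

    Assumes linear billing (hypothetical helper, not an official calculator).
    """
    input_cost = Decimal(input_tokens) / TOKENS_PER_MILLION * Decimal(input_price_per_m)
    output_cost = Decimal(output_tokens) / TOKENS_PER_MILLION * Decimal(output_price_per_m)
    return input_cost + output_cost


# Mistral Medium 3.5 list prices from this page: $1.5 input, $7.5 output per M tokens.
# Example volume (made up): 2M input tokens and 0.5M output tokens.
medium_cost = token_cost(2_000_000, 500_000, "1.5", "7.5")
print(f"${medium_cost:.2f}")  # 2 * 1.5 + 0.5 * 7.5 = $6.75
```

The same helper works for any card priced per million tokens: pass that card's input and output prices instead.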
Mistral Small 4
Open
SOTA. Multimodal. Multilingual. Apache 2.0.
Text-to-text
Agentic
Multimodal
Lightweight
Input (/M tokens)
$0.15
Output (/M tokens)
$0.6
OCR 4
Premier
The world's best document extraction and understanding model.
OCR
Multimodal
Text-to-text
OCR
$4
/ 1000 pages
Batch-API
$2
/ 1000 pages
Document AI
$5
/ 1000 pages
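To compare the three OCR 4 price points for a given workload, a small sketch like the following may help. The mode keys and the `page_cost` helper are hypothetical names, and the code assumes pages are billed pro rata rather than in whole blocks of 1000, which this page does not state.

```python
from decimal import Decimal

# USD per 1000 pages, taken from the OCR 4 card on this page.
OCR_PRICES_PER_1K_PAGES = {
    "ocr": Decimal("4"),
    "batch_api": Decimal("2"),
    "document_ai": Decimal("5"),
}


def page_cost(pages: int, mode: str) -> Decimal:
    """Return the USD cost of processing `pages` pages in the given mode.

    Assumes pro rata billing per page (an assumption, not confirmed here).
    """
    return Decimal(pages) / Decimal(1000) * OCR_PRICES_PER_1K_PAGES[mode]


# Example workload (made up): 25,000 pages.
costs = {mode: page_cost(25_000, mode) for mode in OCR_PRICES_PER_1K_PAGES}
for mode, cost in costs.items():
    print(f"{mode}: ${cost:.2f}")
# ocr: $100.00, batch_api: $50.00, document_ai: $125.00
```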
Mistral Large 3
Open
Open-weight, general-purpose, flagship multimodal and multilingual model.
Text-to-text
Multimodal
Input (/M tokens)
$0.5
Output (/M tokens)
$1.5
Voxtral TTS
Open
State-of-the-art text-to-speech generation and voice cloning.
Voice
Audio generation
$0.016
per 1k characters
Available on
/v1/audio/speech
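For text-to-speech, the listed price is per 1k characters, so an estimate only needs the length of the input text. The sketch below assumes characters are counted the way Python's `len` counts them (Unicode code points); how the service actually counts characters is not stated on this page, and `tts_cost` is a hypothetical helper.

```python
from decimal import Decimal

# USD per 1k characters, from the Voxtral TTS card on this page.
TTS_PRICE_PER_1K_CHARS = Decimal("0.016")


def tts_cost(text: str) -> Decimal:
    """Estimate the USD cost of synthesizing `text`.

    Counts characters with len(), which is an assumption about billing.
    """
    return Decimal(len(text)) / Decimal(1000) * TTS_PRICE_PER_1K_CHARS


# Example input (made up): a 14-character phrase repeated 5,000 times = 70,000 characters.
script = "Hello, world! " * 5_000
script_cost = tts_cost(script)
print(f"{len(script)} characters -> ${script_cost:.2f}")  # 70000 characters -> $1.12
```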
Voxtral Mini Transcribe 2
Premier
State-of-the-art transcription model.
Transcription
Lightweight
Audio Input/min
$0.003
Available on
/v1/audio/transcriptions
Voxtral Mini Transcribe Realtime
Open
Open model fine-tuned and optimized for transcription.
Audio Input/min
$0.006
Available on
/v1/audio/transcriptions
Voxtral Small
Open
State-of-the-art performance on speech and audio understanding.
Transcription
Text-to-text
Audio Input (/min)
$0.004
Text Input (/M tokens)
$0.1
Output (/M tokens)
$0.4
Available on
/v1/chat/completions
Devstral 2
Open
Open-weights agentic coding model for autonomous software engineering.
Coding
Agentic
Text-to-text
Input (/M tokens)
$0.4
Output (/M tokens)
$2
Devstral Small 2
Labs
The best lightweight, open model for coding agents.
Coding
Agentic
Text-to-text
Lightweight
Multimodal
Input (/M tokens)
$0.1
Output (/M tokens)
$0.3
Codestral
Premier
Low-latency coding model optimized for high-frequency completion, fill-in-the-middle, and code generation tasks.
Coding
Text-to-text
Input (/M tokens)
$0.3
Output (/M tokens)
$0.9
API endpoint
Free
Magistral Medium
Premier
Thinking model excelling in domain-specific, transparent, and multilingual reasoning.
Text-to-text
Reasoning
Multimodal
Input (/M tokens)
$2
Output (/M tokens)
$5
Magistral Small
Premier
Thinking model excelling in domain-specific, transparent, and multilingual reasoning.
Text-to-text
Reasoning
Multimodal
Lightweight
Input (/M tokens)
$0.5
Output (/M tokens)
$1.5
Ministral 3 - 3B
Open
Brings best-in-class frontier AI to the edge.
Text-to-text
Agentic
Lightweight
Input (/M tokens)
$0.1
Output (/M tokens)
$0.1
Ministral 3 - 8B
Open
Brings best-in-class frontier AI to the edge.
Text-to-text
Agentic
Lightweight
Input (/M tokens)
$0.15
Output (/M tokens)
$0.15
Ministral 3 - 14B
Open
Brings best-in-class frontier AI to the edge.
Text-to-text
Agentic
Lightweight
Input (/M tokens)
$0.2
Output (/M tokens)
$0.2
Classifier API model 3B
Fine-tune Ministral 3B for classification tasks, like moderation, sentiment analysis, fraud detection, and more.
Classifier APIs
Training cost (/M tokens)
$1
Storage cost (per month per model)
$2
Input (/M tokens)
$0.1
Output (/M tokens)
$0.1
Classifier API model 8B
Fine-tune Ministral 8B for classification tasks, like moderation, sentiment analysis, fraud detection, and more.
Classifier APIs
Training cost (/M tokens)
$1
Storage cost (per month per model)
$2
Input (/M tokens)
$0.04
Output (/M tokens)
$0.04
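The Classifier API cards combine a one-time training cost, a recurring storage cost, and per-token inference prices. One way to total them is sketched below. The helper `classifier_cost` and all example volumes are hypothetical, the default prices are those of the 3B card, and the code assumes that training tokens are billed once and that storage is billed per whole month per model; neither detail is confirmed on this page.

```python
from decimal import Decimal

MILLION = Decimal(1_000_000)


def classifier_cost(training_tokens: int, months_stored: int,
                    input_tokens: int, output_tokens: int, *,
                    training_price_per_m: str = "1",
                    storage_price_per_month: str = "2",
                    input_price_per_m: str = "0.1",
                    output_price_per_m: str = "0.1") -> Decimal:
    """Total USD cost for one fine-tuned classifier (defaults: 3B card prices)."""
    training = Decimal(training_tokens) / MILLION * Decimal(training_price_per_m)
    storage = Decimal(months_stored) * Decimal(storage_price_per_month)
    inference = (Decimal(input_tokens) / MILLION * Decimal(input_price_per_m)
                 + Decimal(output_tokens) / MILLION * Decimal(output_price_per_m))
    return training + storage + inference


# Example (made up): 50M training tokens, 3 months of storage,
# 200M inference input tokens and 1M inference output tokens.
total_3b = classifier_cost(50_000_000, 3, 200_000_000, 1_000_000)
print(f"${total_3b:.2f}")  # 50 + 6 + 20 + 0.1 = $76.10
```

For the 8B card, pass `input_price_per_m` and `output_price_per_m` with that card's listed values.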
Mistral Moderation
A classifier service for text content moderation.
Classifier APIs
Input (/M tokens)
$0.1
Codestral Embed
Premier
State-of-the-art embeddings for code and natural language queries.
Embedding
Coding
Input (/M tokens)
$0.15
Mistral Embed
State-of-the-art semantic model for extracting representations of text extracts.
Embedding
Text-to-text
Input (/M tokens)
$0.1
Agent API
Enhances AI with built-in tools for code execution, web search, image generation, persistent memory, and agentic orchestration.
Tools
Price
Model cost per M token + tool call
Libraries
Upload and manage documents, enabling agents to access your external data.
Tools
OCR (per 1K pages)
$3
Indexing (per M tokens)
$1
Call (per call)
$0.01
Code execution
Execute and interpret code snippets within the chat interface.
Tools
Price (per 1K calls)
$30
Web search
Enhance your work, research, and learning with web search, complete with citations for accurate and up-to-date information.
Tools
Price (per 1K calls)
$30
Images
Generate images based on user prompts and preferences.
Tools
Price (per 1K images)
$100
Premium news
Access to news articles via integrated news provider verification for enhanced information retrieval.
Tools
Price (per 1K calls)
$50
Data capture
Easily record and access API call data for debugging and continuous optimization.
Tools
Price (per M tokens)
$0.04
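The Agent API is priced as model cost per M tokens plus tool calls, so a rough estimate adds the per-1K tool prices listed on the cards above to whatever the model tokens cost. The sketch below is a minimal illustration: the tool keys, the `agent_cost` helper, and the example figures are hypothetical, and it assumes each tool is billed pro rata per call (or per image).

```python
from decimal import Decimal

# USD per 1,000 units (calls or images), from the tool cards on this page.
TOOL_PRICES_PER_1K = {
    "code_execution": Decimal("30"),
    "web_search": Decimal("30"),
    "premium_news": Decimal("50"),
    "images": Decimal("100"),
}


def agent_cost(model_cost: Decimal, tool_usage: dict[str, int]) -> Decimal:
    """Return model token cost plus the cost of the listed tool usage.

    Assumes pro rata billing per unit (an assumption, not confirmed here).
    """
    tool_cost = sum(
        (Decimal(count) / Decimal(1000) * TOOL_PRICES_PER_1K[tool]
         for tool, count in tool_usage.items()),
        Decimal(0),
    )
    return model_cost + tool_cost


# Example (made up): $6.75 of model tokens, 400 web searches,
# 150 code executions, and 20 generated images.
total = agent_cost(
    Decimal("6.75"),
    {"web_search": 400, "code_execution": 150, "images": 20},
)
print(f"${total:.2f}")  # 6.75 + 12 + 4.5 + 2 = $25.25
```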
Mistral NeMo
Open
State-of-the-art Mistral model trained specifically for code tasks.
Coding
Lightweight
Input (/M tokens)
$0.15
Output (/M tokens)
$0.15
Mixtral 8x7B
Open
A 7B sparse Mixture-of-Experts (SMoE). Uses 12.9B active parameters out of 45B total.
Text-to-text
Lightweight
Input (/M tokens)
$0.7
Output (/M tokens)
$0.7

Mixtral 8x22B
Open
A performant 22B sparse Mixture-of-Experts (SMoE) model. Uses only 39B active parameters out of 141B total.
Text-to-text
Input (/M tokens)
$2
Output (/M tokens)
$6




