AI Models
Explore available models and their capabilities
MiniMax
Released May 2026
MiniMax M3
Multimodal, long-horizon agentic, coding, tool use
Capabilities: Reasoning, Tool Use, Vision
About this model
MiniMax M3 is a multimodal foundation model supporting text, image, and video inputs with text output and a 1M-token context window. It is suited for long-horizon agentic work, coding, and tool use, and uses MiniMax Sparse Attention to cut compute cost at full context length while maintaining quality.
Context Window: 1M tokens
Input Cost: $0.30 per 1M tokens
Output Cost: $1.20 per 1M tokens
Input Types: Text, Image, Video
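
The listed prices can be turned into a rough per-request cost estimate. The sketch below is illustrative only: it assumes "/M" means per 1,000,000 tokens, that prices are in US dollars, and that all input (including image and video) is billed as input tokens at the listed rate, which the listing does not confirm. The function and constant names are hypothetical and not part of any MiniMax API.

```python
from decimal import Decimal

# Prices copied from the listing; "/M" is assumed to mean per 1,000,000 tokens.
INPUT_USD_PER_M = Decimal("0.30")
OUTPUT_USD_PER_M = Decimal("1.20")
TOKENS_PER_M = Decimal(1_000_000)


def estimate_cost_usd(input_tokens: int, output_tokens: int) -> Decimal:
    """Estimate the cost of one request in USD (hypothetical helper)."""
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    # Each side is billed linearly: tokens * (price per million) / 1,000,000.
    total = INPUT_USD_PER_M * input_tokens + OUTPUT_USD_PER_M * output_tokens
    return total / TOKENS_PER_M
```

Under these assumptions, a request with 200,000 input tokens and 10,000 output tokens costs 0.06 + 0.012 = 0.072 USD, and filling the full 1M-token context with input alone costs 0.30 USD before any output.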