MiniMax M3 Launches on NVIDIA with Free Inference Endpoint, Targeting 24/7 Agent Workloads
Chinese AI lab MiniMax has released M3, a 428-billion-parameter mixture-of-experts model with native multimodal support and a one-million-token context window, on NVIDIA's accelerated infrastructure. NVIDIA is offering a free GPU-accelerated endpoint for the model, which is positioned for long-context reasoning and agentic workflows on Blackwell hardware.