AMD's Ryzen AI Halo Brings 128GB Unified Memory to a $3,999 Mini-PC Built for Local AI Agents

AMD formally unveiled the Ryzen AI Halo on Wednesday: a mini-PC measuring 150 x 150 x 43.2mm that ships with a Ryzen AI Max+ 395 processor, 128GB of unified LPDDR5x-8000 memory, and a 2TB PCIe Gen4 SSD for $3,999. Pre-orders open in June, according to ServeTheHome.

The machine targets a specific bottleneck: memory. Running large local LLMs like OpenAI’s 120-billion parameter GPT OSS or video-generation models like LTX 2.3 requires more RAM than most consumer hardware provides, PCWorld noted. The Halo’s 128GB unified pool, shared between CPU and the 40 Radeon RDNA 3.5 GPU compute units, doubles the 64GB ceiling of Apple’s Mac mini M4, the current default hardware for OpenClaw and local agent operators.

Hardware Specs

The Ryzen AI Max+ 395 processor delivers 16 Zen 5 CPU cores, 32 threads, and boost clocks up to 5.1GHz, according to ServeTheHome. The integrated NPU provides 50 TOPS of AI throughput. Connectivity includes Wi-Fi 7, Bluetooth 5.4, 10GbE LAN, four USB-C ports, and HDMI 2.1. The box weighs just over 1kg.

AMD is supplying both Windows 11 and Linux configurations at parity, a distinction from Nvidia’s DGX Spark, which ships Linux-only, per ServeTheHome. AMD’s product page positions the Halo directly against Nvidia’s $4,699 DGX Spark on price.

The Software Play

The hardware itself is not new. AMD partners like Minisforum have been shipping Ryzen AI Max-based mini-PCs for a year, ServeTheHome observed. What AMD is adding is the software stack: a pre-loaded Ryzen AI Development Center with validated models, frameworks, and developer playbooks. The approach mirrors Nvidia’s DGX ecosystem strategy, bundling optimized software on top of purpose-built hardware rather than shipping a generic box.

The ROCm platform remains AMD’s CUDA equivalent, and the ecosystem gap is real. Most AI developers take a CUDA-first approach, treating Apple Metal and AMD ROCm as secondary, PCWorld noted. AMD is betting that raw memory capacity and an aggressive software investment can offset that disadvantage for local inference workloads.

Cost Analysis

AMD calculated the break-even point at six months, assuming current cloud AI spending of $773 per month, according to PCWorld. That figure is high for individuals but within range for small-to-medium businesses running agent workloads at scale. The counter-argument: AI hardware depreciation cycles are accelerating, and a $3,999 machine purchased today may be outclassed within two years.

Where Agent Builders Fit In

The Mac mini M4 became the default always-on agent host because of its unified memory architecture, low power consumption, and compact form factor. The Halo doubles the available memory while keeping a similar footprint, which matters for agent operators running multiple concurrent models or context-heavy workflows. The 10GbE LAN port is a practical addition for users connecting agent hosts to local networks.

The missing piece is ecosystem maturity. OpenClaw, Ollama, and most local agent toolchains are optimized for Apple Silicon first, ROCm second. Whether AMD’s software investment closes that gap determines whether the Halo becomes a real alternative or stays a spec-sheet winner.

AMD's Ryzen AI Halo Brings 128GB Unified Memory to a $3,999 Mini-PC Built for Local AI Agents

Hardware Specs

The Software Play

Cost Analysis

Where Agent Builders Fit In

Get our morning briefing in your inbox

Keep Reading

CopilotKit Ships AIMock and Pathfinder to Close the Gap Between Agent Demos and Production Systems

Manus AI Weighs $1 Billion Raise to Buy Itself Back From Meta After China Orders Acquisition Unwound

OpenAI Codex for Mac Adds Appshots, Giving Agents Visual and Text Context from Any App Window