AMD bets on AI to stretch the memory it can't buy

AMD bets on AI to stretch the memory it can't buy — type0 | type0

PREVIEWAMD bets on AI to stretch the memory it can't buy · MD

AMD has acquired Mext, a predictive-memory startup founded in 2023, for an undisclosed sum, betting that machine learning can do what chipmakers currently cannot: make a gigabyte of memory go further than its silicon was built to go. Mext's software watches how running programs touch data, learns which pages are about to go cold, and pre-stages them onto flash before the workload asks for them again, according to The Register's reporting on the deal. The company claims the approach can expand a system's effective memory by a factor of two to four by leaning on flash for everything DRAM cannot economically hold.

The bet is a software answer to a hardware problem AMD itself helped create. AI training and inference have driven DRAM demand faster than Samsung, SK hynix, and Micron can add fabs, and a kilogram of flash still costs a small fraction of a kilogram of DRAM at the same capacity. AMD sells the GPUs and CPUs that are filling those memory channels, so a way to spend less on memory per server is a way to spend less on every server it ships. The Register quotes an AMD senior vice president framing the buy as a way to relieve that crunch, rather than as a general-purpose data-platform acquisition.

The technical premise borrows the same speculative-execution logic AMD has spent years tuning in its CPUs. A modern branch predictor guesses which instructions a program will run next so the processor can fetch them in advance; Mext's predictor, by the same logic, guesses which data a program will need next so the memory system can fetch it from the right tier in advance. AMD is a natural buyer because the company already knows how to ship a predictor that wins often enough to pay for itself. The novelty is the application to memory placement, since memory tiering, the idea of mixing fast and slow storage, predates the company: Intel's Optane and 3D XPoint did the same job in specialized hardware for years, and OS-level paging has done a coarser version of it for decades.

The 2-4x figure should be read as marketing, not as a measurement. Mext did not publish the workloads, trace patterns, or hit ratios behind it, and no third party has benchmarked the software against a real DRAM baseline. Flash has continued to close the bandwidth gap with main memory, but the latency cost of a miss that falls through to flash is still in microseconds, not nanoseconds, so workloads with bursty or unpredictable access patterns will pay that tax on every cold fetch. The predictor has to be right often enough to make the average land in DRAM, and how often that holds for production AI training, where the working set shifts minute to minute, is the open question the deal does not answer.

What to watch next is whether AMD ships Mext's predictor as a stand-alone software tier, bundles it with its EPYC and Instinct lines, or holds it for the company's own internal infrastructure. The first public benchmarks on real training jobs, not synthetic access traces, will be the moment the 2-4x claim becomes a number anyone outside Mext can rely on. Until then, the acquisition is best read as a hedge: AMD paying a software price today for the chance that the memory crunch does not loosen fast enough to make flash optional.

AMD bets on AI to stretch the memory it can't buy

Sources