Apple's M5 Max Changes the Math on Local AI for Creators
The MacBook Pro M5 Max can run a 70-billion parameter language model at 18 to 25 tokens per second, generate a FLUX image 3.8 times faster than its predecessor, and fit everything in 128GB of unified memory without offloading to the CPU