Why strap pricey, power-hungry HBM to a job that doesn’t benefit from the bandwidth?

Analysis Nvidia on Tuesday unveiled the Rubin CPX, a GPU designed specifically to accelerate extremely long-context AI workflows, such as those seen in code assistants like Microsoft’s GitHub Copilot, while cutting back on pricey, power-hungry high-bandwidth memory (HBM).…