Your new laptop or phone probably has an NPU. Here is what it is, how it differs from the CPU and GPU, and why chipmakers keep talking about it.
Your new laptop or phone probably has an NPU. Here is what it is, how it differs from the CPU and GPU, and why chipmakers keep talking about it.
An NPU, neural processing unit, is a chip block built specifically to run the math behind AI models efficiently. It exists because CPUs are flexible but power-hungry at this work, and GPUs are fast but thirsty; an NPU does the same neural math at a fraction of the energy, which is what makes on-device AI practical on a phone or a thin laptop.
On-device AI, live transcription, image cleanup, local assistants, has to run without draining your battery. The NPU is what lets those features run continuously and privately, instead of round-tripping to a server. That is why every new chip generation leads with its NPU numbers.
For most people, an NPU is a nice-to-have today and a baseline tomorrow. If you plan to lean on on-device AI features, a stronger NPU will age better, but do not buy on TOPS alone; real-world software support matters more. [VERIFY current device specifics.]
No. A GPU is a general parallel processor; an NPU is specialized for low-power neural inference.
Not strictly, but it is what makes on-device AI features fast and battery-friendly, and it is becoming standard.
Comments