-
公开(公告)号:US12159140B2
公开(公告)日:2024-12-03
申请号:US17732361
申请日:2022-04-28
Applicant: QUALCOMM Incorporated
Inventor: Srijesh Sudarsanan , Deepak Mathew , Marc Hoffman , Sundar Rajan Balasubramanian , Mansi Jain , James Lee , Gerald Sweeney
Abstract: An electronic device receives a single instruction to apply a neural network operation to a set of M-bit elements stored in one or more input vector registers to initiate a sequence of computational operations related to a neural network. In response to the single instruction, the electronic device implements the neural network operation on the set of M-bit elements to generate a set of P-bit elements by obtaining the set of M-bit elements from the one or more input vector registers, quantizing each of the set of M-bit elements from M bits to P bits, and packing the set of P-bit elements into an output vector register. P is smaller than M. In some embodiments, the neural network operation is a quantization operation including at least a multiplication with a quantization factor and an addition with a zero point.