Patent search ap:("QUALCOMM Incorporated") AND inv:"James Lee" Page 1

1.

Granted invention patent
Instruction set architecture for neural network quantization and packing (Active)

Publication No.: US12159140B2

Publication date: 2024-12-03

Application No.: US17732361

Filing date: 2022-04-28

Applicant: QUALCOMM Incorporated

Inventor: Srijesh Sudarsanan, Deepak Mathew, Marc Hoffman, Sundar Rajan Balasubramanian, Mansi Jain, James Lee, Gerald Sweeney

IPC: G06F9/30, G06N3/04

Abstract: An electronic device receives a single instruction to apply a neural network operation to a set of M-bit elements stored in one or more input vector registers to initiate a sequence of computational operations related to a neural network. In response to the single instruction, the electronic device implements the neural network operation on the set of M-bit elements to generate a set of P-bit elements by obtaining the set of M-bit elements from the one or more input vector registers, quantizing each of the set of M-bit elements from M bits to P bits, and packing the set of P-bit elements into an output vector register. P is smaller than M. In some embodiments, the neural network operation is a quantization operation including at least a multiplication with a quantization factor and an addition with a zero point.
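The quantize-and-pack step described in the abstract can be illustrated with a minimal software sketch. This is not the patented instruction or its hardware behavior; it only mirrors the three stated steps (read M-bit elements, quantize to P bits using a multiplication by a quantization factor and an addition of a zero point, pack the results). The function name `quantize_and_pack`, the choice P = 8, the round-then-saturate behavior, the unsigned output range, and the little-endian lane order are all assumptions made for illustration; the abstract does not specify them.

```python
def quantize_and_pack(elements, scale, zero_point, p_bits=8):
    """Quantize wide integer elements to p_bits each and pack them into one integer.

    Hypothetical helper: rounding mode, saturation, and lane order are
    assumptions, not details taken from the patent abstract.
    """
    lo, hi = 0, (1 << p_bits) - 1          # assumed unsigned P-bit range
    packed = 0
    for lane, x in enumerate(elements):
        # Multiply by the quantization factor, then add the zero point.
        q = int(round(x * scale)) + zero_point
        # Saturate so the value fits in P bits (assumed behavior).
        q = max(lo, min(hi, q))
        # Place the P-bit result in its lane of the output "register".
        packed |= q << (lane * p_bits)
    return packed


# Four wide elements packed into a single 32-bit value.
result = quantize_and_pack([0, 100, 200, 1000], scale=0.5, zero_point=10)
print(hex(result))
```

In this sketch the last element saturates at the top of the 8-bit range, which shows why P being smaller than M generally requires some clamping or range-limiting policy.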
