-
公开(公告)号:US20240104356A1
公开(公告)日:2024-03-28
申请号:US17934476
申请日:2022-09-22
Applicant: QUALCOMM Incorporated
Inventor: Srijesh SUDARSANAN , Deepak MATHEW , Marc HOFFMAN , Sundar Rajan BALASUBRAMANIAN , Gerald SWEENEY , Mansi JAIN , James LEE , Ankita NAYAK
IPC: G06N3/04
CPC classification number: G06N3/0481
Abstract: Certain aspects of the present disclosure provide techniques and apparatus for quantized machine learning. A quantized input matrix is accessed at a layer of a neural network, and a first interim value is generated in an accumulator by performing matrix multiplication, using the accumulator, of the quantized input matrix and a quantized weight matrix associated with the layer of the neural network. The first interim value is normalized based at least in part on one or more leading sign bits of the first interim value, and the normalized first interim value is dequantized. A second interim value is generated by applying a rounded right-shift operation to the dequantized normalized first interim value, and activation data is generated by applying an activation function to the second interim value.
-
2.
公开(公告)号:US20230351144A1
公开(公告)日:2023-11-02
申请号:US17732372
申请日:2022-04-28
Applicant: QUALCOMM Incorporated
Inventor: Srijesh SUDARSANAN , Deepak MATHEW , Marc HOFFMAN , Sundar Rajan BALASUBRAMANIAN , Mansi JAIN , James LEE , Gerald SWEENEY
Abstract: This application is directed to using a single instruction to initiate a sequence of computational operations related to a neural network activation function. An electronic device receives a single instruction to apply a linear activation operation to a set of first elements stored in one or more input vector registers. In response to the single instruction, the linear activation operation is implemented on the set of first elements to generate a set of output elements. For each first element, the electronic device detects a sign value of the respective first element, selects a respective scalar from one or more scalars based on the sign value, and applies the linear activation operation on the respective first element based on the selected respective scalar and a bias value to generate a respective element of the set of output elements. The electronic device quantizes the set of output elements.
-
公开(公告)号:US20230350678A1
公开(公告)日:2023-11-02
申请号:US17732361
申请日:2022-04-28
Applicant: QUALCOMM Incorporated
Inventor: Srijesh SUDARSANAN , Deepak MATHEW , Marc HOFFMAN , Sundar Rajan BALASUBRAMANIAN , Mansi JAIN , James LEE , Gerald SWEENEY
CPC classification number: G06F9/30101 , G06N3/04
Abstract: This application is directed to using a single instruction to initiate a sequence of computational operations related to a neural network. An electronic device receives a single instruction to apply a neural network operation to a set of M-bit elements stored in one or more input vector registers. In response to the single instruction, the electronic device implements the neural network operation on the set of M-bit elements to generate a set of P-bit elements by obtaining the set of M-bit elements from the one or more input vector registers, quantizing each of the set of M-bit elements from M bits to P bits, and packing the set of P-bit elements into an output vector register. P is smaller than M. In some embodiments, the neural network operation is a quantization operation including at least a multiplication with a quantization factor and an addition with a zero point.
-
公开(公告)号:US20230350640A1
公开(公告)日:2023-11-02
申请号:US17661707
申请日:2022-05-02
Applicant: QUALCOMM Incorporated
Inventor: Sundar Rajan BALASUBRAMANIAN , Srijesh SUDARSANAN , Marc HOFFMAN , Deepak MATHEW , Gerald SWEENEY , James LEE , Mansi JAIN
CPC classification number: G06F7/5443 , G06F17/16
Abstract: A device includes a processor that includes a rotation vector register file, a second vector register file, and multiply-accumulate circuitry (MAC). The rotation vector register file includes a rotation vector register. The rotation vector register file is configured to rotate data in the rotation vector register. The second vector register file includes a source vector register. The MAC is configured to receive first input data from the rotation vector register file and second input data from the source vector register.
-
-
-