ASYNCHRONOUS MIXED PRECISION UPDATE OF RESISTIVE PROCESSING UNIT ARRAY

    公开(公告)号:US20220391684A1

    公开(公告)日:2022-12-08

    申请号:US17336392

    申请日:2021-06-02

    IPC分类号: G06N3/08 G06N3/063 G06F15/80

    摘要: A computer-implemented method, computer program product, and/or computer system that performs the following operations: (i) receiving outputs pertaining to a first step of a training process being performed on an analog resistive processing unit (RPU) array, the analog RPU array corresponding to a layer of a deep neural network (DNN); (ii) converting the outputs into a format having less precision, yielding converted outputs; (iii) initiating a calculation of an update parameter for a first step update pass of the layer utilizing the converted outputs; and (v) based, at least in part, on receiving outputs pertaining to a second step of the training process being performed on the analog RPU array, applying the update parameter for the first step update pass of the layer to the analog RPU array.

    WEIGHT REPETITION ON RPU CROSSBAR ARRAYS

    公开(公告)号:US20220138579A1

    公开(公告)日:2022-05-05

    申请号:US17086856

    申请日:2020-11-02

    IPC分类号: G06N3/08 G06F16/22 G06N3/04

    摘要: A method is presented for artificial neural network training. The method includes storing weight values in an array of resistive processing unit (RPU) devices, wherein the array of RPU devices represents a weight matrix, defining the weight matrix to have an output dimension that is smaller than the input dimension such that the weight matrix has a rectangular configuration, and converting the weight matrix from a rectangular configuration to a more square-shaped configuration by repeating or concatenating the rectangular configuration of the weight matrix to increase a signal strength of a backward pass signal by copying an input of repeated weight elements during a forward cycle pass, summing output computations from the repeated weight elements, updating each of the repeated weight elements according to a backpropagated error or alternatively updating only one of the repeated weight elements by setting all forward values except one to zero during an update pass.

    DNN TRAINING ALGORITHM WITH DYNAMICALLY COMPUTED ZERO-REFERENCE

    公开(公告)号:US20240232610A9

    公开(公告)日:2024-07-11

    申请号:US18048436

    申请日:2022-10-20

    IPC分类号: G06N3/08 G06N3/047

    CPC分类号: G06N3/08 G06N3/047

    摘要: A computer implemented method includes performing a gradient update for a stochastic gradient descent (SGD) of a deep neural network (DNN) using a first set of hidden weights stored in a first matrix comprising a Resistive Processing Unit (RPU) crossbar array. A second matrix comprising a second set of hidden weights is stored in a digital medium. A third matrix comprising a set of reference values is computed upon a transfer cycle of the first set of weights from the first matrix to the second matrix, accounting for a sign-change (chopper). The third matrix is stored in the digital medium. A third set of weights is updated for the DNN from the second matrix when a threshold is reached for the second set of weights, in a fourth matrix comprising a RPU crossbar array.