PSEUDO RANDOM PROJECTION FOR MACHINE LEARNING COMPRESSION

    公开(公告)号:US20250156706A1

    公开(公告)日:2025-05-15

    申请号:US18824898

    申请日:2024-09-04

    Applicant: Apple Inc.

    Abstract: The subject technology provides for pseudo random projection for machine learning compression. An apparatus determines a first data structure comprising pseudo random values and a second data structure comprising one or more learned values based on a target compression ratio of a first dimension associated with a first weight matrix to a second dimension. The apparatus generates the second weight matrix comprising the second data structure and a seed value associated with the first data structure. The second weight matrix may be generated based at least in part on the pseudo random values and the one or more learned values. The second weight matrix is a compressed version of the first weight matrix based on the target compression ratio. The apparatus also trains a neural network with the second weight matrix to produce a trained machine learning model.

    CUSTOMIZABLE CHIP FOR AI APPLICATIONS

    公开(公告)号:US20220343135A1

    公开(公告)日:2022-10-27

    申请号:US17860031

    申请日:2022-07-07

    Applicant: Apple Inc.

    Abstract: In one embodiment, a computing device includes an input sensor providing an input data; a programmable logic device (PLD) implementing a convolutional neural network (CNN), wherein: each compute block of the PLD corresponds to one of a multiple of convolutional layers of the CNN, each compute block of the PLD is placed in proximity to at least two memory blocks, a first one of the memory blocks serves as a buffer for the corresponding layer of the CNN, and a second one of the memory blocks stores model-specific parameters for the corresponding layer of the CNN.

Patent Agency Ranking