-
Publication No.: US20250156706A1
Publication Date: 2025-05-15
Application No.: US18824898
Filing Date: 2024-09-04
Applicant: Apple Inc.
Inventor: Saman NADERIPARIZI, David HARRISON, Mahyar NAJIBIKOHNEHSHAHRI, Mohammad RASTEGARI, Rasoul SHAFIPOUR
IPC: G06N3/08
Abstract: The subject technology provides pseudo-random projection for machine learning compression. An apparatus determines a first data structure comprising pseudo-random values and a second data structure comprising one or more learned values, based on a target compression ratio of a first dimension associated with a first weight matrix to a second dimension. The apparatus generates a second weight matrix comprising the second data structure and a seed value associated with the first data structure. The second weight matrix may be generated based at least in part on the pseudo-random values and the one or more learned values, and is a compressed version of the first weight matrix based on the target compression ratio. The apparatus also trains a neural network with the second weight matrix to produce a trained machine learning model.
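The storage idea in this abstract can be illustrated with a minimal sketch. It is not the patented method: the abstract does not say how the pseudo-random and learned values are combined, so the matrix product used here is an assumption, and the names `CompressedWeight`, `pseudo_random_matrix`, `compressed_dim`, and `materialize` are hypothetical. Training of the learned values is omitted.

```python
import random


def pseudo_random_matrix(seed, rows, cols):
    """Regenerate the same pseudo-random matrix from a seed (it is never stored)."""
    rng = random.Random(seed)
    return [[rng.gauss(0.0, 1.0) for _ in range(cols)] for _ in range(rows)]


def matmul(a, b):
    """Plain-Python matrix product of a (n x k) and b (k x m)."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)] for row in a]


def compressed_dim(first_dim, ratio):
    """Second dimension implied by a target ratio first_dim : second_dim."""
    return max(1, first_dim // ratio)


class CompressedWeight:
    """Stores only a seed and a small learned matrix (assumed layout)."""

    def __init__(self, seed, rows, cols, ratio):
        self.seed = seed
        self.rows = rows
        self.k = compressed_dim(rows, ratio)
        # Learned values: k x cols, initialised to zero; a training loop
        # (not shown) would update these.
        self.learned = [[0.0] * cols for _ in range(self.k)]

    def materialize(self):
        """Rebuild a rows x cols weight matrix from the seed and learned values."""
        projection = pseudo_random_matrix(self.seed, self.rows, self.k)
        return matmul(projection, self.learned)

    def stored_values(self):
        """Number of scalars kept: the learned entries plus one seed."""
        return self.k * len(self.learned[0]) + 1
```

With `rows=8`, `cols=4`, and `ratio=4`, the sketch keeps 8 learned values plus one seed in place of the 32 entries of the full matrix, and `materialize()` returns the same matrix on every call because the projection is regenerated from the seed.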
-
Publication No.: US20220343135A1
Publication Date: 2022-10-27
Application No.: US17860031
Filing Date: 2022-07-07
Applicant: Apple Inc.
Inventor: Saman NADERIPARIZI, Mohammad RASTEGARI, Sayyed Karen KHATAMIFARD
Abstract: In one embodiment, a computing device includes an input sensor providing input data and a programmable logic device (PLD) implementing a convolutional neural network (CNN), wherein: each compute block of the PLD corresponds to one of multiple convolutional layers of the CNN; each compute block of the PLD is placed in proximity to at least two memory blocks; a first one of the memory blocks serves as a buffer for the corresponding layer of the CNN; and a second one of the memory blocks stores model-specific parameters for the corresponding layer of the CNN.
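The block-per-layer arrangement described in this abstract can be modelled in software as a rough sketch. This is only an illustration of the data layout, not a hardware description: the names `MemoryBlock`, `ComputeBlock`, `build_pld`, and `run` are hypothetical, and a one-dimensional sliding-window product stands in for each convolutional layer.

```python
from dataclasses import dataclass, field


@dataclass
class MemoryBlock:
    role: str                      # "buffer" or "parameters"
    data: list = field(default_factory=list)


@dataclass
class ComputeBlock:
    layer_index: int               # which CNN layer this block implements
    buffer: MemoryBlock            # holds the layer's input data
    parameters: MemoryBlock        # holds the layer's model-specific weights


def conv1d(signal, kernel):
    """Valid sliding-window product (cross-correlation, as used in CNNs)."""
    n = len(signal) - len(kernel) + 1
    return [sum(k * signal[i + j] for j, k in enumerate(kernel)) for i in range(n)]


def build_pld(layer_kernels):
    """Create one compute block per layer, each with its own two memory blocks."""
    return [
        ComputeBlock(i, MemoryBlock("buffer"), MemoryBlock("parameters", list(kernel)))
        for i, kernel in enumerate(layer_kernels)
    ]


def run(blocks, sensor_input):
    """Pass sensor data through the blocks in layer order."""
    data = list(sensor_input)
    for block in blocks:
        block.buffer.data = data   # each layer buffers its own input locally
        data = conv1d(block.buffer.data, block.parameters.data)
    return data
```

Keeping a buffer and a parameter store next to each compute block means each layer reads only from its own two memory blocks, which is the locality property the abstract describes.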
-
Publication No.: US20240273335A1
Publication Date: 2024-08-15
Application No.: US18582487
Filing Date: 2024-02-20
Applicant: Apple Inc.
Inventor: Saman NADERIPARIZI, Mohammad RASTEGARI, Sayyed Karen KHATAMIFARD
CPC classification number: G06N3/02, G06F3/0604, G06F3/0676, G06F3/0677, G06N3/045, G06N3/063
Abstract: In one embodiment, a computing device includes an input sensor providing input data and a programmable logic device (PLD) implementing a convolutional neural network (CNN), wherein: each compute block of the PLD corresponds to one of multiple convolutional layers of the CNN; each compute block of the PLD is placed in proximity to at least two memory blocks; a first one of the memory blocks serves as a buffer for the corresponding layer of the CNN; and a second one of the memory blocks stores model-specific parameters for the corresponding layer of the CNN.
-