VARYING AUDIO VISUAL COMPRESSION BASED ON AI DETECTION OR CLASSIFICATION RESULTS

    公开(公告)号:US20220353458A1

    公开(公告)日:2022-11-03

    申请号:US17746929

    申请日:2022-05-17

    Applicant: Apple Inc.

    Inventor: Peter ZATLOUKAL

    Abstract: In one embodiment, a computing device receives, from one or more cameras, a video stream comprising multiple frames, where the video stream is received at a first quality. The computing device analyzes, using a machine-learning model, images in the frames, where the machine-learning model has been trained to detect one or more objects-of-interest in the images. The computing device identifies a sequence-of-interest including consecutive frames of the video stream, where at least one object-of-interest was detected in at least one of the consecutive frames. The computing device generates a video package including the sequence-of-interest.

    MEMORY-EFFICIENT DIFFERENTIABLE WEIGHT CLUSTERING FOR LARGE LANGUAGE MODEL COMPRESSION

    公开(公告)号:US20250037018A1

    公开(公告)日:2025-01-30

    申请号:US18658919

    申请日:2024-05-08

    Applicant: Apple Inc.

    Abstract: The subject technology provides memory-efficient differentiable weight clustering for large language model compression. An apparatus determines a tensor including an attention map between learned weights of a trained machine learning model and corresponding centroids. The apparatus also determines a compressed attention table and a plurality of index lists during compression of the trained machine learning model based on an uniquification of the attention map and sharding of an associated index list. The apparatus determines whether the tensor exists at a destination device during compression of the trained machine learning model using a marshaling layer. The apparatus refrains from copying the tensor to the destination device when the tensor exists at the destination device, or copies the tensor to the destination device when the tensor does not exist at the destination device. The apparatus deploys a compressed machine learning model based on the compression of the trained machine learning model.

Patent Agency Ranking