LEARNED THRESHOLD PRUNING FOR DEEP NEURAL NETWORKS

    公开(公告)号:US20210110268A1

    公开(公告)日:2021-04-15

    申请号:US17067233

    申请日:2020-10-09

    IPC分类号: G06N3/08 G06N3/04

    摘要: A method for pruning weights of an artificial neural network based on a learned threshold includes determining a pruning threshold for pruning a first set of pre-trained weights of multiple pre-trained weights based on a function of a classification loss and a regularization loss. The first set of pre-trained weights is pruned in response to a first value of each pretrained weight in the first set of pre-trained weights being greater than the pruning threshold. A second set of pre-trained weights of the multiple pre-trained weights is fine-tuned or adjusted in response to a second value of each pre-trained weight in the second set of pre-trained weights being greater than the pruning threshold.

    EFFICIENT POSE ESTIMATION THROUGH ITERATIVE REFINEMENT

    公开(公告)号:US20220301216A1

    公开(公告)日:2022-09-22

    申请号:US17203607

    申请日:2021-03-16

    IPC分类号: G06T7/70 G06N20/00 G06K9/62

    摘要: Certain aspects of the present disclosure provide a method, including: processing input data with a feature extraction stage of a machine learning model to generate a feature map; applying an attention map to the feature map to generate an augmented feature map; processing the augmented feature map with a refinement stage of the machine learning model to generate a refined feature map; processing the refined feature map with a first regression stage of the machine learning model to generate multi-dimensional task output data; and processing the refined feature data with an attention stage of the machine learning model to generate an updated attention map.

    STRUCTURED CONVOLUTIONS AND ASSOCIATED ACCELERATION

    公开(公告)号:US20210374537A1

    公开(公告)日:2021-12-02

    申请号:US17336048

    申请日:2021-06-01

    摘要: Certain aspects of the present disclosure provide techniques for performing machine learning, including generating a set of basis masks for a convolution layer of a machine learning model, wherein each basis mask comprises a binary mask; determining a set of scaling factors, wherein each scaling factor of the set of scaling factors corresponds to a basis mask in the set of basis masks; generating a composite kernel based on the set of basis masks and the set of scaling factors; and performing a convolution operation based on the composite kernel.