FULLY ATTENTIONAL COMPUTER VISION

    公开(公告)号:US20220215654A1

    公开(公告)日:2022-07-07

    申请号:US17606976

    申请日:2020-05-22

    Applicant: Google LLC

    Abstract: A system implemented as computer programs on one or more computers in one or more locations that implements a computer vision model is described. The computer vision model includes a positional local self-attention layer that is configured to receive an input feature map and to generate an output feature map. For each input element in the input feature map, the positional local self-attention layer generates a respective output element for the output feature map by generating a memory block including neighboring input elements around the input element, generates a query vector using the input element and a query weight matrix, for each neighboring element in the memory block, performs positional local self-attention operations to generate a temporary output element, and generates the respective output element by summing temporary output elements of the neighboring elements in the memory block.

    Neural Network Layers with a Controlled Degree of Spatial Invariance

    公开(公告)号:US20210248472A1

    公开(公告)日:2021-08-12

    申请号:US17121161

    申请日:2020-12-14

    Applicant: Google LLC

    Abstract: The present disclosure provides a neural network including one or more layers with relaxed spatial invariance. Each of the one or more layers can be configured to receive a respective layer input. Each of the one or more layers can be configured to convolve a plurality of different kernels against the respective layer input to generate a plurality of intermediate outputs, each of the plurality of intermediate outputs having a plurality of portions. Each of the one or more layers can be configured to apply, for each of the plurality of intermediate outputs, a respective plurality of weights respectively associated with the plurality of portions to generate a respective weighted output. Each of the one or more layers can be configured to generate a respective layer output based on the weighted outputs.

    Neural network layers with a controlled degree of spatial invariance

    公开(公告)号:US12265911B2

    公开(公告)日:2025-04-01

    申请号:US17121161

    申请日:2020-12-14

    Applicant: Google LLC

    Abstract: A computing system can include one or more non-transitory computer-readable media that collectively store a neural network including one or more layers with relaxed spatial invariance. Each of the one or more layers can be configured to receive a respective layer input. Each of the one or more layers can be configured to convolve a plurality of different kernels against the respective layer input to generate a plurality of intermediate outputs, each of the plurality of intermediate outputs having a plurality of portions. Each of the one or more layers can be configured to apply, for each of the plurality of intermediate outputs, a respective plurality of weights respectively associated with the plurality of portions to generate a respective weighted output. Each of the one or more layers can be configured to generate a respective layer output based on the weighted outputs.

Patent Agency Ranking