专利检索 ap:("Google LLC") AND inv:"Si Si" 第 1 页

1.

发明申请
IMAGE PROCESSING NEURAL NETWORKS WITH DYNAMIC FILTER ACTIVATION 有权

公开(公告)号：US20220004849A1

公开(公告)日：2022-01-06

申请号：US17295561

申请日：2019-11-20

申请人： Google LLC

发明人： Zhourong Chen , Yang Li , Samuel Bengio , Si Si

IPC分类号： G06N3/04 , G06N3/08 , G06T5/20 , G06T5/00

摘要： Methods, systems, and apparatus, including computer programs encoded on computer storage media, for processing images using neural networks. One of the methods includes receiving a network input; processing the network input through a gater neural network to generate a gating vector that includes a respective value for each of a plurality of filters; determining, from the gating vector and for each of the plurality of filters, whether the filter is active or inactive; and processing the network input through the main convolutional neural network to generate an image processing output, comprising, for each convolutional layer in the first plurality of convolutional layers: receiving an input feature map for the convolutional layer; and generating an output feature map, the generating comprising: for each filter of the convolutional layer that is inactive: setting the output channel for the filter to have all zero elements.

2.

发明授权
Decreasing neural network inference times using softmax approximation 审中-公开

公开(公告)号：US10671909B2

公开(公告)日：2020-06-02

申请号：US16586702

申请日：2019-09-27

申请人： Google LLC

发明人： Yang Li , Sanjiv Kumar , Pei-Hung Chen , Si Si , Cho-Jui Hsieh

IPC分类号： G06N3/04 , G06F17/16 , G06F17/18 , G06K9/62

摘要： Methods, systems, and apparatus, including computer programs encoded on computer storage media, for decreasing neural network inference times using softmax approximation. One of the methods includes maintaining data specifying a respective softmax weight vector for each output in a vocabulary of possible neural network outputs; receiving a neural network input; processing the neural network input using one or more initial neural network layers to generate a context vector for the neural network input; and generating an approximate score distribution over the vocabulary of possible neural network outputs for the neural network input, comprising: processing the context vector using a screening model configured to predict a proper subset of the vocabulary for the context input; and generating a respective logit for each output that is in the proper subset, comprising applying the softmax weight vector for the output to the context vector.

3.

发明申请
DECREASING NEURAL NETWORK INFERENCE TIMES USING SOFTMAX APPROXIMATION 审中-公开

公开(公告)号：US20200104686A1

公开(公告)日：2020-04-02

申请号：US16586702

申请日：2019-09-27

申请人： Google LLC

发明人： Yang Li , Sanjiv Kumar , Pei-Hung Chen , Si Si , Cho-Jui Hsieh

IPC分类号： G06N3/04 , G06K9/62 , G06F17/18 , G06F17/16

摘要： Methods, systems, and apparatus, including computer programs encoded on computer storage media, for decreasing neural network inference times using softmax approximation. One of the methods includes maintaining data specifying a respective softmax weight vector for each output in a vocabulary of possible neural network outputs; receiving a neural network input; processing the neural network input using one or more initial neural network layers to generate a context vector for the neural network input; and generating an approximate score distribution over the vocabulary of possible neural network outputs for the neural network input, comprising: processing the context vector using a screening model configured to predict a proper subset of the vocabulary for the context input; and generating a respective logit for each output that is in the proper subset, comprising applying the softmax weight vector for the output to the context vector.