-
公开(公告)号:US20190304065A1
公开(公告)日:2019-10-03
申请号:US16442365
申请日:2019-06-14
Applicant: Google LLC
Inventor: Konstantinos Bousmalis , Nathan Silberman , David Martin Dohan , Dumitru Erhan , Dilip Krishnan
Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for processing images using an image processing neural network system. One of the systems includes a domain transformation neural network implemented by one or more computers, wherein the domain transformation neural network is configured to: receive an input image from a source domain; and process a network input comprising the input image from the source domain to generate a transformed image that is a transformation of the input image from the source domain to a target domain that is different from the source domain.
-
公开(公告)号:US20190095698A1
公开(公告)日:2019-03-28
申请号:US16061344
申请日:2017-09-27
Applicant: Google LLC
Inventor: Forrester H. Cole , Dilip Krishnan , William T. Freeman , David Benjamin Belanger
Abstract: The present disclosure provides systems and methods that perform face reconstruction based on an image of a face. In particular, one example system of the present disclosure combines a machine-learned image recognition model with a face modeler that uses a morphable model of a human's facial appearance. The image recognition model can be a deep learning model that generates an embedding in response to receipt of an image (e.g., an uncontrolled image of a face). The example system can further include a small, lightweight, translation model structurally positioned between the image recognition model and the face modeler. The translation model can be a machine-learned model that is trained to receive the embedding generated by the image recognition model and, in response, output a plurality of facial modeling parameter values usable by the face modeler to generate a model of the face.
-
公开(公告)号:US20220270402A1
公开(公告)日:2022-08-25
申请号:US17745158
申请日:2022-05-16
Applicant: Google LLC
Inventor: Forrester H. Cole , Dilip Krishnan , William T. Freeman , David Benjamin Belanger
Abstract: The present disclosure provides systems and methods that perform face reconstruction based on an image of a face. In particular, one example system of the present disclosure combines a machine-learned image recognition model with a face modeler that uses a morphable model of a human's facial appearance. The image recognition model can be a deep learning model that generates an embedding in response to receipt of an image (e.g., an uncontrolled image of a face). The example system can further include a small, lightweight, translation model structurally positioned between the image recognition model and the face modeler. The translation model can be a machine-learned model that is trained to receive the embedding generated by the image recognition model and, in response, output a plurality of facial modeling parameter values usable by the face modeler to generate a model of the face.
-
公开(公告)号:US11361531B2
公开(公告)日:2022-06-14
申请号:US17222782
申请日:2021-04-05
Applicant: Google LLC
Inventor: Konstantinos Bousmalis , Nathan Silberman , Dilip Krishnan , George Trigeorgis , Dumitru Erhan
Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for processing images using an image processing neural network system. One of the system includes a shared encoder neural network implemented by one or more computers, wherein the shared encoder neural network is configured to: receive an input image from a target domain; and process the input image to generate a shared feature representation of features of the input image that are shared between images from the target domain and images from a source domain different from the target domain; and a classifier neural network implemented by the one or more computers, wherein the classifier neural network is configured to: receive the shared feature representation; and process the shared feature representation to generate a network output for the input image that characterizes the input image.
-
公开(公告)号:US11335120B2
公开(公告)日:2022-05-17
申请号:US16857219
申请日:2020-04-24
Applicant: Google LLC
Inventor: Forrester H. Cole , Dilip Krishnan , William T. Freeman , David Benjamin Belanger
Abstract: The present disclosure provides systems and methods that perform face reconstruction based on an image of a face. In particular, one example system of the present disclosure combines a machine-learned image recognition model with a face modeler that uses a morphable model of a human's facial appearance. The image recognition model can be a deep learning model that generates an embedding in response to receipt of an image (e.g., an uncontrolled image of a face). The example system can further include a small, lightweight, translation model structurally positioned between the image recognition model and the face modeler. The translation model can be a machine-learned model that is trained to receive the embedding generated by the image recognition model and, in response, output a plurality of facial modeling parameter values usable by the face modeler to generate a model of the face.
-
公开(公告)号:US20240311960A1
公开(公告)日:2024-09-19
申请号:US18028063
申请日:2022-05-20
Applicant: GOOGLE LLC
Inventor: Xiao Feng , Yuanzhen LI , Yihui Wang , Omer Gimenez , Han Xu , Mengjie Wang , Huiwen Chang , AJ Maschinot , Dilip Krishnan
IPC: G06T3/4046
CPC classification number: G06T3/4046
Abstract: To adjust an aspect ratio of an image to match the aspect ratio of a display area for presenting the image, a computing device receives an image having a first aspect ratio, and obtains a second aspect ratio for a display area of a display in which to present the image, where the second aspect ratio is different from the first aspect ratio. The computing device extends the image to include one or more additional features which were not included in the image. Additionally, the computing device automatically crops the extended image around an identified region of interest by selecting a portion of the extended image that has an aspect ratio which matches the second aspect ratio of the display area, and provides the cropped image for presentation within the display area of the display.
-
公开(公告)号:US20220148299A1
公开(公告)日:2022-05-12
申请号:US17438687
申请日:2019-07-19
Applicant: Google LLC
Inventor: Mikael Pierre Bonnevie , Aaron Maschinot , Aaron Sarna , Shuchao Bi , Jingbin Wang , Michael Spencer Krainin , Wenchao Tong , Dilip Krishnan , Haifeng Gong , Ce Liu , Hossein Talebi , Raanan Sayag , Piotr Teterwak
Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for generating realistic extensions of images. In one aspect, a method comprises providing an input that comprises a provided image to a generative neural network having a plurality of generative neural network parameters. The generative neural network processes the input in accordance with trained values of the plurality of generative neural network parameters to generate an extended image. The extended image has (i) more rows, more columns, or both than the provided image, and (ii) is predicted to be a realistic extension of the provided image. The generative neural network is trained using an adversarial loss objective function.
-
公开(公告)号:US10853987B2
公开(公告)日:2020-12-01
申请号:US16702440
申请日:2019-12-03
Applicant: Google LLC
Inventor: Aaron Sarna , Dilip Krishnan , Forrester Cole , Inbar Mosseri
IPC: G06T13/80
Abstract: A system and method for generating cartoon images from photos are described. The method includes receiving an image of a user, determining a template for a cartoon avatar, determining an attribute needed for the template, processing the image with a classifier trained for classifying the attribute included in the image, determining a label generated by the classifier for the attribute, determining a cartoon asset for the attribute based on the label, and rendering the cartoon avatar personifying the user using the cartoon asset.
-
公开(公告)号:US20200257891A1
公开(公告)日:2020-08-13
申请号:US16857219
申请日:2020-04-24
Applicant: Google LLC
Inventor: Forrester H. Cole , Dilip Krishnan , William T. Freeman , David Benjamin Belanger
Abstract: The present disclosure provides systems and methods that perform face reconstruction based on an image of a face. In particular, one example system of the present disclosure combines a machine-learned image recognition model with a face modeler that uses a morphable model of a human's facial appearance. The image recognition model can be a deep learning model that generates an embedding in response to receipt of an image (e.g., an uncontrolled image of a face). The example system can further include a small, lightweight, translation model structurally positioned between the image recognition model and the face modeler. The translation model can be a machine-learned model that is trained to receive the embedding generated by the image recognition model and, in response, output a plurality of facial modeling parameter values usable by the face modeler to generate a model of the face.
-
公开(公告)号:US20180268595A1
公开(公告)日:2018-09-20
申请号:US15921207
申请日:2018-03-14
Applicant: Google LLC
Inventor: Aaron Sarna , Dilip Krishnan , Forrester Cole , Inbar Mosseri
IPC: G06T13/80
Abstract: A system and method for generating cartoon images from photos are described. The method includes receiving an image of a user, determining a template for a cartoon avatar, determining an attribute needed for the template, processing the image with a classifier trained for classifying the attribute included in the image, determining a label generated by the classifier for the attribute, determining a cartoon asset for the attribute based on the label, and rendering the cartoon avatar personifying the user using the cartoon asset.
-
-
-
-
-
-
-
-
-