-
Publication number: US10853987B2
Publication date: 2020-12-01
Application number: US16702440
Filing date: 2019-12-03
Applicant: Google LLC
Inventor: Aaron Sarna, Dilip Krishnan, Forrester Cole, Inbar Mosseri
IPC: G06T13/80
Abstract: A system and method for generating cartoon images from photos are described. The method includes receiving an image of a user, determining a template for a cartoon avatar, determining an attribute needed for the template, processing the image with a classifier trained for classifying the attribute included in the image, determining a label generated by the classifier for the attribute, determining a cartoon asset for the attribute based on the label, and rendering the cartoon avatar personifying the user using the cartoon asset.
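The steps in this abstract (template, required attributes, per-attribute classifier, label, asset lookup) can be sketched as follows. This is only an illustrative sketch, not the patented implementation: the template name, attribute names, asset file names, and the stub classifiers are all hypothetical and stand in for trained models and a real asset library.

```python
from typing import Callable, Dict, List, Tuple

# Hypothetical template table: which attributes each avatar template needs.
TEMPLATE_ATTRIBUTES: Dict[str, List[str]] = {
    "basic_face": ["hair_color", "eye_color"],
}

# Hypothetical asset catalog: (attribute, label) -> cartoon asset file name.
ASSETS: Dict[Tuple[str, str], str] = {
    ("hair_color", "brown"): "hair_brown.svg",
    ("hair_color", "black"): "hair_black.svg",
    ("eye_color", "green"): "eyes_green.svg",
    ("eye_color", "blue"): "eyes_blue.svg",
}

# A classifier maps raw image bytes to a label for one attribute.
Classifier = Callable[[bytes], str]


def build_avatar(
    image: bytes, template: str, classifiers: Dict[str, Classifier]
) -> List[str]:
    """Return the cartoon assets selected for the given image and template."""
    selected = []
    for attribute in TEMPLATE_ATTRIBUTES[template]:
        label = classifiers[attribute](image)        # label from the classifier
        selected.append(ASSETS[(attribute, label)])  # asset chosen by label
    return selected


# Stand-in classifiers (assumption): real ones would be trained models.
stub_classifiers: Dict[str, Classifier] = {
    "hair_color": lambda image: "brown",
    "eye_color": lambda image: "green",
}

avatar_assets = build_avatar(b"fake-image-bytes", "basic_face", stub_classifiers)
```

A renderer would then composite the returned assets into the final avatar; that step is omitted here because the abstract does not describe it.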
-
Publication number: US20180268595A1
Publication date: 2018-09-20
Application number: US15921207
Filing date: 2018-03-14
Applicant: Google LLC
Inventor: Aaron Sarna, Dilip Krishnan, Forrester Cole, Inbar Mosseri
IPC: G06T13/80
Abstract: A system and method for generating cartoon images from photos are described. The method includes receiving an image of a user, determining a template for a cartoon avatar, determining an attribute needed for the template, processing the image with a classifier trained for classifying the attribute included in the image, determining a label generated by the classifier for the attribute, determining a cartoon asset for the attribute based on the label, and rendering the cartoon avatar personifying the user using the cartoon asset.
-
Publication number: US20250069194A1
Publication date: 2025-02-27
Application number: US18946147
Filing date: 2024-11-13
Applicant: Google LLC
Inventor: Kfir Aberman, Yotam Nitzan, Orly Liba, Yael Pritch Knaan, Qiurui He, Inbar Mosseri, Yossi Gandelsman, Michal Yarom
Abstract: Systems and methods for identifying a personalized prior within a generative model's latent vector space based on a set of images of a given subject. In some examples, the present technology may further include using the personalized prior to confine the inputs of a generative model to a latent vector space associated with the given subject, such that when the model is tasked with editing an image of the subject (e.g., to perform inpainting to fill in masked areas, improve resolution, or deblur the image), the subject's identifying features will be reflected in the images the model produces.
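One way to picture "confining the inputs of a generative model to a latent vector space associated with the given subject" is to keep latent codes close to an anchor derived from the subject's own images. The sketch below is an assumption made for illustration only: the abstract does not say that the prior is a mean latent or that the confinement is a ball of fixed radius, and the function names `mean_latent` and `confine` are hypothetical.

```python
import math
from typing import List, Sequence

Vector = List[float]


def mean_latent(latents: Sequence[Vector]) -> Vector:
    """Average the latent codes obtained from the subject's images (assumed anchor)."""
    count = len(latents)
    return [sum(column) / count for column in zip(*latents)]


def confine(latent: Vector, anchor: Vector, radius: float) -> Vector:
    """Project a latent code onto the ball of the given radius around the anchor."""
    offset = [x - a for x, a in zip(latent, anchor)]
    norm = math.sqrt(sum(o * o for o in offset))
    if norm <= radius:
        return list(latent)  # already inside the subject's region
    scale = radius / norm
    return [a + o * scale for a, o in zip(anchor, offset)]


# Toy 2-D latents standing in for codes inverted from the subject's photos.
subject_latents = [[0.0, 0.0], [2.0, 0.0]]
anchor = mean_latent(subject_latents)
confined = confine([1.0, 5.0], anchor, radius=1.0)
```

In an editing task such as inpainting, a confined code of this kind would be what is fed to the generator, so that the output stays near the subject's region of latent space.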
-
Publication number: US12169911B2
Publication date: 2024-12-17
Application number: US18334700
Filing date: 2023-06-14
Applicant: Google LLC
Inventor: Kfir Aberman, Yotam Nitzan, Orly Liba, Yael Pritch Knaan, Qiurui He, Inbar Mosseri, Yossi Gandelsman, Michal Yarom
Abstract: Systems and methods for identifying a personalized prior within a generative model's latent vector space based on a set of images of a given subject. In some examples, the present technology may further include using the personalized prior to confine the inputs of a generative model to a latent vector space associated with the given subject, such that when the model is tasked with editing an image of the subject (e.g., to perform inpainting to fill in masked areas, improve resolution, or deblur the image), the subject's identifying features will be reflected in the images the model produces.
-
Publication number: US20230325998A1
Publication date: 2023-10-12
Application number: US18334700
Filing date: 2023-06-14
Applicant: Google LLC
Inventor: Kfir Aberman, Yotam Nitzan, Orly Liba, Yael Pritch Knaan, Qiurui He, Inbar Mosseri, Yossi Gandelsman, Michal Yarom
CPC classification number: G06T5/50, G06T5/001, G06T3/40, G06T2207/20081, G06T2207/20084
Abstract: Systems and methods for identifying a personalized prior within a generative model's latent vector space based on a set of images of a given subject. In some examples, the present technology may further include using the personalized prior to confine the inputs of a generative model to a latent vector space associated with the given subject, such that when the model is tasked with editing an image of the subject (e.g., to perform inpainting to fill in masked areas, improve resolution, or deblur the image), the subject's identifying features will be reflected in the images the model produces.
-
Publication number: US11456005B2
Publication date: 2022-09-27
Application number: US16761707
Filing date: 2018-11-21
Applicant: GOOGLE LLC
Inventor: Inbar Mosseri, Michael Rubinstein, Ariel Ephrat, William Freeman, Oran Lang, Kevin William Wilson, Tali Dekel, Avinatan Hassidim
Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for audio-visual speech separation. A method includes: obtaining, for each frame in a stream of frames from a video in which faces of one or more speakers have been detected, a respective per-frame face embedding of the face of each speaker; processing, for each speaker, the per-frame face embeddings of the face of the speaker to generate visual features for the face of the speaker; obtaining a spectrogram of an audio soundtrack for the video; processing the spectrogram to generate an audio embedding for the audio soundtrack; combining the visual features for the one or more speakers and the audio embedding for the audio soundtrack to generate an audio-visual embedding for the video; determining a respective spectrogram mask for each of the one or more speakers; and determining a respective isolated speech spectrogram for each speaker.
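The last two steps of this abstract, a per-speaker spectrogram mask and a per-speaker isolated spectrogram, can be illustrated with a small sketch. It assumes that each mask is applied to the mixture spectrogram by element-wise multiplication, which is a common approach in mask-based separation but is not stated in the abstract. The mask values and speaker names are made up, and in the described method the masks would come from the audio-visual embedding rather than being given.

```python
from typing import Dict, List

# A magnitude spectrogram indexed as [time_frame][frequency_bin].
Spectrogram = List[List[float]]


def apply_mask(mixture: Spectrogram, mask: Spectrogram) -> Spectrogram:
    """Multiply the mixture by a mask of the same shape, bin by bin."""
    same_shape = len(mixture) == len(mask) and all(
        len(m_row) == len(w_row) for m_row, w_row in zip(mixture, mask)
    )
    if not same_shape:
        raise ValueError("mask and mixture must have the same shape")
    return [
        [value * weight for value, weight in zip(m_row, w_row)]
        for m_row, w_row in zip(mixture, mask)
    ]


def separate(
    mixture: Spectrogram, masks: Dict[str, Spectrogram]
) -> Dict[str, Spectrogram]:
    """Produce one isolated spectrogram per speaker."""
    return {speaker: apply_mask(mixture, mask) for speaker, mask in masks.items()}


# Toy mixture with 2 time frames and 2 frequency bins.
mixture = [[4.0, 2.0], [6.0, 8.0]]
masks = {
    "speaker_a": [[1.0, 0.0], [0.5, 0.25]],
    "speaker_b": [[0.0, 1.0], [0.5, 0.75]],
}
isolated = separate(mixture, masks)
```

Because the two toy masks sum to one in every bin, the isolated spectrograms add back up to the mixture; real masks need not have that property.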
-
Publication number: US20200335121A1
Publication date: 2020-10-22
Application number: US16761707
Filing date: 2018-11-21
Applicant: GOOGLE LLC
Inventor: Inbar Mosseri, Michael Rubinstein, Ariel Ephrat, William Freeman, Oran Lang, Kevin William Wilson, Tali Dekel, Avinatan Hassidim
Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for audio-visual speech separation. A method includes: obtaining, for each frame in a stream of frames from a video in which faces of one or more speakers have been detected, a respective per-frame face embedding of the face of each speaker; processing, for each speaker, the per-frame face embeddings of the face of the speaker to generate visual features for the face of the speaker; obtaining a spectrogram of an audio soundtrack for the video; processing the spectrogram to generate an audio embedding for the audio soundtrack; combining the visual features for the one or more speakers and the audio embedding for the audio soundtrack to generate an audio-visual embedding for the video; determining a respective spectrogram mask for each of the one or more speakers; and determining a respective isolated speech spectrogram for each speaker.