-
Publication number: US20240144568A1
Publication date: 2024-05-02
Application number: US17903585
Filing date: 2022-09-06
Applicant: Nvidia Corporation
Inventor: Siddharth Gururani , Arun Mallya , Ting-Chun Wang , Jose Rafael Valle da Costa , Ming-Yu Liu
CPC classification number: G06T13/205 , G06V10/82 , G06V40/171
Abstract: Apparatuses, systems, and techniques are presented to generate digital content. In at least one embodiment, one or more neural networks are used to generate video information based at least in part upon voice information and a combination of image features and facial landmarks corresponding to one or more images of a person.
-
Publication number: US20240095989A1
Publication date: 2024-03-21
Application number: US17945951
Filing date: 2022-09-15
Applicant: NVIDIA Corporation
Inventor: Arun Mohanray Mallya , Ting-Chun Wang , Ming-Yu Liu
CPC classification number: G06T13/20 , G06T7/20 , G06V10/25 , G06V10/443 , G06V10/761 , G06V10/771 , G06V10/82 , G06T2207/20081 , G06T2207/30252
Abstract: Apparatuses, systems, and techniques to generate a video using two or more images comprising objects to be included in the video. In at least one embodiment, objects are identified in two or more images using one or more neural networks, to generate a video to include the objects in the video.
-
Publication number: US11610435B2
Publication date: 2023-03-21
Application number: US17069478
Filing date: 2020-10-13
Applicant: NVIDIA Corporation
Inventor: Tero Tapani Karras , Samuli Matias Laine , David Patrick Luebke , Jaakko T. Lehtinen , Miika Samuli Aittala , Timo Oskari Aila , Ming-Yu Liu , Arun Mohanray Mallya , Ting-Chun Wang
Abstract: A latent code defined in an input space is processed by the mapping neural network to produce an intermediate latent code defined in an intermediate latent space. The intermediate latent code may be used as appearance vector that is processed by the synthesis neural network to generate an image. The appearance vector is a compressed encoding of data, such as video frames including a person's face, audio, and other data. Captured images may be converted into appearance vectors at a local device and transmitted to a remote device using much less bandwidth compared with transmitting the captured images. A synthesis neural network at the remote device reconstructs the images for display.
-
Publication number: US11934959B2
Publication date: 2024-03-19
Application number: US16889376
Filing date: 2020-06-01
Applicant: Nvidia Corporation
Inventor: Arun Mallya , Ting-Chun Wang , Ming-Yu Liu , Karan Sapra
Abstract: Apparatuses, systems, and techniques are presented to synthesize consistent images or video. In at least one embodiment, one or more neural networks are used to generate one or more second images based, at least in part, on one or more point cloud representations of one or more first images.
-
Publication number: US20230110206A1
Publication date: 2023-04-13
Application number: US18079772
Filing date: 2022-12-12
Applicant: NVIDIA Corporation
Inventor: Tero Tapani Karras , Samuli Matias Laine , David Patrick Luebke , Jaakko T. Lehtinen , Miika Samuli Aittala , Timo Oskari Aila , Ming-Yu Liu , Arun Mohanray Mallya , Ting-Chun Wang
Abstract: A latent code defined in an input space is processed by the mapping neural network to produce an intermediate latent code defined in an intermediate latent space. The intermediate latent code may be used as appearance vector that is processed by the synthesis neural network to generate an image. The appearance vector is a compressed encoding of data, such as video frames including a person's face, audio, and other data. Captured images may be converted into appearance vectors at a local device and transmitted to a remote device using much less bandwidth compared with transmitting the captured images. A synthesis neural network at the remote device reconstructs the images for display.
-
Publication number: US20220207770A1
Publication date: 2022-06-30
Application number: US17165701
Filing date: 2021-02-02
Applicant: NVIDIA Corporation
Inventor: Ming-Yu Liu , Ting-Chun Wang , Xihui Liu
Abstract: Apparatuses, systems, and techniques to produce an image of a first subject positioned in a pose demonstrated by an image of a second subject. In at least one embodiment, an image of a first subject can be generated from a variety of points of view.
-
Publication number: US20210374552A1
Publication date: 2021-12-02
Application number: US16889376
Filing date: 2020-06-01
Applicant: Nvidia Corporation
Inventor: Arun Mallya , Ting-Chun Wang , Ming-Yu Liu , Karan Sapra
Abstract: Apparatuses, systems, and techniques are presented to synthesize consistent images or video. In at least one embodiment, one or more neural networks are used to generate one or more second images based, at least in part, on one or more point cloud representations of one or more first images.
-
Publication number: US20200242774A1
Publication date: 2020-07-30
Application number: US16721852
Filing date: 2019-12-19
Applicant: Nvidia Corporation
Inventor: Taesung Park , Ming-Yu Liu , Ting-Chun Wang , Junyan Zhu
Abstract: A user can create a basic semantic layout that includes two or more regions identified by the user, each region being associated with a semantic label indicating a type of object(s) to be rendered in that region. The semantic layout can be provided as input to an image synthesis network. The network can be a trained machine learning network, such as a generative adversarial network (GAN), that includes a conditional, spatially-adaptive normalization layer for propagating semantic information from the semantic layout to other layers of the network. The synthesis can involve both normalization and de-normalization, where each region of the layout can utilize different normalization parameter values. An image is inferred from the network, and rendered for display to the user. The user can change labels or regions in order to cause a new or updated image to be generated.
-
Publication number: US11610122B2
Publication date: 2023-03-21
Application number: US17143608
Filing date: 2021-01-07
Applicant: NVIDIA Corporation
Inventor: Tero Tapani Karras , Samuli Matias Laine , David Patrick Luebke , Jaakko T. Lehtinen , Miika Samuli Aittala , Timo Oskari Aila , Ming-Yu Liu , Arun Mohanray Mallya , Ting-Chun Wang
Abstract: A latent code defined in an input space is processed by the mapping neural network to produce an intermediate latent code defined in an intermediate latent space. The intermediate latent code may be used as appearance vector that is processed by the synthesis neural network to generate an image. The appearance vector is a compressed encoding of data, such as video frames including a person's face, audio, and other data. Captured images may be converted into appearance vectors at a local device and transmitted to a remote device using much less bandwidth compared with transmitting the captured images. A synthesis neural network at the remote device reconstructs the images for display.
-
Publication number: US11580395B2
Publication date: 2023-02-14
Application number: US17069449
Filing date: 2020-10-13
Applicant: NVIDIA Corporation
Inventor: Tero Tapani Karras , Samuli Matias Laine , David Patrick Luebke , Jaakko T. Lehtinen , Miika Samuli Aittala , Timo Oskari Aila , Ming-Yu Liu , Arun Mohanray Mallya , Ting-Chun Wang
Abstract: A latent code defined in an input space is processed by the mapping neural network to produce an intermediate latent code defined in an intermediate latent space. The intermediate latent code may be used as appearance vector that is processed by the synthesis neural network to generate an image. The appearance vector is a compressed encoding of data, such as video frames including a person's face, audio, and other data. Captured images may be converted into appearance vectors at a local device and transmitted to a remote device using much less bandwidth compared with transmitting the captured images. A synthesis neural network at the remote device reconstructs the images for display.