-
公开(公告)号:US20230110206A1
公开(公告)日:2023-04-13
申请号:US18079772
申请日:2022-12-12
Applicant: NVIDIA Corporation
Inventor: Tero Tapani Karras , Samuli Matias Laine , David Patrick Luebke , Jaakko T. Lehtinen , Miika Samuli Aittala , Timo Oskari Aila , Ming-Yu Liu , Arun Mohanray Mallya , Ting-Chun Wang
Abstract: A latent code defined in an input space is processed by the mapping neural network to produce an intermediate latent code defined in an intermediate latent space. The intermediate latent code may be used as appearance vector that is processed by the synthesis neural network to generate an image. The appearance vector is a compressed encoding of data, such as video frames including a person's face, audio, and other data. Captured images may be converted into appearance vectors at a local device and transmitted to a remote device using much less bandwidth compared with transmitting the captured images. A synthesis neural network at the remote device reconstructs the images for display.
-
公开(公告)号:US20210150369A1
公开(公告)日:2021-05-20
申请号:US17160585
申请日:2021-01-28
Applicant: NVIDIA Corporation
Inventor: Tero Tapani Karras , Samuli Matias Laine , Jaakko T. Lehtinen , Miika Samuli Aittala , Janne Johannes Hellsten , Timo Oskari Aila
Abstract: A style-based generative network architecture enables scale-specific control of synthesized output data, such as images. During training, the style-based generative neural network (generator neural network) includes a mapping network and a synthesis network. During prediction, the mapping network may be omitted, replicated, or evaluated several times. The synthesis network may be used to generate highly varied, high-quality output data with a wide variety of attributes. For example, when used to generate images of people's faces, the attributes that may vary are age, ethnicity, camera viewpoint, pose, face shape, eyeglasses, colors (eyes, hair, etc.), hair style, lighting, background, etc. Depending on the task, generated output data may include images, audio, video, three-dimensional (3D) objects, text, etc.
-
公开(公告)号:US11620521B2
公开(公告)日:2023-04-04
申请号:US17160648
申请日:2021-01-28
Applicant: NVIDIA Corporation
Inventor: Tero Tapani Karras , Samuli Matias Laine , Jaakko T. Lehtinen , Miika Samuli Aittala , Janne Johannes Hellsten , Timo Oskari Aila
Abstract: A style-based generative network architecture enables scale-specific control of synthesized output data, such as images. During training, the style-based generative neural network (generator neural network) includes a mapping network and a synthesis network. During prediction, the mapping network may be omitted, replicated, or evaluated several times. The synthesis network may be used to generate highly varied, high-quality output data with a wide variety of attributes. For example, when used to generate images of people's faces, the attributes that may vary are age, ethnicity, camera viewpoint, pose, face shape, eyeglasses, colors (eyes, hair, etc.), hair style, lighting, background, etc. Depending on the task, generated output data may include images, audio, video, three-dimensional (3D) objects, text, etc.
-
公开(公告)号:US11610435B2
公开(公告)日:2023-03-21
申请号:US17069478
申请日:2020-10-13
Applicant: NVIDIA Corporation
Inventor: Tero Tapani Karras , Samuli Matias Laine , David Patrick Luebke , Jaakko T. Lehtinen , Miika Samuli Aittala , Timo Oskari Aila , Ming-Yu Liu , Arun Mohanray Mallya , Ting-Chun Wang
Abstract: A latent code defined in an input space is processed by the mapping neural network to produce an intermediate latent code defined in an intermediate latent space. The intermediate latent code may be used as appearance vector that is processed by the synthesis neural network to generate an image. The appearance vector is a compressed encoding of data, such as video frames including a person's face, audio, and other data. Captured images may be converted into appearance vectors at a local device and transmitted to a remote device using much less bandwidth compared with transmitting the captured images. A synthesis neural network at the remote device reconstructs the images for display.
-
公开(公告)号:US20220405980A1
公开(公告)日:2022-12-22
申请号:US17562521
申请日:2021-12-27
Applicant: NVIDIA Corporation
Inventor: Tero Tapani Karras , Miika Samuli Aittala , Samuli Matias Laine , Erik Andreas Härkönen , Janne Johannes Hellsten , Jaakko T. Lehtinen , Timo Oskari Aila
Abstract: Systems and methods are disclosed for fused processing of a continuous mathematical operator. Fused processing of continuous mathematical operations, such as pointwise non-linear functions without storing intermediate results to memory improves performance when the memory bus bandwidth is limited. In an embodiment, a continuous mathematical operation including at least two of convolution, upsampling, pointwise non-linear function, and downsampling is executed to process input data and generate alias-free output data. In an embodiment, the input data is spatially tiled for processing in parallel such that the intermediate results generated during processing of the input data for each tile may be stored in a shared memory within the processor. Storing the intermediate data in the shared memory improves performance compared with storing the intermediate data to the external memory and loading the intermediate data from the external memory.
-
公开(公告)号:US12142016B2
公开(公告)日:2024-11-12
申请号:US17562521
申请日:2021-12-27
Applicant: NVIDIA Corporation
Inventor: Tero Tapani Karras , Miika Samuli Aittala , Samuli Matias Laine , Erik Andreas Härkönen , Janne Johannes Hellsten , Jaakko T. Lehtinen , Timo Oskari Aila
Abstract: Systems and methods are disclosed for fused processing of a continuous mathematical operator. Fused processing of continuous mathematical operations, such as pointwise non-linear functions without storing intermediate results to memory improves performance when the memory bus bandwidth is limited. In an embodiment, a continuous mathematical operation including at least two of convolution, upsampling, pointwise non-linear function, and downsampling is executed to process input data and generate alias-free output data. In an embodiment, the input data is spatially tiled for processing in parallel such that the intermediate results generated during processing of the input data for each tile may be stored in a shared memory within the processor. Storing the intermediate data in the shared memory improves performance compared with storing the intermediate data to the external memory and loading the intermediate data from the external memory.
-
公开(公告)号:US11775829B2
公开(公告)日:2023-10-03
申请号:US18079772
申请日:2022-12-12
Applicant: NVIDIA Corporation
Inventor: Tero Tapani Karras , Samuli Matias Laine , David Patrick Luebke , Jaakko T. Lehtinen , Miika Samuli Aittala , Timo Oskari Aila , Ming-Yu Liu , Arun Mohanray Mallya , Ting-Chun Wang
CPC classification number: G06N3/08 , G06T5/003 , G06T7/73 , G06T9/002 , G06V40/168 , H04N7/157 , H04N19/20 , G06T2207/20081 , G06T2207/20084 , G06T2207/30201
Abstract: A latent code defined in an input space is processed by the mapping neural network to produce an intermediate latent code defined in an intermediate latent space. The intermediate latent code may be used as appearance vector that is processed by the synthesis neural network to generate an image. The appearance vector is a compressed encoding of data, such as video frames including a person's face, audio, and other data. Captured images may be converted into appearance vectors at a local device and transmitted to a remote device using much less bandwidth compared with transmitting the captured images. A synthesis neural network at the remote device reconstructs the images for display.
-
公开(公告)号:US20210383241A1
公开(公告)日:2021-12-09
申请号:US17210934
申请日:2021-03-24
Applicant: NVIDIA Corporation
Inventor: Tero Tapani Karras , Miika Samuli Aittala , Janne Johannes Hellsten , Samuli Matias Laine , Jaakko T. Lehtinen , Timo Oskari Aila
Abstract: Embodiments of the present disclosure relate to a technique for training neural networks, such as a generative adversarial neural network (GAN), using a limited amount of data. Training GANs using too little example data typically leads to discriminator overfitting, causing training to diverge and produce poor results. An adaptive discriminator augmentation mechanism is used that significantly stabilizes training with limited data providing the ability to train high-quality GANs. An augmentation operator is applied to the distribution of inputs to a discriminator used to train a generator, representing a transformation that is invertible to ensure there is no leakage of the augmentations into the images generated by the generator. Reducing the amount of training data that is needed to achieve convergence has the potential to considerably help many applications and may the increase use of generative models in fields such as medicine.
-
公开(公告)号:US20210150187A1
公开(公告)日:2021-05-20
申请号:US17143516
申请日:2021-01-07
Applicant: NVIDIA Corporation
Inventor: Tero Tapani Karras , Samuli Matias Laine , David Patrick Luebke , Jaakko T. Lehtinen , Miika Samuli Aittala , Timo Oskari Aila , Ming-Yu Liu , Arun Mohanray Mallya , Ting-Chun Wang
Abstract: A latent code defined in an input space is processed by the mapping neural network to produce an intermediate latent code defined in an intermediate latent space. The intermediate latent code may be used as appearance vector that is processed by the synthesis neural network to generate an image. The appearance vector is a compressed encoding of data, such as video frames including a person's face, audio, and other data. Captured images may be converted into appearance vectors at a local device and transmitted to a remote device using much less bandwidth compared with transmitting the captured images. A synthesis neural network at the remote device reconstructs the images for display.
-
公开(公告)号:US20250111227A1
公开(公告)日:2025-04-03
申请号:US18897244
申请日:2024-09-26
Applicant: NVIDIA Corporation
Inventor: Tero Tapani Karras , Miika Samuli Aittala , Janne Johannes Hellsten , Jaakko T. Lehtinen , Timo Oskari Aila , Samuli Matias Laine
IPC: G06N3/08
Abstract: Apparatuses, systems, and techniques to train neural networks and to use neural networks to perform inference. In at least one embodiment, a balanced concatenation layer performs a balanced concatenation operation during a forward pass of a training iteration during the training of a neural network. In at least one embodiment, a balanced concatenation layer performs a balanced concatenation operation during the use of a neural network to perform inference.
-
-
-
-
-
-
-
-
-