-
Publication number: US20210329306A1
Publication date: 2021-10-21
Application number: US17069253
Filing date: 2020-10-13
Applicant: NVIDIA Corporation
Inventor: Ming-Yu Liu, Ting-Chun Wang, Arun Mohanray Mallya, Tero Tapani Karras, Samuli Matias Laine, David Patrick Luebke, Jaakko Lehtinen, Miika Samuli Aittala, Timo Oskari Aila
Abstract: Apparatuses, systems, and techniques to perform compression of video data using neural networks to facilitate video streaming, such as video conferencing. In at least one embodiment, a sender transmits to a receiver a key frame from video data and one or more keypoints identified by a neural network from said video data, and a receiver reconstructs video data using said key frame and one or more received keypoints.
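The bandwidth argument behind the abstract above can be illustrated with a rough payload count. This is only a sketch under stated assumptions, not the method claimed in the application: it assumes one uncompressed RGB key frame is sent once, that every later frame is represented by 2D keypoints stored as 32-bit floats, and all function names and sizes are hypothetical.

```python
def raw_frame_bytes(width, height, channels=3):
    """Bytes in one uncompressed frame (assumption: 8 bits per channel)."""
    return width * height * channels


def keypoint_payload_bytes(num_keypoints, coords_per_keypoint=2, bytes_per_coord=4):
    """Bytes needed to send one frame's keypoints (assumption: float32 x/y pairs)."""
    return num_keypoints * coords_per_keypoint * bytes_per_coord


def keypoint_stream_bytes(num_frames, width, height, num_keypoints):
    """One key frame up front, then keypoints only for each remaining frame."""
    key_frame = raw_frame_bytes(width, height)
    return key_frame + (num_frames - 1) * keypoint_payload_bytes(num_keypoints)


def raw_stream_bytes(num_frames, width, height):
    """Baseline: every frame sent uncompressed."""
    return num_frames * raw_frame_bytes(width, height)


if __name__ == "__main__":
    # Hypothetical clip: 30 frames at 256x256 with 10 keypoints per frame.
    print(raw_stream_bytes(30, 256, 256))
    print(keypoint_stream_bytes(30, 256, 256, 10))
```

A real system would compress the key frame and the baseline video with a conventional codec, so the gap in practice is smaller than this uncompressed comparison suggests.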
-
Publication number: US20210049468A1
Publication date: 2021-02-18
Application number: US17069449
Filing date: 2020-10-13
Applicant: NVIDIA Corporation
Inventor: Tero Tapani Karras, Samuli Matias Laine, David Patrick Luebke, Jaakko T. Lehtinen, Miika Samuli Aittala, Timo Oskari Aila, Ming-Yu Liu, Arun Mohanray Mallya, Ting-Chun Wang
Abstract: A latent code defined in an input space is processed by a mapping neural network to produce an intermediate latent code defined in an intermediate latent space. The intermediate latent code may be used as an appearance vector that is processed by a synthesis neural network to generate an image. The appearance vector is a compressed encoding of data, such as video frames including a person's face, audio, and other data. Captured images may be converted into appearance vectors at a local device and transmitted to a remote device using much less bandwidth than transmitting the captured images themselves. A synthesis neural network at the remote device reconstructs the images for display.
-
Publication number: US20190295228A1
Publication date: 2019-09-26
Application number: US16360895
Filing date: 2019-03-21
Applicant: NVIDIA Corporation
Inventor: Guilin Liu, Fitsum A. Reda, Kevin Shih, Ting-Chun Wang, Andrew Tao, Bryan Catanzaro
Abstract: A neural network architecture is disclosed for performing image in-painting using partial convolution operations. The neural network processes an image and a corresponding mask that identifies holes in the image utilizing partial convolution operations, where the mask is used by the partial convolution operation to zero out coefficients of the convolution kernel corresponding to invalid pixel data for the holes. The mask is updated after each partial convolution operation is performed in an encoder section of the neural network. In one embodiment, the neural network is implemented using an encoder-decoder framework with skip links to forward representations of the features at different sections of the encoder to corresponding sections of the decoder.
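The masking and mask-update steps described in the abstract above can be sketched for a single channel as follows. This is a minimal illustration, not the patented implementation: the function name is hypothetical, it assumes stride 1 with no padding, and the rescaling by window size over valid-pixel count is an assumption borrowed from common partial-convolution formulations rather than something the abstract states.

```python
def partial_conv2d(image, mask, kernel, bias=0.0):
    """Single-channel partial convolution (stride 1, no padding).

    image, mask, kernel are lists of rows; mask holds 1 for valid pixels
    and 0 for holes. Returns (output, updated_mask).
    """
    kh, kw = len(kernel), len(kernel[0])
    out_h = len(image) - kh + 1
    out_w = len(image[0]) - kw + 1
    window_size = kh * kw
    output, new_mask = [], []
    for i in range(out_h):
        out_row, mask_row = [], []
        for j in range(out_w):
            acc = 0.0
            valid = 0
            for di in range(kh):
                for dj in range(kw):
                    m = mask[i + di][j + dj]
                    # Multiplying by m drops hole pixels, which is equivalent
                    # to zeroing the matching kernel coefficient.
                    acc += kernel[di][dj] * image[i + di][j + dj] * m
                    valid += m
            if valid > 0:
                # Assumed rescaling: compensate for the missing pixels.
                out_row.append(acc * window_size / valid + bias)
                mask_row.append(1)  # window saw valid data, so mark output valid
            else:
                out_row.append(0.0)
                mask_row.append(0)  # window was all holes, so the hole persists
        output.append(out_row)
        new_mask.append(mask_row)
    return output, new_mask


if __name__ == "__main__":
    image = [[2.0] * 3 for _ in range(3)]
    mask = [[1, 1, 1], [1, 0, 1], [1, 1, 1]]  # hole at the center pixel
    kernel = [[1.0] * 3 for _ in range(3)]
    print(partial_conv2d(image, mask, kernel))
```

Because the updated mask marks an output position valid whenever its window contained at least one valid pixel, holes shrink with each successive layer.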
-
Publication number: US20250166237A1
Publication date: 2025-05-22
Application number: US18518430
Filing date: 2023-11-22
Applicant: NVIDIA Corporation
Inventor: Yu Zeng, Yogesh Balaji, Ting-Chun Wang, Xun Huang, Ming-Yu Liu
Abstract: Apparatuses, processors, computing systems, devices, non-transitory computer medium, and/or methods for using neural networks for generating multiple related images. In at least one embodiment, a processor includes circuitry to use one or more neural networks to generate several images, where each image includes a same object (e.g., same subject) and different backgrounds. For example, a processor including one or more circuits to use one or more neural networks to generate one or more objects (e.g., an animal, a vehicle, a person) within two or more different images (e.g., different backgrounds such as weather, season, environment) based, at least in part, on one or more indications (e.g., text prompts) by one or more users indicating content of at least one of the two or more different images (e.g., objects and/or backgrounds for each image in text such as adjectives and nouns) other than the one or more objects.
-
Publication number: US20230186428A1
Publication date: 2023-06-15
Application number: US18106348
Filing date: 2023-02-06
Applicant: NVIDIA Corporation
Inventor: Guilin Liu, Andrew Tao, Bryan Christopher Catanzaro, Ting-Chun Wang, Zhiding Yu, Shiqiu Liu, Fitsum Reda, Karan Sapra, Brandon Rowlett
CPC classification number: G06T3/4038, G06T3/4046, G06T7/40, G06N3/08, G06V10/776, G06V10/82, G06V10/454, G06V10/54, G06T2207/20081, G06T2207/20084
Abstract: Apparatuses, systems, and techniques for texture synthesis from small input textures in images using convolutional neural networks. In at least one embodiment, one or more convolutional layers are used in conjunction with one or more transposed convolution operations to generate a large textured output image from a small input textured image while preserving global features and texture, according to various novel techniques described herein.
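As a reminder of how a transposed convolution produces an output larger than its input, which is the operation the abstract above pairs with convolutional layers, here is a one-dimensional sketch. The function name, weights, and stride are hypothetical, and nothing here reflects the specific network described in the application.

```python
def transposed_conv1d(signal, kernel, stride=2):
    """1D transposed convolution with no padding.

    Each input sample scatters a scaled copy of the kernel into the output,
    so the output length is (len(signal) - 1) * stride + len(kernel).
    """
    out_len = (len(signal) - 1) * stride + len(kernel)
    output = [0.0] * out_len
    for i, x in enumerate(signal):
        for j, w in enumerate(kernel):
            # Overlapping kernel copies are summed.
            output[i * stride + j] += x * w
    return output


if __name__ == "__main__":
    print(transposed_conv1d([1.0, 2.0], [1.0, 1.0, 1.0], stride=2))
```

With two input samples, a three-tap kernel, and stride 2, the output has five samples; the same scattering idea in two dimensions is what lets a small texture patch grow into a larger image.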
-
Publication number: US11625613B2
Publication date: 2023-04-11
Application number: US17143516
Filing date: 2021-01-07
Applicant: NVIDIA Corporation
Inventor: Tero Tapani Karras, Samuli Matias Laine, David Patrick Luebke, Jaakko T. Lehtinen, Miika Samuli Aittala, Timo Oskari Aila, Ming-Yu Liu, Arun Mohanray Mallya, Ting-Chun Wang
Abstract: A latent code defined in an input space is processed by a mapping neural network to produce an intermediate latent code defined in an intermediate latent space. The intermediate latent code may be used as an appearance vector that is processed by a synthesis neural network to generate an image. The appearance vector is a compressed encoding of data, such as video frames including a person's face, audio, and other data. Captured images may be converted into appearance vectors at a local device and transmitted to a remote device using much less bandwidth than transmitting the captured images themselves. A synthesis neural network at the remote device reconstructs the images for display.
-
Publication number: US20220012536A1
Publication date: 2022-01-13
Application number: US17483688
Filing date: 2021-09-23
Applicant: NVIDIA Corporation
Inventor: Ting-Chun Wang, Ming-Yu Liu, Bryan Christopher Catanzaro, Jan Kautz, Andrew J. Tao
Abstract: A method, computer readable medium, and system are disclosed for creating an image utilizing a map representing different classes of specific pixels within a scene. One or more computing systems use the map to create a preliminary image. This preliminary image is then compared to an original image that was used to create the map. A determination is made whether the preliminary image matches the original image, and results of the determination are used to adjust the computing systems that created the preliminary image, which improves a performance of such computing systems. The adjusted computing systems are then used to create images based on different input maps representing various object classes of specific pixels within a scene.
-
Publication number: US20210150354A1
Publication date: 2021-05-20
Application number: US17143608
Filing date: 2021-01-07
Applicant: NVIDIA Corporation
Inventor: Tero Tapani Karras, Samuli Matias Laine, David Patrick Luebke, Jaakko T. Lehtinen, Miika Samuli Aittala, Timo Oskari Aila, Ming-Yu Liu, Arun Mohanray Mallya, Ting-Chun Wang
Abstract: A latent code defined in an input space is processed by a mapping neural network to produce an intermediate latent code defined in an intermediate latent space. The intermediate latent code may be used as an appearance vector that is processed by a synthesis neural network to generate an image. The appearance vector is a compressed encoding of data, such as video frames including a person's face, audio, and other data. Captured images may be converted into appearance vectors at a local device and transmitted to a remote device using much less bandwidth than transmitting the captured images themselves. A synthesis neural network at the remote device reconstructs the images for display.
-
Publication number: US10984286B2
Publication date: 2021-04-20
Application number: US16265725
Filing date: 2019-02-01
Applicant: NVIDIA Corporation
Inventor: Aysegul Dundar, Ming-Yu Liu, Ting-Chun Wang, John Zedlewski, Jan Kautz
IPC: G06K9/62, G06K9/32, G06K9/00, G01N3/08, G06N3/04, G06T7/10, G06T3/00, G06T11/00, G06T15/00, G06N3/08
Abstract: A style transfer neural network may be used to generate stylized synthetic images, where real images provide the style (e.g., seasons, weather, lighting) for transfer to synthetic images. The stylized synthetic images may then be used to train a recognition neural network. In turn, the trained neural network may be used to predict semantic labels for the real images, providing recognition data for the real images. Finally, the real training dataset (real images and predicted recognition data) and the synthetic training dataset are used by the style transfer neural network to generate stylized synthetic images. The training of the neural network, prediction of recognition data for the real images, and stylizing of the synthetic images may be repeated for a number of iterations. The stylization operation more closely aligns a covariate of the synthetic images to the covariate of the real images, improving accuracy of the recognition neural network.
-
Publication number: US20210042503A1
Publication date: 2021-02-11
Application number: US17069478
Filing date: 2020-10-13
Applicant: NVIDIA Corporation
Inventor: Tero Tapani Karras, Samuli Matias Laine, David Patrick Luebke, Jaakko T. Lehtinen, Miika Samuli Aittala, Timo Oskari Aila, Ming-Yu Liu, Arun Mohanray Mallya, Ting-Chun Wang
Abstract: A latent code defined in an input space is processed by a mapping neural network to produce an intermediate latent code defined in an intermediate latent space. The intermediate latent code may be used as an appearance vector that is processed by a synthesis neural network to generate an image. The appearance vector is a compressed encoding of data, such as video frames including a person's face, audio, and other data. Captured images may be converted into appearance vectors at a local device and transmitted to a remote device using much less bandwidth than transmitting the captured images themselves. A synthesis neural network at the remote device reconstructs the images for display.
-