-
公开(公告)号:US20190355103A1
公开(公告)日:2019-11-21
申请号:US16353195
申请日:2019-03-14
Applicant: NVIDIA Corporation
Inventor: Seung-Hwan Baek , Kihwan Kim , Jinwei Gu , Orazio Gallo , Alejandro Jose Troccoli , Ming-Yu Liu , Jan Kautz
Abstract: Missing image content is generated using a neural network. In an embodiment, a high resolution image and associated high resolution semantic label map are generated from a low resolution image and associated low resolution semantic label map. The input image/map pair (low resolution image and associated low resolution semantic label map) lacks detail and is therefore missing content. Rather than simply enhancing the input image/map pair, data missing in the input image/map pair is improvised or hallucinated by a neural network, creating plausible content while maintaining spatio-temporal consistency. Missing content is hallucinated to generate a detailed zoomed in portion of an image. Missing content is hallucinated to generate different variations of an image, such as different seasons or weather conditions for a driving video.
-
公开(公告)号:US20190279075A1
公开(公告)日:2019-09-12
申请号:US16279671
申请日:2019-02-19
Applicant: Nvidia Corporation
Inventor: Ming-Yu Liu , Xun Huang
Abstract: A source image is processed using an encoder network to determine a content code representative of a visual aspect of the source object represented in the source image. A target class is determined, which can correspond to an entire population of objects of a particular type. The user may specify specific objects within the target class, or a sampling can be done to select objects within the target class to use for the translation. Style codes for the selected target objects are determined that are representative of the appearance of those target objects. The target style codes are provided with the source content code as input to a translation network, which can use the codes to infer a set of images including representations of the selected target objects having the visual aspect determined from the source image.
-
公开(公告)号:US20190244329A1
公开(公告)日:2019-08-08
申请号:US16246375
申请日:2019-01-11
Applicant: NVIDIA Corporation
Inventor: Yijun Li , Ming-Yu Liu , Ming-Hsuan Yang , Jan Kautz
Abstract: Photorealistic image stylization concerns transferring style of a reference photo to a content photo with the constraint that the stylized photo should remain photorealistic. Examples of styles include seasons (summer, winter, etc.), weather (sunny, rainy, foggy, etc.), lighting (daytime, nighttime, etc.). A photorealistic image stylization process includes a stylization step and a smoothing step. The stylization step transfers the style of the reference photo to the content photo. A photo style transfer neural network model receives a photorealistic content image and a photorealistic style image and generates an intermediate stylized photorealistic image that includes the content of the content image modified according to the style image. A smoothing function receives the intermediate stylized photorealistic image and pixel similarity data and generates the stylized photorealistic image, ensuring spatially consistent stylizations.
-
公开(公告)号:US20180293737A1
公开(公告)日:2018-10-11
申请号:US15942213
申请日:2018-03-30
Applicant: NVIDIA Corporation
Inventor: Deqing Sun , Xiaodong Yang , Ming-Yu Liu , Jan Kautz
CPC classification number: G06T7/207 , G06N3/0454 , G06N3/08 , G06N5/046 , G06T3/0093 , G06T7/246 , G06T7/251 , G06T7/97 , G06T2200/28 , G06T2207/10016 , G06T2207/20016 , G06T2207/20032 , G06T2207/20084
Abstract: A method, computer readable medium, and system are disclosed for estimating optical flow between two images. A first pyramidal set of features is generated for a first image and a partial cost volume for a level of the first pyramidal set of features is computed, by a neural network, using features at the level of the first pyramidal set of features and warped features extracted from a second image, where the partial cost volume is computed across a limited range of pixels that is less than a full resolution of the first image, in pixels, at the level. The neural network processes the features and the partial cost volume to produce a refined optical flow estimate for the first image and the second image.
-
公开(公告)号:US20250111222A1
公开(公告)日:2025-04-03
申请号:US18375377
申请日:2023-09-29
Applicant: NVIDIA Corporation
Inventor: Zekun Hao , Ming-Yu Liu , Arun Mallya
Abstract: Performance of a neural network is usually a function of the capacity, or complexity, of the neural network, including the depth of the neural network (i.e. the number of layers in the neural network) and/or the width of the neural network (i.e. the number of hidden channels). However, improving performance of a neural network by simply increasing its capacity has drawbacks, the most notable being the increased computational cost of a higher-capacity neural network. Since modern neural networks are configured such that the same neural network is evaluated regardless of the input, a higher capacity neural network means a higher computational cost incurred per input processed. The present disclosure provides for a multi-layer neural network that allows for dynamic path selection through the neural network when processing an input, which in turn can allow for increased neural network capacity without incurring the typical increased computation cost associated therewith.
-
公开(公告)号:US12175350B2
公开(公告)日:2024-12-24
申请号:US16566797
申请日:2019-09-10
Applicant: NVIDIA Corporation
Inventor: Arash Vahdat , Arun Mohanray Mallya , Ming-Yu Liu , Jan Kautz
Abstract: In at least one embodiment, differentiable neural architecture search and reinforcement learning are combined under one framework to discover network architectures with desired properties such as high accuracy, low latency, or both. In at least one embodiment, an objective function for search based on generalization error prevents the selection of architectures prone to overfitting.
-
公开(公告)号:US12075061B2
公开(公告)日:2024-08-27
申请号:US17955740
申请日:2022-09-29
Applicant: Nvidia Corporation
Inventor: Aurobinda Maharana , Vignesh Ungrapalli , Ming-Yu Liu
IPC: H04N19/137 , H04N19/186 , H04N19/513
CPC classification number: H04N19/137 , H04N19/186 , H04N19/513
Abstract: Systems and methods herein address reference frame selection in video streaming applications using one or more processing units to identify a frame of a sequence of frames as a blurred frame based at least in part on a first variance of motion (VoM) of the frame being less than or equal to an adaptive threshold that is based in part on a moving average of variance of motion (MAoV) determined using one or more reference frames.
-
公开(公告)号:US20240257460A1
公开(公告)日:2024-08-01
申请号:US17990614
申请日:2022-11-18
Applicant: NVIDIA Corporation
Inventor: Chen-Hsuan Lin , Zhaoshuo Li , Thomas Müller-Höhne , Alex John Bauld Evans , Ming-Yu Liu , Alexander Georg Keller
IPC: G06T17/10
CPC classification number: G06T17/10 , G06T2210/52
Abstract: Apparatuses, systems, and techniques to generate pixels based on other pixels. In at least one embodiment, one or more neural networks are used to generate one or more pixels based, at least in part, on sets of pixels surrounding the one or more pixels.
-
公开(公告)号:US20240114144A1
公开(公告)日:2024-04-04
申请号:US17955740
申请日:2022-09-29
Applicant: Nvidia Corporation
Inventor: Aurobinda Maharana , Vignesh Ungrapalli , Ming-Yu Liu
IPC: H04N19/137 , H04N19/186 , H04N19/513
CPC classification number: H04N19/137 , H04N19/186 , H04N19/513
Abstract: Systems and methods herein address reference frame selection in video streaming applications using one or more processing units to identify a frame of a sequence of frames as a blurred frame based at least in part on a first variance of motion (VoM) of the frame being less than or equal to an adaptive threshold that is based in part on a moving average of variance of motion (MAoV) determined using one or more reference frames.
-
公开(公告)号:US11934959B2
公开(公告)日:2024-03-19
申请号:US16889376
申请日:2020-06-01
Applicant: Nvidia Corporation
Inventor: Arun Mallya , Ting-Chun Wang , Ming-Yu Liu , Karan Sapra
Abstract: Apparatuses, systems, and techniques are presented to synthesize consistent images or video. In at least one embodiment, one or more neural networks are used to generate one or more second images based, at least in part, on one or more point cloud representations of one or more first images.
-
-
-
-
-
-
-
-
-