Patent search ap:("NVIDIA Corporation") AND inv:"Jinwei Gu" Page 1

1.

发明授权
Switchable propagation neural network 有权

公开(公告)号：US11328173B2

公开(公告)日：2022-05-10

申请号：US17081805

申请日：2020-10-27

Applicant: NVIDIA Corporation

Inventor： Sifei Liu , Shalini De Mello , Jinwei Gu , Varun Jampani , Jan Kautz

IPC: G06K9/62 , G06T7/90 , G06T5/00 , G06T7/10 , G06N3/08 , G06N3/04 , G06T5/50 , G06V20/40

Abstract: A temporal propagation network (TPN) system learns the affinity matrix for video image processing tasks. An affinity matrix is a generic matrix that defines the similarity of two points in space. The TPN system includes a guidance neural network model and a temporal propagation module and is trained for a particular computer vision task to propagate visual properties from a key-frame represented by dense data (color), to another frame that is represented by coarse data (grey-scale). The guidance neural network model generates an affinity matrix referred to as a global transformation matrix from task-specific data for the key-frame and the other frame. The temporal propagation module applies the global transformation matrix to the key-frame property data to produce propagated property data (color) for the other frame. For example, the TPN system may be used to colorize several frames of greyscale video using a single manually colorized key-frame.

2.

发明申请
DEEP-LEARNING METHOD FOR SEPARATING REFLECTION AND TRANSMISSION IMAGES VISIBLE AT A SEMI-REFLECTIVE SURFACE IN A COMPUTER IMAGE OF A REAL-WORLD SCENE 审中-公开

公开(公告)号：US20200342263A1

公开(公告)日：2020-10-29

申请号：US16924005

申请日：2020-07-08

Applicant: NVIDIA Corporation

Inventor： Orazio Gallo , Jinwei Gu , Jan Kautz , Patrick Wieschollek

IPC: G06K9/62

Abstract: When a computer image is generated from a real-world scene having a semi-reflective surface (e.g. window), the computer image will create, at the semi-reflective surface from the viewpoint of the camera, both a reflection of a scene in front of the semi-reflective surface and a transmission of a scene located behind the semi-reflective surface. Similar to a person viewing the real-world scene from different locations, angles, etc., the reflection and transmission may change, and also move relative to each other, as the viewpoint of the camera changes. Unfortunately, the dynamic nature of the reflection and transmission negatively impacts the performance of many computer applications, but performance can generally be improved if the reflection and transmission are separated. The present disclosure uses deep learning to separate reflection and transmission at a semi-reflective surface of a computer image generated from a real-world scene.

3.

发明申请
ESTIMATING DEPTH FOR A VIDEO STREAM CAPTURED WITH A MONOCULAR RGB CAMERA 审中-公开

公开(公告)号：US20200160546A1

公开(公告)日：2020-05-21

申请号：US16439539

申请日：2019-06-12

Applicant: NVIDIA Corporation

Inventor： Jinwei Gu , Kihwan Kim , Chao Liu

IPC: G06T7/55 , G06T3/00 , G06N3/04 , G06N3/08 , G06N20/20 , G06N20/10 , H04N5/232

Abstract: Techniques for estimating depth for a video stream captured by a monocular image sensor are disclosed. A sequence of image frames are captured by the monocular image sensor. A first neural network is configured to process at least a portion of the sequence of image frames to generate a depth probability volume. The depth probability volume includes a plurality of probability maps corresponding to a number of discrete depth candidate locations over a range of depths defined for the scene. The depth probability volume can be updated using a second neural network that is configured to generate adaptive gain parameters to integrate the DPVs over time. A third neural network is configured to refine the updated depth probability volume from a lower resolution to a higher resolution that matches the original resolution of the sequence of image frames. A depth map can be calculated based on the depth probability volume.

4.

发明申请
GUIDED HALLUCINATION FOR MISSING IMAGE CONTENT USING A NEURAL NETWORK 审中-公开

公开(公告)号：US20190355103A1

公开(公告)日：2019-11-21

申请号：US16353195

申请日：2019-03-14

Applicant: NVIDIA Corporation

Inventor： Seung-Hwan Baek , Kihwan Kim , Jinwei Gu , Orazio Gallo , Alejandro Jose Troccoli , Ming-Yu Liu , Jan Kautz

IPC: G06T5/00 , G06K9/72 , G06T5/50 , G06T3/40 , G06K9/62

Abstract: Missing image content is generated using a neural network. In an embodiment, a high resolution image and associated high resolution semantic label map are generated from a low resolution image and associated low resolution semantic label map. The input image/map pair (low resolution image and associated low resolution semantic label map) lacks detail and is therefore missing content. Rather than simply enhancing the input image/map pair, data missing in the input image/map pair is improvised or hallucinated by a neural network, creating plausible content while maintaining spatio-temporal consistency. Missing content is hallucinated to generate a detailed zoomed in portion of an image. Missing content is hallucinated to generate different variations of an image, such as different seasons or weather conditions for a driving video.

5.

发明授权
Systems and methods for dynamic facial analysis using a recurrent neural network 有权

公开(公告)号：US10373332B2

公开(公告)日：2019-08-06

申请号：US15836549

申请日：2017-12-08

Applicant: NVIDIA Corporation

Inventor： Jinwei Gu , Xiaodong Yang , Shalini De Mello , Jan Kautz

IPC: G06T3/40 , G06T13/40 , G06N3/08 , G06T7/73

Abstract: A method, computer readable medium, and system are disclosed for dynamic facial analysis. The method includes the steps of receiving video data representing a sequence of image frames including at least one head and extracting, by a neural network, spatial features comprising pitch, yaw, and roll angles of the at least one head from the video data. The method also includes the step of processing, by a recurrent neural network, the spatial features for two or more image frames in the sequence of image frames to produce head pose estimates for the at least one head.

6.

发明授权
Learning based camera pose estimation from images of an environment 有权

公开(公告)号：US10692244B2

公开(公告)日：2020-06-23

申请号：US16137064

申请日：2018-09-20

Applicant: NVIDIA Corporation

Inventor： Jinwei Gu , Samarth Manoj Brahmbhatt , Kihwan Kim , Jan Kautz

IPC: G06T7/80 , G06T7/00 , G06K9/00 , G06K9/20 , G06K9/46 , G06N3/00 , G06T7/579 , G06T7/20

Abstract: A deep neural network (DNN) system learns a map representation for estimating a camera position and orientation (pose). The DNN is trained to learn a map representation corresponding to the environment, defining positions and attributes of structures, trees, walls, vehicles, etc. The DNN system learns a map representation that is versatile and performs well for many different environments (indoor, outdoor, natural, synthetic, etc.). The DNN system receives images of an environment captured by a camera (observations) and outputs an estimated camera pose within the environment. The estimated camera pose is used to perform camera localization, i.e., recover the three-dimensional (3D) position and orientation of a moving camera, which is a fundamental task in computer vision with a wide variety of applications in robot navigation, car localization for autonomous driving, device localization for mobile navigation, and augmented/virtual reality.

7.

发明申请
3D PLANE DETECTION AND RECONSTRUCTION USING A MONOCULAR IMAGE 审中-公开

公开(公告)号：US20200167943A1

公开(公告)日：2020-05-28

申请号：US16565885

申请日：2019-09-10

Applicant: NVIDIA Corporation

Inventor： Kihwan Kim , Jinwei Gu , Chen Liu , Jan Kautz

IPC: G06T7/593 , G06T7/11 , G06T7/143 , G06N3/04

Abstract: Planar regions in three-dimensional scenes offer important geometric cues in a variety of three-dimensional perception tasks such as scene understanding, scene reconstruction, and robot navigation. Image analysis to detect planar regions can be performed by a deep learning architecture that includes a number of neural networks configured to estimate parameters for the planar regions. The neural networks process an image to detect an arbitrary number of plane objects in the image. Each plane object is associated with a number of estimated parameters including bounding box parameters, plane normal parameters, and a segmentation mask. Global parameters for the image, including a depth map, can also be estimated by one of the neural networks. Then, a segmentation refinement network jointly optimizes (i.e., refines) the segmentation masks for each instance of the plane objects and combines the refined segmentation masks to generate an aggregate segmentation mask for the image.

8.

发明申请
DEEP-LEARNING METHOD FOR SEPARATING REFLECTION AND TRANSMISSION IMAGES VISIBLE AT A SEMI-REFLECTIVE SURFACE IN A COMPUTER IMAGE OF A REAL-WORLD SCENE 审中-公开

公开(公告)号：US20190164268A1

公开(公告)日：2019-05-30

申请号：US16200192

申请日：2018-11-26

Applicant: NVIDIA Corporation

Inventor： Orazio Gallo , Jinwei Gu , Jan Kautz , Patrick Wieschollek

IPC: G06T7/00 , G06K9/62 , G06T1/20 , G06T11/40 , G06N20/00

Abstract: When a computer image is generated from a real-world scene having a semi-reflective surface (e.g. window), the computer image will create, at the semi-reflective surface from the viewpoint of the camera, both a reflection of a scene in front of the semi-reflective surface and a transmission of a scene located behind the semi-reflective surface. Similar to a person viewing the real-world scene from different locations, angles, etc., the reflection and transmission may change, and also move relative to each other, as the viewpoint of the camera changes. Unfortunately, the dynamic nature of the reflection and transmission negatively impacts the performance of many computer applications, but performance can generally be improved if the reflection and transmission are separated. The present disclosure uses deep learning to separate reflection and transmission at a semi-reflective surface of a computer image generated from a real-world scene.

9.

发明授权
Inverse rendering of a scene from a single image 有权

公开(公告)号：US11295514B2

公开(公告)日：2022-04-05

申请号：US16685538

申请日：2019-11-15

Applicant: NVIDIA Corporation

Inventor： Jinwei Gu , Kihwan Kim , Jan Kautz , Guilin Liu , Soumyadip Sengupta

IPC: G06T15/50 , G06T9/00 , G06N3/08 , G06N3/04

Abstract: Inverse rendering estimates physical scene attributes (e.g., reflectance, geometry, and lighting) from image(s) and is used for gaming, virtual reality, augmented reality, and robotics. An inverse rendering network (IRN) receives a single input image of a 3D scene and generates the physical scene attributes for the image. The IRN is trained by using the estimated physical scene attributes generated by the IRN to reproduce the input image and updating parameters of the IRN to reduce differences between the reproduced input image and the input image. A direct renderer and a residual appearance renderer (RAR) reproduce the input image. The RAR predicts a residual image representing complex appearance effects of the real (not synthetic) image based on features extracted from the image and the reflectance and geometry properties. The residual image represents near-field illumination, cast shadows, inter-reflections, and realistic shading that are not provided by the direct renderer.

10.

发明授权
Learning-based camera pose estimation from images of an environment 有权

公开(公告)号：US10964061B2

公开(公告)日：2021-03-30

申请号：US16872752

申请日：2020-05-12

Applicant: NVIDIA Corporation

Inventor： Jinwei Gu , Samarth Manoj Brahmbhatt , Kihwan Kim , Jan Kautz

IPC: G06T7/80 , G06T7/00 , G06K9/00 , G06K9/20 , G06K9/46 , G06N3/00 , G06T7/579 , G06T7/20

Abstract: A deep neural network (DNN) system learns a map representation for estimating a camera position and orientation (pose). The DNN is trained to learn a map representation corresponding to the environment, defining positions and attributes of structures, trees, walls, vehicles, etc. The DNN system learns a map representation that is versatile and performs well for many different environments (indoor, outdoor, natural, synthetic, etc.). The DNN system receives images of an environment captured by a camera (observations) and outputs an estimated camera pose within the environment. The estimated camera pose is used to perform camera localization, i.e., recover the three-dimensional (3D) position and orientation of a moving camera, which is a fundamental task in computer vision with a wide variety of applications in robot navigation, car localization for autonomous driving, device localization for mobile navigation, and augmented/virtual reality.

Search Results

Country/Region

Patent validity

Application date

Publication (announcement) day

applicant

The country/region where the applicant is located

Inventor

IPC

IPC Department

IPC class

IPC subclass

IPC group

IPC team

Appearance classification