Estimating depth for a video stream captured with a monocular RGB camera

Invention Grant

US10984545B2 Estimating depth for a video stream captured with a monocular RGB camera (Active)

Patent Title: Estimating depth for a video stream captured with a monocular RGB camera
Application No.: US16439539

Application Date: 2019-06-12
Publication No.: US10984545B2

Publication Date: 2021-04-20
Inventor: Jinwei Gu, Kihwan Kim, Chao Liu
Applicant: NVIDIA Corporation
Applicant Address: US CA Santa Clara
Assignee: NVIDIA Corporation
Current Assignee: NVIDIA Corporation
Current Assignee Address: US CA Santa Clara
Agency: Leydig, Voit & Mayer, Ltd.
Main IPC: G06T7/55
IPC: G06T7/55; G06T3/00; G06N3/04; G06N3/08; G06N20/10; H04N5/232; G06N20/20

Abstract:

Techniques for estimating depth for a video stream captured by a monocular image sensor are disclosed. A sequence of image frames is captured by the monocular image sensor. A first neural network is configured to process at least a portion of the sequence of image frames to generate a depth probability volume (DPV). The DPV includes a plurality of probability maps corresponding to a number of discrete depth candidate locations over a range of depths defined for the scene. The DPV can be updated using a second neural network that is configured to generate adaptive gain parameters to integrate the DPVs over time. A third neural network is configured to refine the updated DPV from a lower resolution to a higher resolution that matches the original resolution of the sequence of image frames. A depth map can be calculated based on the DPV.
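
The abstract does not say how the depth map is derived from the depth probability volume, so the following is only a minimal sketch of one plausible reading: taking the per-pixel expected depth over the discrete candidates. The function name `depth_from_dpv`, the `(D, H, W)` array layout, and the uniform spacing of candidates between `d_min` and `d_max` are all assumptions made for illustration, not details taken from the patent.

```python
import numpy as np

def depth_from_dpv(dpv, d_min, d_max):
    """Collapse a depth probability volume into a depth map (illustrative only).

    dpv: array of shape (D, H, W); dpv[k] is the probability map for the
         k-th depth candidate. Layout is an assumption of this sketch.
    d_min, d_max: range of depths defined for the scene.
    Returns (depth, confidence), each of shape (H, W).
    """
    dpv = np.asarray(dpv, dtype=np.float64)
    num_candidates = dpv.shape[0]

    # Assumption: candidates are uniformly spaced over [d_min, d_max].
    candidates = np.linspace(d_min, d_max, num_candidates)

    # Normalise so the probabilities at each pixel sum to 1.
    dpv = dpv / dpv.sum(axis=0, keepdims=True)

    # Expected depth per pixel: sum_k candidates[k] * dpv[k, y, x].
    depth = np.tensordot(candidates, dpv, axes=(0, 0))

    # Peak probability per pixel, used here as a simple confidence proxy.
    confidence = dpv.max(axis=0)
    return depth, confidence


# Tiny example: 3 candidates, a 1x2 image.
# Pixel 0 puts all mass on the farthest candidate; pixel 1 is uniform.
example_dpv = np.array([
    [[0.0, 1.0]],
    [[0.0, 1.0]],
    [[1.0, 1.0]],
])
example_depth, example_conf = depth_from_dpv(example_dpv, d_min=1.0, d_max=3.0)
```

Under these assumptions, a sharply peaked distribution yields a depth near the peak candidate with high confidence, while a flat distribution yields the mid-range depth with low confidence. An argmax over candidates would be an equally valid alternative reading.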

Public/Granted literature

US20200160546A1 ESTIMATING DEPTH FOR A VIDEO STREAM CAPTURED WITH A MONOCULAR RGB CAMERA Public/Granted day: 2020-05-21

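IPC Classification:

G	Physics
G06	Computing; calculating or counting
G06T	Image data processing or generation, in general
G06T7/00	Image analysis
G06T7/50	.Depth or shape recovery
G06T7/55	..from multiple images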