DISPARITY-BASED DEPTH REFINEMENT USING CONFIDENCE INFORMATION AND STEREOSCOPIC DEPTH INFORMATION

    公开(公告)号:US20240404093A1

    公开(公告)日:2024-12-05

    申请号:US18327380

    申请日:2023-06-01

    Abstract: Systems and techniques are provided for generating disparity information from two or more images. For example, a process can include obtaining first disparity information corresponding to a pair of images, the pair of images including a first image of a scene and a second image of the scene. The process can include obtaining confidence information associated with the first disparity information. The process can include processing, using a machine learning network, the first disparity information and the confidence information to generate second disparity information corresponding to the pair of images. The process can include combining, based on the confidence information, the first disparity information with the second disparity information to generate a refined disparity map corresponding to the pair of images.

    PHYSICALLY-BASED EMITTER ESTIMATION FOR INDOOR SCENES

    公开(公告)号:US20240303913A1

    公开(公告)日:2024-09-12

    申请号:US18180797

    申请日:2023-03-08

    CPC classification number: G06T15/506 G06T7/593

    Abstract: Systems and techniques are provided for physical-based light estimation for inverse rendering of indoor scenes. For example, a computing device can obtain an estimated scene geometry based on a multi-view observation of a scene. The computing device can further obtain a light emission mask based on the multi-view observation of the scene. The computing device can also obtain an emitted radiance field based on the multi-view observation of the scene. The computing device can then determine, based on the light emission mask and the emitted radiance field, a geometry of at least one light source of the estimated scene geometry.

    PLANAR MESH RECONSTRUCTION USING IMAGES FROM MULTIPLE CAMERA POSES

    公开(公告)号:US20240386650A1

    公开(公告)日:2024-11-21

    申请号:US18509113

    申请日:2023-11-14

    Abstract: Systems and techniques are provided for processing image data corresponding to a scene. A process can include generating a planar distance map including a planar distance value for each pixel of at least one image corresponding to the scene. Planar segmentation is performed based on the planar distance map, a normal map corresponding to the at least one image, and positional encoding information of the planar distance map. A triangular mesh fragment is initialized based on sampling points from each planar segment of a plurality of planar segments from the planar segmentation. Ray-triangle intersections are determined based on performing ray casting for a reconstructed planar mesh including a plurality of triangular mesh fragments each corresponding to a different image. A planar reconstruction and segmentation machine learning network is optimized for the scene, based on training the planar reconstruction and segmentation machine learning network using one or more loss functions.

Patent Agency Ranking