GENERATING DEPTH IMAGES FOR IMAGE DATA
    Invention publication

    Publication number: US20230281843A1

    Publication date: 2023-09-07

    Application number: US17688694

    Filing date: 2022-03-07

    Inventors: Tiecheng Wu, Bo Li

    CPC classification number: G06T7/50 G06T7/13 G06T2207/20081

    Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for training a machine learning model configured to generate a predicted depth image, comprising: receiving data representing training samples that include a plurality of image pairs, each image pair including a target image and a reference image that both capture a particular scene from different orientations; for each of the plurality of image pairs, generating a compressed cost volume for the image pair; providing the compressed cost volume as an input to the machine learning model; generating, using the machine learning model, output data representing a predicted disparity map for the compressed cost volume; generating a total loss using the predicted disparity map for the compressed cost volume, the total loss including a boundary loss, an occlusion loss, and a transfer loss; and updating a plurality of parameters of the machine learning model by minimizing the total losses.

    Precise object segmentation with multi-modal input for realtime video application

    Publication number: US11636683B2

    Publication date: 2023-04-25

    Application number: US17474774

    Filing date: 2021-09-14

    Inventors: Fangwen Tu, Bo Li

    Abstract: The present invention discloses a system for precise representation of object segmentation with multi-modal input for real-time video applications. The multi-modal segmentation system takes advantage of optical, temporal, and spatial information to enhance segmentation with accurate details for AR, VR, or other entertainment purposes. The system can segment foreground objects, such as humans and salient objects, within a video frame and allows an object of interest to be located for multiple purposes.

    SYSTEM FOR DETECTING FACE LIVELINESS IN AN IMAGE

    Publication number: US20230084980A1

    Publication date: 2023-03-16

    Application number: US17474965

    Filing date: 2021-09-14

    Inventors: Shuen Lyu, Bo Li

    Abstract: The present invention discloses a liveliness detection technique that identifies facial attributes and classifies the face presented in an image as real or deceptive. The system and method include identifying the facial attributes and utilizing a multi-task learning network. The neural network includes segmentation and classification functionalities. The final output is used to obtain pixel-level semantic information and high-level semantic information.

    Generating depth images for image data

    Publication number: US12190535B2

    Publication date: 2025-01-07

    Application number: US17688694

    Filing date: 2022-03-07

    Inventors: Tiecheng Wu, Bo Li

    Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for training a machine learning model configured to generate a predicted depth image, comprising: receiving data representing training samples that include a plurality of image pairs, each image pair including a target image and a reference image that both capture a particular scene from different orientations; for each of the plurality of image pairs, generating a compressed cost volume for the image pair; providing the compressed cost volume as an input to the machine learning model; generating, using the machine learning model, output data representing a predicted disparity map for the compressed cost volume; generating a total loss using the predicted disparity map for the compressed cost volume, the total loss including a boundary loss, an occlusion loss, and a transfer loss; and updating a plurality of parameters of the machine learning model by minimizing the total losses.
