ADAPTIVE TEXT RECOGNITION
    Invention publication

    Publication number: US20230237816A1

    Publication date: 2023-07-27

    Application number: US17586724

    Application date: 2022-01-27

    Inventor: Zi Sian Wong, Bo Li

    CPC classification number: G06V20/625 G06V20/582 G06N20/20

    Abstract: Methods, systems, and apparatuses, including computer programs encoded on computer storage media, for generating a prediction of at least a text and a particular type associated with an object are described in this specification. A first model output is generated by using a first machine learning model to process input data including one or more objects. The first model output identifies an existence of a particular object in the input data and specifies characteristics of the particular object. A type of the particular object is determined based on the specified characteristics. The type comprises a single-row type and a multi-row type. A single-row representation of the particular object is generated. A second model output is generated by processing the single-row representation. The second model output comprises a prediction of characters corresponding to the particular vehicle license plate.
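The single-row step in the abstract above can be illustrated with a minimal sketch. It assumes details the abstract does not give: the plate type is guessed from the crop's aspect ratio, and a multi-row plate has exactly two rows split at the vertical midpoint. The names `classify_plate_type`, `to_single_row`, and `ratio_threshold` are hypothetical and are not taken from the patent.

```python
from typing import List

Image = List[List[int]]  # grayscale crop: a list of pixel rows


def classify_plate_type(width: int, height: int, ratio_threshold: float = 2.5) -> str:
    """Assumed rule: wide crops are single-row, squarer crops are multi-row."""
    return "single-row" if width / height >= ratio_threshold else "multi-row"


def to_single_row(plate: Image, plate_type: str) -> Image:
    """Return a single-row representation of a plate crop.

    A multi-row crop is assumed to hold two text rows of equal height; the
    lower half is placed to the right of the upper half.
    """
    if plate_type == "single-row":
        return [row[:] for row in plate]
    mid = len(plate) // 2
    top = plate[:mid]
    bottom = plate[mid:2 * mid]  # drop a trailing row if the height is odd
    return [t + b for t, b in zip(top, bottom)]


# Usage: a 4-pixel-high, 3-pixel-wide crop is treated as multi-row.
crop = [[1, 2, 3], [4, 5, 6], [7, 8, 9], [10, 11, 12]]
kind = classify_plate_type(width=3, height=4)
flat = to_single_row(crop, kind)
```

The flattened crop would then be passed to the second model, which the abstract describes only as producing a prediction of characters.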

    PRECISE OBJECT SEGMENTATION WITH MULTI-MODAL INPUT FOR REALTIME VIDEO APPLICATION

    Publication number: US20230083896A1

    Publication date: 2023-03-16

    Application number: US17474774

    Application date: 2021-09-14

    Inventor: Fangwen Tu, Bo Li

    Abstract: The present invention discloses a system for precise representation of object segmentation with multi-modal input for real-time video applications. The multi-modal segmentation system takes advantage of optical, temporal, and spatial information to enhance segmentation with accurate detail for AR, VR, or other entertainment purposes. The system can segment foreground objects, such as humans and salient objects, within a video frame and allows an object of interest to be located for multiple purposes.

    GENERATING STEREO-BASED DENSE DEPTH IMAGES

    Publication number: US20230035671A1

    Publication date: 2023-02-02

    Application number: US17376027

    Application date: 2021-07-14

    Inventor: Tiecheng Wu, Bo Li

    Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for generating a depth image, comprising obtaining data representing a first image generated by a first sensor and a second image generated by a second sensor, wherein each of the first and second images includes a plurality of pixels; determining, for each pixel of the plurality of pixels included in the first image, whether the pixel is a boundary pixel associated with a boundary of an object that is represented in the first image; determining, from a plurality of candidate penalty values and for each pixel in the first image, an optimized penalty value for the pixel; generating an optimized cost function for the first image based on the optimized penalty values for the plurality of pixels; and generating a depth image for the first image based on the optimized cost function.
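The idea of a per-pixel penalty feeding a cost function can be sketched on a single image row. This is an illustration under assumptions, not the patented method: the abstract does not say how the optimized penalty is chosen, so the sketch simply picks the smallest candidate at boundary pixels and the largest elsewhere, then minimizes absolute-difference matching cost plus penalty-weighted disparity changes by dynamic programming. All function names are hypothetical.

```python
from typing import List, Sequence

INF = float("inf")


def select_penalties(is_boundary: Sequence[bool], candidates: Sequence[float]) -> List[float]:
    """Assumed rule: allow cheap disparity jumps at object boundaries."""
    low, high = min(candidates), max(candidates)
    return [low if b else high for b in is_boundary]


def scanline_disparities(left: Sequence[float], right: Sequence[float],
                         penalties: Sequence[float], max_disp: int) -> List[int]:
    """Minimize matching cost plus per-pixel smoothness penalty along one row."""
    n = len(left)

    def match(x: int, d: int) -> float:
        # Pixel x in the left row is compared with pixel x - d in the right row.
        return abs(left[x] - right[x - d]) if x - d >= 0 else INF

    # cost[x][d]: lowest total cost of a path ending at pixel x with disparity d
    cost = [[INF] * (max_disp + 1) for _ in range(n)]
    back = [[0] * (max_disp + 1) for _ in range(n)]
    for d in range(max_disp + 1):
        cost[0][d] = match(0, d)
    for x in range(1, n):
        for d in range(max_disp + 1):
            m = match(x, d)
            if m == INF:
                continue
            best_prev, best_cost = 0, INF
            for pd in range(max_disp + 1):
                c = cost[x - 1][pd] + penalties[x] * abs(d - pd)
                if c < best_cost:
                    best_prev, best_cost = pd, c
            cost[x][d] = m + best_cost
            back[x][d] = best_prev
    # Backtrack from the cheapest final disparity.
    d = min(range(max_disp + 1), key=lambda k: cost[n - 1][k])
    result = [0] * n
    for x in range(n - 1, -1, -1):
        result[x] = d
        d = back[x][d]
    return result


# Usage: the left row is the right row shifted by one pixel.
left_row = [10, 10, 20, 30, 40]
right_row = [10, 20, 30, 40, 50]
pens = select_penalties([False, True, False, False, False], [1.0, 5.0])
disp = scanline_disparities(left_row, right_row, pens, max_disp=1)
```

For a calibrated, rectified stereo pair, depth is inversely proportional to disparity, so a disparity row like `disp` would be converted to depth using the focal length and baseline.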

    Generating stereo-based dense depth images

    Publication number: US11961249B2

    Publication date: 2024-04-16

    Application number: US17376027

    Application date: 2021-07-14

    Inventor: Tiecheng Wu, Bo Li

    CPC classification number: G06T7/55 G06N20/00 G06T5/70 G06T2207/20081

    Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for generating a depth image, comprising obtaining data representing a first image generated by a first sensor and a second image generated by a second sensor, wherein each of the first and second images includes a plurality of pixels; determining, for each pixel of the plurality of pixels included in the first image, whether the pixel is a boundary pixel associated with a boundary of an object that is represented in the first image; determining, from a plurality of candidate penalty values and for each pixel in the first image, an optimized penalty value for the pixel; generating an optimized cost function for the first image based on the optimized penalty values for the plurality of pixels; and generating a depth image for the first image based on the optimized cost function.

    Adaptive text recognition
    Invention grant

    Publication number: US12046054B2

    Publication date: 2024-07-23

    Application number: US17586724

    Application date: 2022-01-27

    Inventor: Zi Sian Wong, Bo Li

    CPC classification number: G06V20/625 G06N20/20 G06V20/582

    Abstract: Methods, systems, and apparatuses, including computer programs encoded on computer storage media, for generating a prediction of at least a text and a particular type associated with an object are described in this specification. A first model output is generated by using a first machine learning model to process input data including one or more objects. The first model output identifies an existence of a particular object in the input data and specifies characteristics of the particular object. A type of the particular object is determined based on the specified characteristics. The type comprises a single-row type and a multi-row type. A single-row representation of the particular object is generated. A second model output is generated by processing the single-row representation. The second model output comprises a prediction of characters corresponding to the particular vehicle license plate.
