ADAPTIVE TEXT RECOGNITION
    3.
    发明公开

    公开(公告)号:US20230237816A1

    公开(公告)日:2023-07-27

    申请号:US17586724

    申请日:2022-01-27

    发明人: Zi Sian Wong Bo Li

    IPC分类号: G06V20/62 G06V20/58 G06N20/20

    摘要: Methods, systems, and apparatuses, including computer programs encoded on computer storage media, for generating a prediction of at least a text and a particular type associated with an object are described in this specification. A first model output is generated by using a first machine learning model to process input data including one or more objects. The first model output identifies an existence of a particular object in the input data and specifies characteristics of the particular object. A type of the particular object is determined based on the specified characteristics. The type comprises a single-row type and a multi-row type. A single-row representation of the particular object is generated. A second model output is generated by processing the single-row representation. The second model output comprises a prediction of characters corresponding to the particular vehicle license plate.

    TEMPORAL VIDEO ENHANCEMENT
    5.
    发明公开

    公开(公告)号:US20230177639A1

    公开(公告)日:2023-06-08

    申请号:US17545424

    申请日:2021-12-08

    IPC分类号: G06T3/40 G06T5/50

    摘要: A method of age and gender estimation, comprising receiving an input image, detecting a facial image within the input image, estimating a head pose based on a set of facial image intensities of the facial image, Wherein the head pose is expressed as a yaw, a pitch and a roll, determining whether the yaw, the pitch and the roll of the head pose is less than a predetermined threshold, aligning the facial image if the yaw, the pitch and the roll of the head pose are less than the predetermined threshold and predicting an age and a gender of the aligned facial image.

    NEURAL NETWORKS GRAPH PARTITIONING SYSTEM AND METHOD FOR THE SAME

    公开(公告)号:US20230177311A1

    公开(公告)日:2023-06-08

    申请号:US17545799

    申请日:2021-12-08

    IPC分类号: G06N3/04 G06F17/18

    CPC分类号: G06N3/0454 G06F17/18

    摘要: The present invention discloses a graph partitioning system for running neural networks on resource constrained hardware systems. The graph partitioning system used for partitioning a neural network graph into a series of sub-graphs and further allow the multiple sub-graphs to be executed in available hardware subsystems. The system based on cost function as estimated computation time and memory bandwidth of partitioned sub-graphs. The graph partitioning system is a cycle estimation model of hardware that can run fast and parameterize memory latency. The graph partitioning system supports heterogeneous partition for different type accelerators such as CPU, GPU, ASIC. The present invention also discloses a method for partitioning neural network graph in to series of sub-graphs.

    Infrared temperature measurement fused with facial identification in an access control system

    公开(公告)号:US11610432B2

    公开(公告)日:2023-03-21

    申请号:US17953958

    申请日:2022-09-27

    IPC分类号: G06V40/16 G01J5/02 H04N23/611

    摘要: An example method of infrared access, comprising, receiving a plurality of visual images, receiving a plurality of infrared images, calibrating pairs of at least one of the plurality of visual images to a respective at least one of the plurality of infrared images, determining an average temperature of the plurality of infrared images, determining a temperature of the respective calibrated pairs, and granting access if a visual image of the calibrated pair is authenticated by a facial recognition library and if the temperature of the calibrated pair is within a predefined threshold of the average temperature.

    PRECISE OBJECT SEGMENTATION WITH MULTI- MODAL INPUT FOR REALTIME VIDEO APPLICATION

    公开(公告)号:US20230083896A1

    公开(公告)日:2023-03-16

    申请号:US17474774

    申请日:2021-09-14

    发明人: Fangwen Tu Bo Li

    IPC分类号: G06V20/40 G06T7/215 G06T7/194

    摘要: The present invention discloses a system for precise representation of object segmentation with multi-modal input for real-time video applications. The multi-modal segmentation system takes advantage of optical, temporal as well as spatial information to enhance the segmentation for AR and VR or other entrainment purpose with accurate details. The system can segment foreground objects such as human and salient objects within a video frame and allows locating object-of-interest for multiple-purposes.

    ANTI-SHAKE IMAGE PROCESSING METHOD, APPARATUS, ELECTRONIC DEVICE AND STORAGE MEDIUM

    公开(公告)号:US20230036081A1

    公开(公告)日:2023-02-02

    申请号:US17829903

    申请日:2022-06-01

    发明人: Zhenwei Yuan

    IPC分类号: H04N5/232

    摘要: An anti-shake image processing method, apparatus, electronic device and storage medium are disclosed. The method includes the following steps. A current actual posture of an imaging device at a current shooting moment of capturing an original image of current frame and multiple reference actual postures of the imaging device at multiple shooting moments of capturing the original images of adjacent multiple frames of the original image of the current frame are obtained. A path smoothing process is performed to determine a current virtual posture after path smoothing at the current shooting moment. A coordinate transformation is performed on the original image of the current frame captured in the current actual posture to an estimated position of the original image of the current frame when captured in the current virtual posture in a pixel coordinate system to obtain a first correction image of the current frame.

    GENERATING STEREO-BASED DENSE DEPTH IMAGES

    公开(公告)号:US20230035671A1

    公开(公告)日:2023-02-02

    申请号:US17376027

    申请日:2021-07-14

    发明人: Tiecheng Wu Bo Li

    IPC分类号: G06T7/55 G06T5/00 G06N20/00

    摘要: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for generating a depth image, comprising obtaining data representing a first image generated by a first sensor and a second image generated by a second sensor, wherein each of the first and second images includes a plurality of pixels; determining, for each pixel of the plurality of pixels included in the first image, whether the pixel is a boundary pixel associated with a boundary of an object that is represented in the first image; determining, from a plurality of candidate penalty values and for each pixel in the first image, an optimized penalty value for the pixel; generating an optimized cost function for the first image based on the optimized penalty values for the plurality of pixels; and generating a depth image for the first image based on the optimized cost function.