Residual context refinement network architecture for optical character recognition

    公开(公告)号:US11308354B1

    公开(公告)日:2022-04-19

    申请号:US16834997

    申请日:2020-03-30

    Abstract: Techniques for recognizing text in an image are described. An exemplary method may include receiving a request to recognize text in an image; extracting features from the image and generating a visual feature sequence from the extracted features; performing selective contextual refinement at least one selective contextual refinement block of a stack of selective contextual refinement blocks to generate a text prediction by: generating a contextual feature map and combining the contextual feature map with the visual feature sequence into a visual feature space, and applying a selective decoder that utilizes a two-step attention on the visual feature space to generate a text prediction, wherein the two-step attention includes performing a 1-D self-attention computation to generate attentional features and decoding the attentional features to generate the text prediction; and outputting the generated text prediction.

    Enhanced compression of video data

    公开(公告)号:US10659787B1

    公开(公告)日:2020-05-19

    申请号:US16137398

    申请日:2018-09-20

    Abstract: Techniques are generally described for enhanced compression of video data. In various examples, the techniques may include receiving first video data representing a scene in an environment. The techniques may further include generating illumination map data representing illumination of the scene in the first video data. The techniques may further comprise generating reflectance map data representing a reflectance of at least one object in the first video data. In some examples, the techniques may include sending, to a second computing device, the illumination map data and the reflectance map data. The techniques may further include receiving second video data representing the scene. The techniques may include determining a first illumination difference between the second video data and the first video data. The techniques may comprise sending, to the second computing device, the first illumination difference.

Patent Agency Ranking