DENSE CAPTIONING WITH JOINT INTERFERENCE AND VISUAL CONTEXT

    公开(公告)号:US20200320353A1

    公开(公告)日:2020-10-08

    申请号:US16946346

    申请日:2020-06-17

    Applicant: Snap Inc.

    Abstract: A dense captioning system and method is provided for analyzing an image to generate proposed bounding regions for a plurality of visual concepts within the image, generating a region feature for each proposed bounding region to generate a plurality of region features of the image, and determining a context feature for the image using a proposed bounding region that is a largest in size of the proposed bounding regions. For each region feature of the plurality of region features of the image, the dense captioning system and method further provides for analyzing the region feature to determine for the region feature a detection score that indicates a likelihood that the region feature comprises an actual object, and generating a caption for a visual concept in the image using the region feature and the context feature when a detection score is above a specified threshold value.

    Weakly supervised semantic parsing
    27.
    发明授权

    公开(公告)号:US11182603B1

    公开(公告)日:2021-11-23

    申请号:US16450376

    申请日:2019-06-24

    Applicant: Snap Inc.

    Abstract: Segmentation of an image into individual body parts is performed based on a trained model. The model is trained with a plurality of training images, each training image representing a corresponding training figure. The model is also trained with a corresponding plurality of segmentations of the training figures. Each segmentation is generated by positioning body parts between defined positions of joints of the represented figure. The body parts are represented by body part templates obtained from a template library, with the templates defining characteristics of body parts represented by the templates.

    Dense captioning with joint interference and visual context

    公开(公告)号:US10198671B1

    公开(公告)日:2019-02-05

    申请号:US15348501

    申请日:2016-11-10

    Applicant: SNAP INC.

    Abstract: A dense captioning system and method is provided for processing an image to produce a feature map of the image, analyzing the feature map to generate proposed bounding boxes for a plurality of visual concepts within the image, analyzing the feature map to determine a plurality of region features of the image, and analyzing the feature map to determine a context feature for the image. For each region feature of the plurality of region features of the image, the dense captioning system further provides for analyzing the region feature to determine a detection score for the region feature, calculating a caption for a bounding box for a visual concept in the image using the region feature and the context feature, and localizing the visual concept by adjusting the bounding box around the visual concept based on the caption to generate an adjusted bounding box for the visual concept.

Patent Agency Ranking