-
公开(公告)号:US11361489B2
公开(公告)日:2022-06-14
申请号:US16946346
申请日:2020-06-17
Applicant: Snap Inc.
Inventor: Linjie Yang , Kevin Dechau Tang , Jianchao Yang , Jia Li
Abstract: A dense captioning system and method is provided for analyzing an image to generate proposed bounding regions for a plurality of visual concepts within the image, generating a region feature for each proposed bounding region to generate a plurality of region features of the image, and determining a context feature for the image using a proposed bounding region that is a largest in size of the proposed bounding regions. For each region feature of the plurality of region features of the image, the dense captioning system and method further provides for analyzing the region feature to determine for the region feature a detection score that indicates a likelihood that the region feature comprises an actual object, and generating a caption for a visual concept in the image using the region feature and the context feature when a detection score is above a specified threshold value.
-
公开(公告)号:US11308706B2
公开(公告)日:2022-04-19
申请号:US16927273
申请日:2020-07-13
Applicant: Snap Inc.
Inventor: Jia Li , Linjie Luo , Rahul Bhupendra Sheth , Ning Xu , Jianchao Yang
Abstract: Systems and methods for local augmented reality (AR) tracking of an AR object are disclosed. In one example embodiment a device captures a series of video image frames. A user input is received at the device associating a first portion of a first image of the video image frames with an AR sticker object and a target. A first target template is generated to track the target across frames of the video image frames. In some embodiments, global tracking based on a determination that the target is outside a boundary area is used. The global tracking comprises using a global tracking template for tracking movement in the video image frames captured following the determination that the target is outside the boundary area. When the global tracking determines that the target is within the boundary area, local tracking is resumed along with presentation of the AR sticker object on an output display of the device.
-
公开(公告)号:US11288879B2
公开(公告)日:2022-03-29
申请号:US15929374
申请日:2020-04-29
Applicant: Snap Inc.
Inventor: Travis Chen , Samuel Edward Hare , Yuncheng Li , Tony Mathew , Jonathan Solichin , Jianchao Yang , Ning Zhang
Abstract: Systems, devices, media, and methods are presented for object detection and inserting graphical elements into an image stream in response to detecting the object. The systems and methods detect an object of interest in received frames of a video stream. The systems and methods identify a bounding box for the object of interest and estimate a three-dimensional position of the object of interest based on a scale of the object of interest. The systems and methods generate one or more graphical elements having a size based on the scale of the object of interest and a position based on the three-dimensional position estimated for the object of interest. The one or more graphical elements are generated within the video stream to form a modified video stream. The systems and methods cause presentation of the modified video stream including the object of interest and the one or more graphical elements.
-
公开(公告)号:US20210295874A1
公开(公告)日:2021-09-23
申请号:US17303817
申请日:2021-06-08
Applicant: Snap Inc.
Inventor: Jia Li , Nathan Litke , Jose Jesus (Joseph) Paredes , Rahul Bhupendra Sheth , Daniel Szeto , Ning Xu , Jianchao Yang
IPC: G11B27/10 , G11B27/031
Abstract: Systems and methods are described for determining a first media item related to an event, of a plurality of stored media items each comprising video content related to the event, that was captured in a device orientation corresponding to a first device orientation detected for the first computing device; providing, to the first computing device, the first media item to be displayed on the first computing device; in response to a detected change to a second device orientation for the first computing device, determining a second media item that was captured in a device orientation corresponding to the second device orientation detected for the first computing device; and providing, to the first computing device, the second media item to be displayed on the first computing device.
-
公开(公告)号:US11122218B2
公开(公告)日:2021-09-14
申请号:US16722721
申请日:2019-12-20
Applicant: Snap Inc.
Inventor: Jia Li , Nathan Litke , Jose Jesus (Joseph) Paredes , Rahul Bhupendra Sheth , Daniel Szeto , Ning Xu , Jianchao Yang
Abstract: Systems and methods are described for determining that the user interaction with a display of a computing device during display of a video comprising a sequence of frames indicates a region of interest in a current frame of the sequence of frames of the displayed video. For each frame of the sequence of frames after the current frame, the frame is cropped to generate a cropped frame comprising a portion of the frame including the region of interest in the frame, the cropped frame is enlarged based on a display size corresponding to an angle or orientation of the computing device during display of the video, and the enlarged cropped frame replaces the frame such that the enlarged cropped frame is displayed in the sequence of frames of the video on the display of the computing device instead of the frame.
-
公开(公告)号:US11100311B2
公开(公告)日:2021-08-24
申请号:US16509083
申请日:2019-07-11
Applicant: Snap Inc.
Inventor: Jia Li , Xutao Lv , Xiaoyu Wang , Xuehan Xiong , Jianchao Yang
Abstract: Systems, devices, media, and methods are presented for modeling facial representations using image segmentation with a client device. The systems and methods receive an image depicting a face, detect at least a portion of the face within the image, and identify a set of facial features within the portion of the face. The systems and methods generate a descriptor function representing the set of facial features, fit object functions of the descriptor function, identify an identification probability for each facial feature, and assign an identification to each facial feature.
-
公开(公告)号:US10726306B1
公开(公告)日:2020-07-28
申请号:US16226035
申请日:2018-12-19
Applicant: Snap Inc.
Inventor: Linjie Yang , Kevin Dechau Tang , Jianchao Yang , Jia Li
Abstract: A dense captioning system and method is provided for analyzing an image to generate proposed bounding regions for a plurality of visual concepts within the image, generating a region feature for each proposed bounding region to generate a plurality of region features of the image, and determining a context feature for the image using a proposed bounding region that is a largest in size of the proposed bounding regions. For each region feature of the plurality of region features of the image, the dense captioning system and method further provides for analyzing the region feature to determine for the region feature a detection score that indicates a likelihood that the region feature comprises an actual object, and generating a caption for a visual concept in the image using the region feature and the context feature when a detection score is above a specified threshold value.
-
公开(公告)号:US20200050866A1
公开(公告)日:2020-02-13
申请号:US16654898
申请日:2019-10-16
Applicant: Snap Inc.
Inventor: Samuel Edward Hare , Fedir Poliakov , Guohui Wang , Xuehan Xiong , Jianchao Yang , Linjie Yang , Shah Tanmay Anilkumar
Abstract: A mobile device can generate real-time complex visual image effects using asynchronous processing pipeline. A first pipeline applies a complex image process, such as a neural network, to keyframes of a live image sequence. A second pipeline generates flow maps that describe feature transformations in the image sequence. The flow maps can be used to process non-keyframes on the fly. The processed keyframes and non-keyframes can be used to display a complex visual effect on the mobile device in real-time or near real-time.
-
公开(公告)号:US10395100B1
公开(公告)日:2019-08-27
申请号:US16226084
申请日:2018-12-19
Applicant: Snap Inc.
Inventor: Jia Li , Xutao Lv , Xiaoyu Wang , Xuehan Xiong , Jianchao Yang
Abstract: Systems, devices, media, and methods are presented for modeling facial representations using image segmentation with a client device. The systems and methods receive an image depicting a face, detect at least a portion of the face within the image, and identify a set of facial features within the portion of the face. The systems and methods generate a descriptor function representing the set of facial features, fit object functions of the descriptor function, identify an identification probability for each facial feature, and assign an identification to each facial feature.
-
公开(公告)号:US20190087660A1
公开(公告)日:2019-03-21
申请号:US15706096
申请日:2017-09-15
Applicant: Snap Inc.
Inventor: Samuel Edward Hare , Fedir Poliakov , Guohui Wang , Xuehan Xiong , Jianchao Yang , Linjie Yang , Shah Tanmay Anilkumar
Abstract: A mobile device can generate real-time complex visual image effects using asynchronous processing pipeline. A first pipeline applies a complex image process, such as a neural network, to keyframes of a live image sequence. A second pipeline generates flow maps that describe feature transformations in the image sequence. The flow maps can be used to process non-keyframes on the fly. The processed keyframes and non-keyframes can be used to display a complex visual effect on the mobile device in real-time or near real-time.
-
-
-
-
-
-
-
-
-