-
Publication number: US11734844B2
Publication date: 2023-08-22
Application number: US17823764
Filing date: 2022-08-31
Applicant: Snap Inc.
Inventor: Liuhao Ge, Zhou Ren, Yuncheng Li, Zehao Xue, Yingying Wang
CPC classification number: G06T7/50, G06N3/045, G06N3/08, G06T7/75, G06T17/20, G06V10/774, G06V10/82, G06V40/107, G06T2207/10004
Abstract: Aspects of the present disclosure involve a system comprising a computer-readable storage medium storing a program and a method for receiving a monocular image that includes a depiction of a hand and extracting features of the monocular image using a plurality of machine learning techniques. The program and method further include modeling, based on the extracted features, a pose of the hand depicted in the monocular image by adjusting skeletal joint positions of a three-dimensional (3D) hand mesh using a trained graph convolutional neural network (CNN); modeling, based on the extracted features, a shape of the hand in the monocular image by adjusting blend shape values of the 3D hand mesh representing surface features of the hand depicted in the monocular image using the trained graph CNN; and generating, for display, the 3D hand mesh adjusted to model the pose and shape of the hand depicted in the monocular image.
-
Publication number: US11410439B2
Publication date: 2022-08-09
Application number: US16870138
Filing date: 2020-05-08
Applicant: Snap Inc.
Inventor: Yuncheng Li, Zhou Ren, Ning Xu, Enxu Yan, Tan Yu
Abstract: Systems and methods are disclosed for capturing multiple sequences of views of a three-dimensional object using a plurality of virtual cameras. The systems and methods generate aligned sequences from the multiple sequences based on an arrangement of the plurality of virtual cameras in relation to the three-dimensional object. Using a convolutional network, the systems and methods classify the three-dimensional object based on the aligned sequences and identify the three-dimensional object using the classification.
-
Publication number: US11328008B2
Publication date: 2022-05-10
Application number: US16844300
Filing date: 2020-04-09
Applicant: Snap Inc.
Inventor: Xinran He, Jie Luo, Sushobhan Nayak, Zhou Ren, Christophe Jacky Henri Van Gysel
IPC: G06F16/435, G06Q50/00, G06N20/00, G06F16/951, G06F16/2457
Abstract: Systems and methods are provided for generating training data from queries and user interactions associated with media collections related to the queries, and training a machine learning model using the generated training data to generate a trained machine learning model. The systems and methods further provide for receiving a prediction request comprising a query for relevant media collections, analyzing the query to determine query features, determining a plurality of media collections for the query, analyzing the plurality of media collections to determine media collection features for each media collection of the plurality of media collections, and generating, using the trained machine learning model, a semantic matching score for each media collection of the plurality of media collections based on matching the query features to the media collection features for each media collection of the plurality of media collections.
-
Publication number: US20210407548A1
Publication date: 2021-12-30
Application number: US17465001
Filing date: 2021-09-02
Applicant: Snap Inc.
Inventor: Zhou Ren, Yuncheng Li, Ning Xu, Enxu Yan, Tan Yu
Abstract: Aspects of the present disclosure involve a system comprising a storage medium storing a program and method for receiving a video comprising a plurality of video segments; selecting a target action sequence that includes a sequence of action phases; receiving features of each of the video segments; computing, based on the received features, for each of the plurality of video segments, a plurality of action phase confidence scores indicating a likelihood that a given video segment includes a given action phase of the sequence of action phases; identifying a set of consecutive video segments of the plurality of video segments that corresponds to the target action sequence, wherein video segments in the set of consecutive video segments are arranged according to the sequence of action phases; and generating a display of the video that includes the set of consecutive video segments and skips other video segments in the video.
-
Publication number: US10997787B2
Publication date: 2021-05-04
Application number: US17010256
Filing date: 2020-09-02
Applicant: Snap Inc.
Inventor: Liuhao Ge, Zhou Ren, Yuncheng Li, Zehao Xue, Yingying Wang
Abstract: Aspects of the present disclosure involve a system comprising a computer-readable storage medium storing a program and a method for receiving a monocular image that includes a depiction of a hand and extracting features of the monocular image using a plurality of machine learning techniques. The program and method further include modeling, based on the extracted features, a pose of the hand depicted in the monocular image by adjusting skeletal joint positions of a three-dimensional (3D) hand mesh using a trained graph convolutional neural network (CNN); modeling, based on the extracted features, a shape of the hand in the monocular image by adjusting blend shape values of the 3D hand mesh representing surface features of the hand depicted in the monocular image using the trained graph CNN; and generating, for display, the 3D hand mesh adjusted to model the pose and shape of the hand depicted in the monocular image.
-
Publication number: US20200184721A1
Publication date: 2020-06-11
Application number: US16210927
Filing date: 2018-12-05
Applicant: Snap Inc.
Inventor: Liuhao Ge, Zhou Ren, Yuncheng Li, Zehao Xue, Yingying Wang
Abstract: Aspects of the present disclosure involve a system comprising a computer-readable storage medium storing a program and a method for receiving a monocular image that includes a depiction of a hand and extracting features of the monocular image using a plurality of machine learning techniques. The program and method further include modeling, based on the extracted features, a pose of the hand depicted in the monocular image by adjusting skeletal joint positions of a three-dimensional (3D) hand mesh using a trained graph convolutional neural network (CNN); modeling, based on the extracted features, a shape of the hand in the monocular image by adjusting blend shape values of the 3D hand mesh representing surface features of the hand depicted in the monocular image using the trained graph CNN; and generating, for display, the 3D hand mesh adjusted to model the pose and shape of the hand depicted in the monocular image.
-
Publication number: US20190297461A1
Publication date: 2019-09-26
Application number: US16296153
Filing date: 2019-03-07
Applicant: Snap Inc.
Inventor: Ebony James Charlton, Sumant Milind Hanumante, Zhou Ren, Dhritiman Sagar
Abstract: A venue system of a client device can submit a location request to a server, which returns multiple venues that are near the client device. The client device can use one or more machine learning schemes (e.g., convolutional neural networks) to determine that the client device is located in a specific one of the possible venues. The venue system can further select imagery for presentation based on the venue selection. The presentation may be published as an ephemeral message on a network platform.
-
Publication number: US12033078B2
Publication date: 2024-07-09
Application number: US18230499
Filing date: 2023-08-04
Applicant: Snap Inc.
Inventor: Yuncheng Li, Zhou Ren, Ning Xu, Enxu Yan, Tan Yu
IPC: G06N3/08, G06F18/214, G06F18/2431, G06T7/33, G06T7/55, G06V10/44, G06V10/82, G06V20/20, G06V20/40, G06V20/64
CPC classification number: G06N3/08, G06F18/214, G06F18/2431, G06T7/344, G06T7/55, G06V10/454, G06V10/82, G06V20/20, G06V20/41, G06V20/64
Abstract: Systems and methods are disclosed for capturing multiple sequences of views of a three-dimensional object using a plurality of virtual cameras. The systems and methods generate aligned sequences from the multiple sequences based on an arrangement of the plurality of virtual cameras in relation to the three-dimensional object. Using a convolutional network, the systems and methods classify the three-dimensional object based on the aligned sequences and identify the three-dimensional object using the classification.
-
Publication number: US11704893B2
Publication date: 2023-07-18
Application number: US17465001
Filing date: 2021-09-02
Applicant: Snap Inc.
Inventor: Zhou Ren, Yuncheng Li, Ning Xu, Enxu Yan, Tan Yu
CPC classification number: G06V10/454, G06V10/764, G06V10/82, G06V20/41, G06V20/46, G11B27/102, H04N9/87
Abstract: Aspects of the present disclosure involve a system comprising a storage medium storing a program and method for receiving a video comprising a plurality of video segments; selecting a target action sequence that includes a sequence of action phases; receiving features of each of the video segments; computing, based on the received features, for each of the plurality of video segments, a plurality of action phase confidence scores indicating a likelihood that a given video segment includes a given action phase of the sequence of action phases; identifying a set of consecutive video segments of the plurality of video segments that corresponds to the target action sequence, wherein video segments in the set of consecutive video segments are arranged according to the sequence of action phases; and generating a display of the video that includes the set of consecutive video segments and skips other video segments in the video.
-
Publication number: US11341177B1
Publication date: 2022-05-24
Application number: US16722776
Filing date: 2019-12-20
Applicant: Snap Inc.
Inventor: Zhou Ren, Xiaoyu Wang, Ning Zhang, Xutao Lv, Jia Li
IPC: G06K9/00, G06F16/338, G06F16/33, G06V10/44, G06V10/94, G06V10/70, G06V30/194, G06N5/02
Abstract: An image captioning system and method is provided for generating a caption for an image. The image captioning system utilizes a policy network and a value network to generate the caption. The policy network serves as a local guidance and the value network serves as a global and lookahead guidance.