-
1.
Publication No.: US11748895B2
Publication date: 2023-09-05
Application No.: US17184379
Application date: 2021-02-24
Inventor: Tianwei Lin, Xin Li, Fu Li, Dongliang He, Hao Sun, Henan Zhang
CPC classification number: G06T7/246, G06F18/253, G06N3/04, G06V20/41, G06V20/46
Abstract: A method and apparatus for processing a video frame are provided. The method may include: converting, using an optical flow generated based on a previous frame and a next frame of adjacent frames in a video, a feature map of the previous frame to obtain a converted feature map; determining, based on an error of the optical flow, a weight of the converted feature map, and obtaining a fused feature map based on a weighted result of a feature of the converted feature map and a feature of a feature map of the next frame; and updating the feature map of the next frame as the fused feature map.
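The warp-then-fuse step described in this abstract can be sketched roughly as follows. This is a minimal illustration, not the patented implementation: the flow convention (each next-frame pixel points back to its source in the previous frame), nearest-neighbour sampling, the exponential weighting, and the names `warp`, `fuse`, and `sharpness` are all assumptions made for the example.

```python
import math

def warp(prev_feat, flow):
    """Warp a single-channel feature map with per-pixel (dx, dy) flow vectors."""
    h, w = len(prev_feat), len(prev_feat[0])
    out = [[0.0] * w for _ in range(h)]
    for y in range(h):
        for x in range(w):
            dx, dy = flow[y][x]
            # Assumed convention: the flow points from a next-frame pixel to its
            # source in the previous frame; coordinates are clamped to the border.
            sx = min(max(int(round(x + dx)), 0), w - 1)
            sy = min(max(int(round(y + dy)), 0), h - 1)
            out[y][x] = prev_feat[sy][sx]
    return out

def fuse(prev_feat, next_feat, flow, flow_error, sharpness=1.0):
    """Blend the warped previous features into the next frame's features."""
    warped = warp(prev_feat, flow)
    h, w = len(next_feat), len(next_feat[0])
    fused = [[0.0] * w for _ in range(h)]
    for y in range(h):
        for x in range(w):
            # Hypothetical weighting: zero flow error gives an equal blend,
            # a large error makes the warped feature's weight vanish.
            weight = 0.5 * math.exp(-sharpness * flow_error[y][x])
            fused[y][x] = weight * warped[y][x] + (1.0 - weight) * next_feat[y][x]
    return fused
```

With this weighting, unreliable flow regions fall back to the next frame's own features, which is one plausible reading of "determining, based on an error of the optical flow, a weight of the converted feature map".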
-
2.
Publication No.: US11615140B2
Publication date: 2023-03-28
Application No.: US17144523
Application date: 2021-01-08
Inventor: Xiang Long, Dongliang He, Fu Li, Xiang Zhao, Tianwei Lin, Hao Sun, Shilei Wen, Errui Ding
IPC: G06F16/738, G06V20/40, G06F18/214, G06F18/25
Abstract: A method includes screening, by a video-clip screening module in a video description model, a plurality of video proposal clips acquired from a video to be analyzed, to acquire a plurality of video clips suitable for description. Each screened video clip is then described by a video-clip describing module. This avoids describing every video proposal clip: only the screened clips, which correlate strongly with the video and are suitable for description, are described. Descriptions of unsuitable clips therefore no longer interfere with the description of the video, which guarantees the accuracy of the final clip descriptions and improves their quality.
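The screen-then-describe flow in this abstract can be sketched as a two-stage pipeline. This is only an illustration: in the patent both stages are modules of a video description model, whereas here `score_fn`, `describe_fn`, and `threshold` are hypothetical stand-ins supplied by the caller.

```python
def describe_video(proposals, score_fn, describe_fn, threshold=0.5):
    """Describe only the proposal clips that pass the screening stage."""
    # Screening stage: keep clips whose (hypothetical) suitability score is high enough.
    kept = [clip for clip in proposals if score_fn(clip) >= threshold]
    # Describing stage: run the describing step only on the screened clips.
    return [(clip, describe_fn(clip)) for clip in kept]

# Toy usage: clips are (start, end) pairs and longer clips score higher.
clips = [(0, 5), (5, 6), (10, 20)]
results = describe_video(
    clips,
    score_fn=lambda c: (c[1] - c[0]) / 10,
    describe_fn=lambda c: f"clip {c[0]}-{c[1]}",
)
```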
-
3.
Publication No.: US20210350508A1
Publication date: 2021-11-11
Application No.: US17382582
Application date: 2021-07-22
Inventor: Xin Li, Fu Li, Tianwei Lin, Henan Zhang
Abstract: A meme generation method, an electronic device, and a storage medium are provided. The method includes: determining a plurality of second expression images corresponding to a target face image based on a plurality of first expression images contained in a first meme; and generating a second meme corresponding to the target face image based on the plurality of second expression images corresponding to the target face image; wherein determining the plurality of second expression images includes: determining an affine transformation parameter between the target face image and an i-th first expression image in the plurality of first expression images according to a corresponding relation between a face key point in the target face image and a face key point in the i-th first expression image; and transforming the target face image based on the affine transformation parameter to obtain an i-th second expression image corresponding to the target face image.
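One common way to obtain an affine transformation parameter from corresponding face key points is a least-squares fit, sketched below. The abstract does not say how the parameter is computed, so the least-squares formulation, the 2x3 matrix layout, and the names `solve3`, `estimate_affine`, and `apply_affine` are assumptions for illustration.

```python
def solve3(m, v):
    """Solve a 3x3 linear system by Gauss-Jordan elimination with partial pivoting."""
    a = [row[:] + [rhs] for row, rhs in zip(m, v)]
    for col in range(3):
        pivot = max(range(col, 3), key=lambda r: abs(a[r][col]))
        if abs(a[pivot][col]) < 1e-12:
            raise ValueError("key points are collinear; the affine fit is undetermined")
        a[col], a[pivot] = a[pivot], a[col]
        for r in range(3):
            if r != col:
                f = a[r][col] / a[col][col]
                a[r] = [x - f * y for x, y in zip(a[r], a[col])]
    return [a[i][3] / a[i][i] for i in range(3)]

def estimate_affine(src_pts, dst_pts):
    """Least-squares 2x3 affine matrix mapping src_pts onto dst_pts."""
    # Normal equations (P^T P) row = P^T target, where each row of P is [x, y, 1].
    ptp = [[0.0] * 3 for _ in range(3)]
    ptu, ptv = [0.0] * 3, [0.0] * 3
    for (x, y), (u, v) in zip(src_pts, dst_pts):
        p = (x, y, 1.0)
        for i in range(3):
            for j in range(3):
                ptp[i][j] += p[i] * p[j]
            ptu[i] += p[i] * u
            ptv[i] += p[i] * v
    return [solve3(ptp, ptu), solve3(ptp, ptv)]

def apply_affine(matrix, point):
    """Map one (x, y) point through the 2x3 affine matrix."""
    x, y = point
    return tuple(row[0] * x + row[1] * y + row[2] for row in matrix)
```

In this sketch `src_pts` would be the key points of the target face image and `dst_pts` those of the i-th first expression image; repeating the fit for each expression image gives one transform per frame of the meme.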
-
4.
Publication No.: US11600069B2
Publication date: 2023-03-07
Application No.: US17144205
Application date: 2021-01-08
Inventor: Tianwei Lin, Xin Li, Dongliang He, Fu Li, Hao Sun, Shilei Wen, Errui Ding
Abstract: A method and apparatus for detecting a temporal action of a video, an electronic device and a storage medium are disclosed, relating to the field of video processing technologies. An implementation includes: acquiring an initial temporal feature sequence of a video to be detected; acquiring, by a pre-trained video-temporal-action detecting module, implicit features and explicit features of a plurality of configured temporal anchor boxes based on the initial temporal feature sequence; and acquiring, by the video-temporal-action detecting module, the starting position and the ending position of a video clip containing a specified action, the category of the specified action and the probability that the specified action belongs to the category from the plurality of temporal anchor boxes according to the explicit features and the implicit features of the plurality of temporal anchor boxes.
-
5.
Publication No.: US11741684B2
Publication date: 2023-08-29
Application No.: US17332520
Application date: 2021-05-27
Inventor: Hao Sun, Fu Li, Xin Li, Tianwei Lin
IPC: G06V10/20, G06V10/25, G06T7/11, G06T7/90, G06T3/60, G06V40/16, G06V10/75, G06V10/772, G06V20/20
CPC classification number: G06V10/25, G06T3/60, G06T7/11, G06T7/90, G06V10/758, G06V10/772, G06V20/20, G06V40/162, G06T2207/30201
Abstract: The disclosure provides an image processing method, an image processing apparatus, an electronic device and a storage medium, which belong to the field of computer technologies, and specifically relate to computer vision, image processing, face recognition, and deep learning technologies in artificial intelligence. The method includes: performing skin color recognition on a face image to be processed to determine a target skin color of a face contained in the face image; obtaining a reference transformation image corresponding to the face image by processing the face image using any style transfer model in response to determining that a style transfer model set does not comprise a style transfer model corresponding to the target skin color; and obtaining a target transformation image matching the target skin color by adjusting a hue value, a saturation value, and a lightness value of each pixel in the target region based on the target skin color.
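The per-pixel hue, saturation, and lightness adjustment can be illustrated with Python's standard `colorsys` module. The abstract does not specify the adjustment rule, so the offset-based rule below (shift each pixel by the HLS difference between an assumed reference skin color and the target skin color) and the name `shift_toward_skin` are assumptions.

```python
import colorsys

def shift_toward_skin(pixel, reference_skin, target_skin):
    """Shift one RGB pixel (components in [0, 1]) by the HLS offset between two skin colors."""
    h, l, s = colorsys.rgb_to_hls(*pixel)
    rh, rl, rs = colorsys.rgb_to_hls(*reference_skin)
    th, tl, ts = colorsys.rgb_to_hls(*target_skin)
    # Hue is circular, so wrap it; lightness and saturation are clamped to [0, 1].
    h = (h + th - rh) % 1.0
    l = min(max(l + tl - rl, 0.0), 1.0)
    s = min(max(s + ts - rs, 0.0), 1.0)
    return colorsys.hls_to_rgb(h, l, s)

# A pixel that exactly matches the reference skin color lands on the target color.
reference = (0.9, 0.7, 0.6)
target = (0.5, 0.35, 0.3)
adjusted = shift_toward_skin(reference, reference, target)
```

Applying the function only to pixels inside the target region, rather than the whole image, would match the abstract more closely; the region mask is left out here for brevity.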
-
6.
Publication No.: US11463631B2
Publication date: 2022-10-04
Application No.: US17025255
Application date: 2020-09-18
Inventor: Henan Zhang, Xin Li, Fu Li, Tianwei Lin, Hao Sun, Shilei Wen, Hongwu Zhang, Errui Ding
Abstract: Embodiments of the present disclosure provide a method and apparatus for generating an image. The method may include: receiving a first image including a face input by a user in an interactive scene; presenting the first image to the user; inputting the first image into a pre-trained generative adversarial network in a backend to obtain a second image output by the generative adversarial network; where the generative adversarial network uses face attribute information generated based on the input image as a constraint; and presenting the second image to the user in response to obtaining the second image output by the generative adversarial network in the backend.
-
7.
Publication No.: US20210312686A1
Publication date: 2021-10-07
Application No.: US17350449
Application date: 2021-06-17
Inventor: Tianwei Lin, Fu Li, Xiaoqing Ye, Henan Zhang, Xin Li
Abstract: The present disclosure discloses a method and apparatus for generating a human body three-dimensional model, a device and a storage medium. The method may include: receiving a single human body image, and extracting an SMPL human body three-dimensional model corresponding to the human body image and a PIFu human body three-dimensional model corresponding to the human body image; matching the SMPL human body three-dimensional model with the PIFu human body three-dimensional model to obtain a matching result; determining a vertex of the SMPL human body three-dimensional model closest to a vertex of the PIFu human body three-dimensional model based on the matching result to obtain a binding weight of the vertex of the PIFu human body three-dimensional model and each skeleton point of the SMPL human body three-dimensional model; and outputting a drivable human body three-dimensional model.
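The nearest-vertex step that yields binding weights can be sketched as a brute-force lookup. This assumes the two meshes have already been matched into a common coordinate frame and that each SMPL vertex carries one weight per skeleton point; the name `transfer_binding_weights` and the data layout are hypothetical.

```python
import math

def transfer_binding_weights(pifu_vertices, smpl_vertices, smpl_weights):
    """Give each PIFu vertex the skeleton weights of its closest SMPL vertex."""
    result = []
    for vertex in pifu_vertices:
        # Brute-force nearest neighbour; a spatial index would be needed for real meshes.
        nearest = min(
            range(len(smpl_vertices)),
            key=lambda i: math.dist(vertex, smpl_vertices[i]),
        )
        result.append(list(smpl_weights[nearest]))
    return result

# Two SMPL vertices, each bound entirely to one of two skeleton points.
smpl_vertices = [(0.0, 0.0, 0.0), (1.0, 0.0, 0.0)]
smpl_weights = [[1.0, 0.0], [0.0, 1.0]]
pifu_vertices = [(0.1, 0.0, 0.0), (0.9, 0.2, 0.0)]
weights = transfer_binding_weights(pifu_vertices, smpl_vertices, smpl_weights)
```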
-
8.
Publication No.: US11875601B2
Publication date: 2024-01-16
Application No.: US17382582
Application date: 2021-07-22
Inventor: Xin Li, Fu Li, Tianwei Lin, Henan Zhang
IPC: G06V40/16, G06T3/00, G06T5/00, G06F18/214, G06V10/75
CPC classification number: G06V40/171, G06F18/214, G06T3/0075, G06T5/005, G06V10/757, G06V40/165, G06V40/174
Abstract: A meme generation method, an electronic device, and a storage medium are provided. The method includes: determining a plurality of second expression images corresponding to a target face image based on a plurality of first expression images contained in a first meme; and generating a second meme corresponding to the target face image based on the plurality of second expression images corresponding to the target face image; wherein determining the plurality of second expression images includes: determining an affine transformation parameter between the target face image and an i-th first expression image in the plurality of first expression images according to a corresponding relation between a face key point in the target face image and a face key point in the i-th first expression image; and transforming the target face image based on the affine transformation parameter to obtain an i-th second expression image corresponding to the target face image.
-
9.
Publication No.: US11568590B2
Publication date: 2023-01-31
Application No.: US17373420
Application date: 2021-07-12
Inventor: Tianwei Lin, Fu Li, Xin Li, Henan Zhang, Hao Sun
Abstract: The disclosure discloses a cartoonization processing method for an image, and relates to the fields of computer vision, image processing, face recognition, and deep learning technologies. The method includes: performing skin color recognition on a facial image to be processed to determine a target skin color of a face in the facial image; processing the facial image by utilizing any cartoonizing model in a cartoonizing model set to obtain a reference cartoonized image corresponding to the facial image in a case that the cartoonizing model set does not contain a cartoonizing model corresponding to the target skin color; determining a pixel adjustment parameter based on the target skin color and a reference skin color corresponding to the cartoonizing model used; and adjusting a pixel value of each pixel point in the reference cartoonized image based on the pixel adjustment parameter, to obtain a target cartoonized image corresponding to the facial image.
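The fallback logic in this abstract (use a matching model if one exists, otherwise cartoonize with any model and correct the pixels) can be sketched as below. The per-channel gain used as the "pixel adjustment parameter", the dictionary-based model set, and the names `cartoonize` and `skin_rgb` are assumptions; the models themselves are represented by placeholder callables.

```python
def cartoonize(image, target_skin, model_set, skin_rgb):
    """image: list of (r, g, b) tuples in 0-255; model_set maps a skin label to a callable."""
    if target_skin in model_set:
        # A model trained for this skin color exists, so no correction is needed.
        return model_set[target_skin](image)
    # Fallback: take any available model and remember its reference skin color.
    reference_skin, model = next(iter(model_set.items()))
    reference_image = model(image)
    # Hypothetical pixel adjustment parameter: per-channel ratio of target to reference color.
    gains = [t / max(r, 1) for t, r in zip(skin_rgb[target_skin], skin_rgb[reference_skin])]
    return [
        tuple(min(255, round(c * g)) for c, g in zip(pixel, gains))
        for pixel in reference_image
    ]

# Toy usage with an identity "model" standing in for a real cartoonizing network.
model_set = {"light": lambda img: list(img)}
skin_rgb = {"light": (200, 160, 140), "dark": (100, 80, 70)}
image = [(200, 160, 140), (250, 250, 250)]
adjusted_image = cartoonize(image, "dark", model_set, skin_rgb)
```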
-
10.
Publication No.: US20220198828A1
Publication date: 2022-06-23
Application No.: US17671125
Application date: 2022-02-14
Inventor: Hao Sun, Fu Li, Xin Li, Tianwei Lin
Abstract: A method and apparatus for generating an image are provided. The method comprises: acquiring key point information of at least one of the five sense organs in a real facial image; determining, according to the key point information, a target area where the five sense organs are located in a first cartoon facial image, wherein the first cartoon facial image is generated from the real facial image; and adding pre-established material for the five sense organs to the target area to generate a second cartoon facial image.
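Deriving a target area from key points and filling it with prepared material can be sketched as a bounding box plus a paste step. The abstract does not define how the area is computed, so the bounding-box rule, the assumption that the real and cartoon images share pixel coordinates, the nearest-neighbour resize, and the names `target_area` and `paste_material` are all illustrative.

```python
def target_area(key_points, margin=0):
    """Bounding box (left, top, right, bottom) around integer (x, y) key points."""
    xs = [x for x, _ in key_points]
    ys = [y for _, y in key_points]
    return (min(xs) - margin, min(ys) - margin, max(xs) + margin, max(ys) + margin)

def paste_material(cartoon, material, area):
    """Return a copy of the cartoon image with the material stretched over the area."""
    left, top, right, bottom = area
    out = [row[:] for row in cartoon]
    mh, mw = len(material), len(material[0])
    for y in range(top, bottom + 1):
        for x in range(left, right + 1):
            if 0 <= y < len(out) and 0 <= x < len(out[0]):
                # Nearest-neighbour resize of the material to the size of the area.
                my = (y - top) * mh // (bottom - top + 1)
                mx = (x - left) * mw // (right - left + 1)
                out[y][x] = material[my][mx]
    return out

# Toy usage: a 4x4 single-channel cartoon image and a one-pixel material patch.
cartoon = [[0] * 4 for _ in range(4)]
area = target_area([(1, 1), (2, 2)])
second_cartoon = paste_material(cartoon, [[7]], area)
```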