-
公开(公告)号:US11741684B2
公开(公告)日:2023-08-29
申请号:US17332520
申请日:2021-05-27
Inventor: Hao Sun , Fu Li , Xin Li , Tianwei Lin
IPC: G06V10/20 , G06V10/25 , G06T7/11 , G06T7/90 , G06T3/60 , G06V40/16 , G06V10/75 , G06V10/772 , G06V20/20
CPC classification number: G06V10/25 , G06T3/60 , G06T7/11 , G06T7/90 , G06V10/758 , G06V10/772 , G06V20/20 , G06V40/162 , G06T2207/30201
Abstract: The disclosure provides an image processing method, an image processing apparatus, an electronic device and a storage medium, which belongs to the field of computer technologies, and specifically relates to computing vision, image processing, face recognition, and deep learning technologies in artificial intelligence. The method includes: performing skin color recognition on a face image to be processed to determine a target skin color of a face contained in the face image; obtaining a reference transformation image corresponding to the face image by processing the face image using any style transfer model in response that a style transfer model set does not comprise a style transfer model corresponding to the target skin color; and obtaining a target transformation image matching the target skin color by adjusting a hue value, a saturation value, and a lightness value of each pixel in the target region based on the target skin color.
-
公开(公告)号:US11463631B2
公开(公告)日:2022-10-04
申请号:US17025255
申请日:2020-09-18
Inventor: Henan Zhang , Xin Li , Fu Li , Tianwei Lin , Hao Sun , Shilei Wen , Hongwu Zhang , Errui Ding
Abstract: Embodiments of the present disclosure provide a method and apparatus for generating an image. The method may include: receiving a first image including a face input by a user in an interactive scene; presenting the first image to the user; inputting the first image into a pre-trained generative adversarial network in a backend to obtain a second image output by the generative adversarial network; where the generative adversarial network uses face attribute information generated based on the input image as a constraint; and presenting the second image to the user in response to obtaining the second image output by the generative adversarial network in the backend.
-
公开(公告)号:US11363271B2
公开(公告)日:2022-06-14
申请号:US17125370
申请日:2020-12-17
Inventor: Chao Li , Yukang Ding , Dongliang He , Fu Li , Hao Sun , Shilei Wen , Hongwu Zhang , Errui Ding
IPC: H04N19/132 , H04N19/172 , G06K9/62 , G06N3/04
Abstract: A method for video frame interpolation, a related electronic device and a storage medium is disclosed. A video is obtained. An (i−1)th frame and an ith frame of the video are obtained. Visual semantic feature maps and depth maps of the (i−1)th frame and the ith frame are obtained. Frame interpolation information is obtained based on the visual semantic feature maps and the depth maps. An interpolated frame between the (i−1)th frame and the ith frame is generated based on the frame interpolation information and the (i−1)th frame and is inserted between the (i−1)th frame and the ith frame.
-
4.
公开(公告)号:US20210312686A1
公开(公告)日:2021-10-07
申请号:US17350449
申请日:2021-06-17
Inventor: Tianwei Lin , Fu Li , Xiaoqing Ye , Henan Zhang , Xin Li
Abstract: The present disclosure discloses a method and apparatus for generating a human body three-dimensional model, a device and a storage medium. The method may include: receiving a single human body image, and extracting an SMPL human body three-dimensional model corresponding to the human body image and a PIFu human body three-dimensional model corresponding to the human body image; matching the SMPL human body three-dimensional model with the PIFu human body three-dimensional model to obtain a matching result; determining a vertex of the SMPL human body three-dimensional model closest to a vertex of the PIFu human body three-dimensional model based on the matching result to obtain a binding weight of the vertex of the PIFu human body three-dimensional model and each skeleton point of the SMPL human body three-dimensional model; and outputting a drivable human body three-dimensional model.
-
公开(公告)号:US11514676B2
公开(公告)日:2022-11-29
申请号:US17116578
申请日:2020-12-09
Inventor: Zhichao Zhou , Dongliang He , Fu Li , Hao Sun
Abstract: The present disclosure provides a method and apparatus for detecting a region of interest in a video, a device and a storage medium. The method may include: acquiring a current to-be-processed frame from a picture frame sequence of a video; detecting a region of interest (ROI) in the current to-be-processed frame, in response to determining that the current to-be-processed frame is a detection picture frame, to determine at least one ROI in the current to-be-processed frame; and updating a to-be-tracked ROI, based on the ROI in the current to-be-processed frame and a tracking result determined by a pre-order tracking picture frame; and tracking the current to-be-processed frame based on the existing to-be-tracked ROI, in response to determining that the current to-be-processed frame is a tracking picture frame, to determine at least one tracking result as the ROI of the current to-be-processed frame.
-
公开(公告)号:US11490168B2
公开(公告)日:2022-11-01
申请号:US17026488
申请日:2020-09-21
Inventor: Fu Li , Dongliang He , Hao Sun
IPC: H04N21/234 , H04N21/439 , H04N21/44 , H04N21/466 , H04N21/81 , H04N21/658 , H04N21/8549
Abstract: Embodiments of the present disclosure relate to a method and apparatus for selecting a video clip, a server and a medium. The method may include: determining at least two video clips from a video; for each video clip, perform following excitement determination steps: inputting a feature sequence of a video frame in the video clip and title information of the video into a pre-established prediction model to obtain a relevance between the inputted video frame and a title of the video; and determining an excitement of the video clip, based on the relevance between the video frame in the video clip and the title; and determining a target video clip from the video clips, based on the excitement of each of the video clips.
-
公开(公告)号:US11256920B2
公开(公告)日:2022-02-22
申请号:US16830895
申请日:2020-03-26
Inventor: Xiang Long , Dongliang He , Fu Li , Zhizhen Chi , Zhichao Zhou , Xiang Zhao , Ping Wang , Hao Sun , Shilei Wen , Errui Ding
Abstract: A method and an apparatus for classifying a video are provided. The method may include: acquiring a to-be-classified video; extracting a set of multimodal features of the to-be-classified video; inputting the set of multimodal features into a post-fusion model corresponding to each modal respectively, to obtain multimodal category information of the to-be-classified video; and fusing the multimodal category information of the to-be-classified video, to obtain category information of the to-be-classified video. This embodiment improves the accuracy of video classification.
-
公开(公告)号:US20200374526A1
公开(公告)日:2020-11-26
申请号:US16797911
申请日:2020-02-21
Inventor: Zhichao Zhou , Dongliang He , Fu Li , Xiang Zhao , Xin Li , Zhizhen Chi , Xiang Long , Hao Sun
IPC: H04N19/14 , H04N19/12 , H04N19/625 , G06N3/08
Abstract: A method, device, apparatus for predicting a video coding complexity and a computer-readable storage medium are provided. The method includes: acquiring an attribute feature of a target video; extracting a plurality of first target image frames from the target video; performing a frame difference calculation on the plurality of the first target image frames, to acquire a plurality of first frame difference images; determining a histogram feature for frame difference images of the target video according to a statistical histogram of each first frame difference image; and inputting a plurality of features of the target video into a coding complexity prediction model to acquire a coding complexity prediction value of the target video. Through the above method, the BPP prediction value can be acquired intelligently.
-
公开(公告)号:US11983849B2
公开(公告)日:2024-05-14
申请号:US17203437
申请日:2021-03-16
Inventor: Chao Li , Dongliang He , Fu Li , Hao Sun
IPC: G06T9/00 , G06F18/214 , G06T3/4046 , G06T5/00 , G06T11/40 , G06T11/60 , G06V10/44 , G06V10/774 , G06V10/82
CPC classification number: G06T5/005 , G06F18/214 , G06T3/4046 , G06T9/00 , G06T11/40 , G06T11/60 , G06V10/454 , G06V10/774 , G06V10/82
Abstract: An image filling method and apparatus, a device and a storage medium are disclosed. The image filling method includes: performing multilevel encoding processing on features of an image to be filled to generate multilevel encoded feature layers, sizes of the multilevel encoded feature layers being reduced layer by layer; performing layer-by-layer decoding processing on the multilevel encoded feature layers to obtain multilevel decoded feature layers and a first image, there being no missing region in the first image, wherein the layer-by-layer decoding processing includes a concatenation operation on a decoded feature layer and an encoded feature layer with a same size; and performing up-sampling processing on the first image to obtain multilevel up-sampled feature layers and a second image optimized by the up-sampling processing, the up-sampling processing including a concatenation operation on an up-sampled feature layer and a decoded feature layer with a same size.
-
公开(公告)号:US11875601B2
公开(公告)日:2024-01-16
申请号:US17382582
申请日:2021-07-22
Inventor: Xin Li , Fu Li , Tianwei Lin , Henan Zhang
IPC: G06V40/16 , G06T3/00 , G06T5/00 , G06F18/214 , G06V10/75
CPC classification number: G06V40/171 , G06F18/214 , G06T3/0075 , G06T5/005 , G06V10/757 , G06V40/165 , G06V40/174
Abstract: A meme generation method, an electronic device, and a storage medium are provided. The method includes: determining a plurality of second expression images corresponding to a target face image based on a plurality of first expression images contained in a first meme; generating a second meme corresponding to the target face image based on the plurality of second expression images corresponding to the target face image; wherein, determining an affine transformation parameter between the target face image and an i-th first expression image in the plurality of first expression images according to a corresponding relation between a face key point in the target face image and a face key point in the i-th first expression image; and transforming the target face image based on the affine transformation parameter to obtain an i-th second expression image corresponding to the target face image.
-
-
-
-
-
-
-
-
-