-
Publication number: US11830288B2
Publication date: 2023-11-28
Application number: US17210827
Application date: 2021-03-24
Inventor: Kun Yao , Zhibin Hong , Jieting Xue
IPC: G06F18/214 , G06F18/21 , G06F18/24 , G06F18/2413 , G06F18/25 , G06V40/16 , G06N3/047 , G06N20/00 , G06T5/50 , G06T7/73 , G06V10/764 , G06V10/774 , G06V10/80 , G06V10/82 , G06N3/088 , G06N3/045
CPC classification number: G06V40/169 , G06F18/2148 , G06F18/2185 , G06F18/24765 , G06T7/74 , G06V10/764 , G06V10/774 , G06V10/80 , G06V10/82 , G06V40/168 , G06N3/045 , G06N3/088 , G06T2207/20084 , G06T2207/30201
Abstract: Embodiments of the present disclosure provide a method for training a face fusion model and an electronic device. The method includes: performing a first face changing process on a user image and a template image to generate a reference template image; adjusting poses of facial features of the template image based on the reference template image to generate a first input image; performing a second face changing process on the template image to generate a second input image; inputting the first input image and the second input image into a generator of an initial face fusion model to generate a fused face area image; and inputting the fused image and the template image into a discriminator of the initial face fusion model to obtain a result, and performing backpropagation correction on the initial face fusion model based on the result to generate a face fusion model.
-
Publication number: US20210357710A1
Publication date: 2021-11-18
Application number: US17352668
Application date: 2021-06-21
Inventor: Chengquan Zhang , Pengyuan Lv , Kun Yao , Junyu Han , Jingtuo Liu
Abstract: A text recognition method includes: acquiring an image including text information, the text information including M characters, M being a positive integer greater than 1; performing text recognition on the image to acquire character information about the M characters; recognizing reading direction information about each character in accordance with the character information about the M characters, the reading direction information being used to indicate a next character corresponding to a current character in a semantic reading order; and ranking the M characters in accordance with the reading direction information about the M characters to acquire a text recognition result of the text information.
-
Publication number: US11861919B2
Publication date: 2024-01-02
Application number: US17352668
Application date: 2021-06-21
Inventor: Chengquan Zhang , Pengyuan Lv , Kun Yao , Junyu Han , Jingtuo Liu
IPC: G06V20/00 , G06V20/62 , G06N3/08 , G06V30/262 , G06V20/58 , G06V30/148 , G06N3/045 , G06V30/28 , G06V30/10
CPC classification number: G06V20/62 , G06N3/045 , G06N3/08 , G06V20/582 , G06V20/63 , G06V30/153 , G06V30/262 , G06V30/274 , G06V30/10 , G06V30/287 , G06V30/293
Abstract: A text recognition method includes: acquiring an image including text information, the text information including M characters, M being a positive integer greater than 1; performing text recognition on the image to acquire character information about the M characters; recognizing reading direction information about each character in accordance with the character information about the M characters, the reading direction information being used to indicate a next character corresponding to a current character in a semantic reading order; and ranking the M characters in accordance with the reading direction information about the M characters to acquire a text recognition result of the text information.
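The ranking step in this abstract can be pictured with a minimal sketch. It assumes, purely for illustration, that the reading direction information has already been reduced to one "next character" index per recognized character (`None` for the last character); the function name `order_characters` and this data layout are hypothetical and are not taken from the patent.

```python
from typing import List, Optional


def order_characters(chars: List[str], next_index: List[Optional[int]]) -> str:
    """Rank characters by following per-character 'next character' links.

    next_index[i] is the index of the character read after chars[i],
    or None if chars[i] is the last character (an assumed encoding).
    """
    if len(chars) != len(next_index):
        raise ValueError("chars and next_index must have the same length")

    # The first character is the only one that no other character points to.
    targets = {n for n in next_index if n is not None}
    starts = [i for i in range(len(chars)) if i not in targets]
    if len(starts) != 1:
        raise ValueError("expected exactly one starting character")

    ordered: List[str] = []
    seen = set()
    current: Optional[int] = starts[0]
    while current is not None:
        if current in seen:
            raise ValueError("reading-direction links form a cycle")
        seen.add(current)
        ordered.append(chars[current])
        current = next_index[current]
    return "".join(ordered)


# Characters detected in arbitrary order, with links A -> B -> C.
print(order_characters(["C", "A", "B"], [None, 2, 0]))
```

Following explicit links rather than sorting by position is what lets such a scheme handle curved or vertical text, where left-to-right coordinate order does not match the semantic reading order.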
-
Publication number: US11074437B2
Publication date: 2021-07-27
Application number: US15930714
Application date: 2020-05-13
Inventor: Shihu Li , Xiangda Yan , Yuanzhang Chang , Zhibin Hong , Tianshu Hu , Kun Yao , Junyu Han , Jingtuo Liu , Shengxian Zhu
Abstract: A method, an electronic device and a storage medium for expression driving are disclosed. The method may include: performing facial key point detection on a driven character in a first image to obtain a first facial key point sequence; and performing the following processing for each second image of a plurality of second images obtained successively: performing facial key point detection on a driving character in the second image to obtain a second facial key point sequence; obtaining a difference between the second facial key point sequence and an expressionless key point sequence which has been determined previously according to an analysis on the second facial key point sequence for a previous second image; and performing expression drive rendering on the driven character based on the difference and the first facial key point sequence. The technical solution may enhance flexibility, interactivity, accuracy, etc.
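As a rough illustration of the difference step, the sketch below treats each key point sequence as a list of 2D points and transfers the driving character's displacement from its expressionless pose onto the driven character by simple addition. The additive transfer, the function names, and the point representation are assumptions made for this example; the abstract does not specify how the difference is applied, and rendering is not shown.

```python
from typing import List, Tuple

Point = Tuple[float, float]


def expression_offsets(driving: List[Point], neutral: List[Point]) -> List[Point]:
    """Per-key-point difference between the driving face and its expressionless pose."""
    if len(driving) != len(neutral):
        raise ValueError("key point sequences must have the same length")
    return [(dx - nx, dy - ny) for (dx, dy), (nx, ny) in zip(driving, neutral)]


def apply_offsets(driven: List[Point], offsets: List[Point]) -> List[Point]:
    """Shift the driven character's key points by the offsets (assumed additive)."""
    if len(driven) != len(offsets):
        raise ValueError("key point sequences must have the same length")
    return [(x + ox, y + oy) for (x, y), (ox, oy) in zip(driven, offsets)]


# Toy data: two key points per face.
second_seq = [(1.0, 2.0), (3.0, 5.0)]      # driving character, current frame
neutral_seq = [(1.0, 1.0), (3.0, 3.0)]     # driving character, expressionless
first_seq = [(10.0, 10.0), (20.0, 20.0)]   # driven character

offsets = expression_offsets(second_seq, neutral_seq)
print(apply_offsets(first_seq, offsets))
```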
-
Publication number: US11881044B2
Publication date: 2024-01-23
Application number: US17353540
Application date: 2021-06-21
Inventor: Chengquan Zhang , Mengyi En , Ju Huang , Qunyi Xie , Xiameng Qin , Kun Yao , Junyu Han , Jingtuo Liu , Errui Ding
IPC: G06V30/414 , G06T7/136 , G06T7/11 , G06F18/213 , G06V30/146 , G06V30/18 , G06V10/764 , G06V10/82 , G06V30/10
CPC classification number: G06V30/414 , G06F18/213 , G06T7/11 , G06T7/136 , G06V10/764 , G06V10/82 , G06V30/147 , G06V30/18057 , G06T2207/30176 , G06V30/10
Abstract: A method and apparatus for processing an image, a device and a storage medium are provided. An implementation of the method includes: acquiring a template image, the template image including at least one region of interest; determining a first feature map corresponding to each region of interest in the template image; acquiring a target image; determining a second feature map of the target image; and determining at least one region of interest in the target image according to the first feature map and the second feature map.
-
Publication number: US11908219B2
Publication date: 2024-02-20
Application number: US17244291
Application date: 2021-04-29
Inventor: Zihan Ni , Yipeng Sun , Kun Yao , Junyu Han , Errui Ding , Jingtuo Liu , Haifeng Wang
IPC: G06V30/413 , G06F40/30 , G06V30/414 , G06V10/70
CPC classification number: G06V30/413 , G06F40/30 , G06V10/70 , G06V30/414
Abstract: The disclosure provides a method and a device for processing information, an electronic device, and a storage medium, belonging to a field of artificial intelligence including computer vision, deep learning, and natural language processing. In the method, the computing device recognizes multiple text items in the image. The computing device classifies multiple text items into a first set of name text items and a second set of content text items based on semantics of the text items. The computing device performs a matching operation between the first set and the second set based on a layout of the text items in the image, and determines matched name-content text items. The matched name-content text items include a name text item in the first set and a content text item matching the name text item and in the second set. The computing device outputs the matched name-content text items.
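One simple way to picture the layout-based matching step is a greedy nearest-neighbor pairing between name items and content items. The sketch below assumes each text item has already been classified as a name or a content item and is represented by the center of its bounding box; the `TextItem` type, the Euclidean distance criterion, and the greedy strategy are all illustrative assumptions, not the matching rule claimed in the patent.

```python
from dataclasses import dataclass
from math import hypot
from typing import List, Tuple


@dataclass(frozen=True)
class TextItem:
    text: str
    x: float        # center of the item's bounding box (assumed representation)
    y: float
    is_name: bool   # assumed output of the semantic classification step


def match_items(items: List[TextItem]) -> List[Tuple[str, str]]:
    """Greedily pair each name item with its closest unused content item."""
    names = [i for i in items if i.is_name]
    contents = [i for i in items if not i.is_name]

    # All candidate pairs, closest first.
    candidates = sorted(
        (hypot(n.x - c.x, n.y - c.y), ni, ci)
        for ni, n in enumerate(names)
        for ci, c in enumerate(contents)
    )

    used_names, used_contents = set(), set()
    matched: List[Tuple[str, str]] = []
    for _, ni, ci in candidates:
        if ni in used_names or ci in used_contents:
            continue
        used_names.add(ni)
        used_contents.add(ci)
        matched.append((names[ni].text, contents[ci].text))
    return matched


items = [
    TextItem("Name", 0.0, 0.0, True),
    TextItem("Alice", 10.0, 0.0, False),
    TextItem("Date", 0.0, 20.0, True),
    TextItem("2021", 10.0, 20.0, False),
]
print(match_items(items))
```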
-
Publication number: US11354875B2
Publication date: 2022-06-07
Application number: US17020968
Application date: 2020-09-15
Inventor: Kun Yao , Zhibin Hong , Hanqi Guo , Xusheng Zeng
Abstract: The present disclosure provides a video blending method, apparatus, electronic device and readable storage medium, and relates to computer vision technologies. A specific implementation solution is as follows: obtaining a predicted 3D face mesh of a facial image in each video frame image of the user video according to each video frame image of a user video and each video frame image of a template video; obtaining a predicted texture of the predicted 3D face mesh according to a user texture of a user 3D face mesh of the facial image in each video frame image of the user video and a template texture of a template 3D face mesh of the facial image in each video frame image of the template video; obtaining a rendered facial image of the predicted 3D face mesh according to the predicted 3D face mesh, the predicted texture, a user face posture, and a template face posture; performing blending processing for the rendered facial image and each video frame image of the template video to obtain a blended video frame image after the blending; and performing synthesis processing for the blended video frame image to obtain a blended video.
-
Publication number: US20210271870A1
Publication date: 2021-09-02
Application number: US17244291
Application date: 2021-04-29
Inventor: Zihan Ni , Yipeng Sun , Kun Yao , Junyu Han , Errui Ding , Jingtuo Liu , Haifeng Wang
Abstract: The disclosure provides a method and a device for processing information, an electronic device, and a storage medium, belonging to a field of artificial intelligence including computer vision, deep learning, and natural language processing. In the method, the computing device recognizes multiple text items in the image. The computing device classifies multiple text items into a first set of name text items and a second set of content text items based on semantics of the text items. The computing device performs a matching operation between the first set and the second set based on a layout of the text items in the image, and determines matched name-content text items. The matched name-content text items include a name text item in the first set and a content text item matching the name text item and in the second set. The computing device outputs the matched name-content text items.
-