Patent search ap:("Beijing Baidu Netcom Science AND Technology Co. Page Ltd.") AND inv:"Kun Yao"

1.

发明授权
Method and apparatus for training face fusion model and electronic device 有权

公开(公告)号：US11830288B2

公开(公告)日：2023-11-28

申请号：US17210827

申请日：2021-03-24

Applicant: BEIJING BAIDU NETCOM SCIENCE AND TECHNOLOGY CO., LTD.

Inventor： Kun Yao , Zhibin Hong , Jieting Xue

IPC: G06F18/214 , G06F18/21 , G06F18/24 , G06F18/2413 , G06F18/25 , G06V40/16 , G06N3/047 , G06N20/00 , G06T5/50 , G06T7/73 , G06V10/764 , G06V10/774 , G06V10/80 , G06V10/82 , G06N3/088 , G06N3/045

CPC classification number: G06V40/169 , G06F18/2148 , G06F18/2185 , G06F18/24765 , G06T7/74 , G06V10/764 , G06V10/774 , G06V10/80 , G06V10/82 , G06V40/168 , G06N3/045 , G06N3/088 , G06T2207/20084 , G06T2207/30201

Abstract: Embodiments of the present disclosure provide a method for training a face fusion model and an electronic device. The method includes: performing a first face changing process on a user image and a template image to generate a reference template image; adjusting poses of facial features of the template image based on the reference template image to generate a first input image; performing a second face changing process on the template image to generate a second input image; inputting the first input image and the second input image into a generator of an initial face fusion model to generate a fused face area image; and inputting the fused image and the template image into a discriminator of the initial face fusion model to obtain a result, and performing backpropagation correction on the initial face fusion model based on the result to generate a face fusion model.

2.

发明申请
TEXT RECOGNITION METHOD AND DEVICE, AND ELECTRONIC DEVICE 有权

公开(公告)号：US20210357710A1

公开(公告)日：2021-11-18

申请号：US17352668

申请日：2021-06-21

Applicant: Beijing Baidu Netcom Science and Technology Co., Ltd.

Inventor： Chengquan Zhang , Pengyuan Lv , Kun Yao , Junyu Han , Jingtuo Liu

IPC: G06K9/72 , G06K9/34 , G06K9/00 , G06K9/46 , G06N3/04 , G06N3/08

Abstract: A text recognition method includes: acquiring an image including text information, the text information including M characters, M being a positive integer greater than 1; performing text recognition on the image to acquire character information about the M characters; recognizing reading direction information about each character in accordance with the character information about the M characters, the reading direction information being used to indicate a next character corresponding to a current character in a semantic reading order; and ranking the M characters in accordance with the reading direction information about the M characters to acquire a text recognition result of the text information.

3.

发明授权
Text recognition method and device, and electronic device 有权

公开(公告)号：US11861919B2

公开(公告)日：2024-01-02

申请号：US17352668

申请日：2021-06-21

Applicant: Beijing Baidu Netcom Science and Technology Co., Ltd.

Inventor： Chengquan Zhang , Pengyuan Lv , Kun Yao , Junyu Han , Jingtuo Liu

IPC: G06V20/00 , G06V20/62 , G06N3/08 , G06V30/262 , G06V20/58 , G06V30/148 , G06N3/045 , G06V30/28 , G06V30/10

CPC classification number: G06V20/62 , G06N3/045 , G06N3/08 , G06V20/582 , G06V20/63 , G06V30/153 , G06V30/262 , G06V30/274 , G06V30/10 , G06V30/287 , G06V30/293

Abstract: A text recognition method includes: acquiring an image including text information, the text information including M characters, M being a positive integer greater than 1; performing text recognition on the image to acquire character information about the M characters; recognizing reading direction information about each character in accordance with the character information about the M characters, the reading direction information being used to indicate a next character corresponding to a current character in a semantic reading order; and ranking the M characters in accordance with the reading direction information about the M characters to acquire a text recognition result of the text information.

4.

发明授权
Method, apparatus, electronic device and storage medium for expression driving 有权

公开(公告)号：US11074437B2

公开(公告)日：2021-07-27

申请号：US15930714

申请日：2020-05-13

Applicant: BEIJING BAIDU NETCOM SCIENCE AND TECHNOLOGY CO., LTD.

Inventor： Shihu Li , Xiangda Yan , Yuanzhang Chang , Zhibin Hong , Tianshu Hu , Kun Yao , Junyu Han , Jingtuo Liu , Shengxian Zhu

IPC: G06K9/00 , G06K9/62

Abstract: A method, an electronic device and a storage medium for expression driving are disclosed. The method may include: performing facial key point detection on a driven character in a first image to obtain a first facial key point sequence; performing the following processing for each second image of a plurality of second images obtained successively: performing facial key point detection on a driving character in the second image to obtain a second facial key point sequence; obtaining a difference between the second facial key point sequence and an expressionless key point sequence which has been determined previously according to an analysis on the second facial key point sequence for a previous second image, and performing expression drive rendering on the driven character based on the difference and the first facial key point sequence. The technical solution may enhance flexibility, interactivity, accuracy etc.

5.

发明授权
Method and apparatus for processing image, device and storage medium 有权

公开(公告)号：US11881044B2

公开(公告)日：2024-01-23

申请号：US17353540

申请日：2021-06-21

Applicant: Beijing Baidu Netcom Science and Technology Co., Ltd.

Inventor： Chengquan Zhang , Mengyi En , Ju Huang , Qunyi Xie , Xiameng Qin , Kun Yao , Junyu Han , Jingtuo Liu , Errui Ding

IPC: G06V30/414 , G06T7/136 , G06T7/11 , G06F18/213 , G06V30/146 , G06V30/18 , G06V10/764 , G06V10/82 , G06V30/10

CPC classification number: G06V30/414 , G06F18/213 , G06T7/11 , G06T7/136 , G06V10/764 , G06V10/82 , G06V30/147 , G06V30/18057 , G06T2207/30176 , G06V30/10

Abstract: A method and apparatus for processing an image, a device and a storage medium are provided. An implementation of the method includes: acquiring a template image, the template image including at least one region of interest; determining a first feature map corresponding to each region of interest in the template image; acquiring a target image; determining a second feature map of the target image; and determining at least one region of interest in the target image according to the first feature map and the second feature map.

6.

发明授权
Method and device for processing information, electronic device, and storage medium 有权

公开(公告)号：US11908219B2

公开(公告)日：2024-02-20

申请号：US17244291

申请日：2021-04-29

Applicant: BEIJING BAIDU NETCOM SCIENCE AND TECHNOLOGY CO., LTD.

Inventor： Zihan Ni , Yipeng Sun , Kun Yao , Junyu Han , Errui Ding , Jingtuo Liu , Haifeng Wang

IPC: G06V30/413 , G06F40/30 , G06V30/414 , G06V10/70

CPC classification number: G06V30/413 , G06F40/30 , G06V10/70 , G06V30/414

Abstract: The disclosure provides a method and a device for processing information, an electronic device, and a storage medium, belonging to a field of artificial intelligence including computer vision, deep learning, and natural language processing. In the method, the computing device recognizes multiple text items in the image. The computing device classifies multiple text items into a first set of name text items and a second set of content text items based on semantics of the text items. The computing device performs a matching operation between the first set and the second set based on a layout of the text items in the image, and determines matched name-content text items. The matched name-content text items include a name text item in the first set and a content text item matching the name text item and in the second set. The computing device outputs the matched name-content text items.

7.

发明授权
Video blending method, apparatus, electronic device and readable storage medium 有权

公开(公告)号：US11354875B2

公开(公告)日：2022-06-07

申请号：US17020968

申请日：2020-09-15

Applicant: BEIJING BAIDU NETCOM SCIENCE AND TECHNOLOGY CO., LTD.

Inventor： Kun Yao , Zhibin Hong , Hanqi Guo , Xusheng Zeng

IPC: G06T19/20 , G06T15/04 , G06T17/20

Abstract: The present disclosure provides a video blending method, apparatus, electronic device and readable storage medium, and relates to computer vision technologies. A specific implementation solution is as follows: obtaining a predicted 3D face mesh of a facial image in each video frame images of the user video according to each video frame image of a user video and each video frame image of a template video; obtaining a predicted texture of the predicted 3D face mesh according to a user texture of a user 3D face mesh of the facial image in each video frame image of the user video and a template texture of a template 3D face mesh of the facial image in each video frame image of the template video; obtaining a rendered facial image of the predicted 3D face mesh according to the predicted 3D face mesh, the predicted texture and user face posture, and template face posture; performing blending processing for the rendered facial image and each video frame image of the template video to obtain a blended video frame image after the blending; performing synthesis processing for the blended video frame image to obtain a blended video.

8.

发明申请
METHOD AND DEVICE FOR PROCESSING INFORMATION, ELECTRONIC DEVICE, AND STORAGE MEDIUM 有权

公开(公告)号：US20210271870A1

公开(公告)日：2021-09-02

申请号：US17244291

申请日：2021-04-29

Applicant: BEIJING BAIDU NETCOM SCIENCE AND TECHNOLOGY CO., LTD.

Inventor： Zihan Ni , Yipeng Sun , Kun Yao , Junyu Han , Errui Ding , Jingtuo Liu , Haifeng Wang

IPC: G06K9/00 , G06N3/08 , G06F40/30

Abstract: The disclosure provides a method and a device for processing information, an electronic device, and a storage medium, belonging to a field of artificial intelligence including computer vision, deep learning, and natural language processing. In the method, the computing device recognizes multiple text items in the image. The computing device classifies multiple text items into a first set of name text items and a second set of content text items based on semantics of the text items. The computing device performs a matching operation between the first set and the second set based on a layout of the text items in the image, and determines matched name-content text items. The matched name-content text items include a name text item in the first set and a content text item matching the name text item and in the second set. The computing device outputs the matched name-content text items.

Search Results

Country/Region

Patent validity

Application date

Publication (announcement) day

applicant

The country/region where the applicant is located

Inventor

IPC

IPC Department

IPC class

IPC subclass

IPC group

IPC team

Appearance classification