Patent search ap:("BEIJING BAIDU NETCOM SCIENCE AND TECHNOLOGY CO. LTD.") AND inv:"Tianshu Hu"

1.

Granted invention patent
Method for synthesizing figure of virtual object, electronic device, and storage medium [Valid]

Publication No.: US11645801B2

Publication date: 2023-05-09

Application No.: US17348452

Application date: 2021-06-15

Applicant: BEIJING BAIDU NETCOM SCIENCE AND TECHNOLOGY CO., LTD.

Inventor: Hanqi Guo, Tianshu Hu, Mingming Ma, Zhibin Hong

IPC: G06T17/00, G06T15/04, G06T13/40, G06T7/73, G06T17/20, G06V40/16

CPC classification number: G06T13/40, G06T7/73, G06T15/04, G06T17/20, G06V40/171, G06T2207/20221, G06T2207/30201

Abstract: A method for synthesizing a figure of a virtual object includes: obtaining a figure image of the virtual object, and original face images corresponding to a speech segment; extracting a first face key point of the face of the virtual object, and a second face key point of each of the original face images; processing the first face key point to generate position and posture information of a first three-dimensional (3D) face; processing each second face key point to generate vertex information of a second 3D face; generating a target face image corresponding to each original face image based on the position and posture information of the first 3D face and the vertex information of each second 3D face; and synthesizing a speaking figure segment of the virtual object, corresponding to the speech segment, based on the figure image of the virtual object and each target face image.

2.

Invention application
FACE SUPER-RESOLUTION REALIZATION METHOD AND APPARATUS, ELECTRONIC DEVICE AND STORAGE MEDIUM [Valid]

Publication No.: US20210209732A1

Publication date: 2021-07-08

Application No.: US17208651

Application date: 2021-03-22

Applicant: Beijing Baidu Netcom Science and Technology Co., Ltd.

Inventor: Tianshu Hu, Jiaming Liu, Zhibin Hong

IPC: G06T3/40, G06T7/174, G06K9/00, G06N3/08

Abstract: The present application discloses a face super-resolution realization method and apparatus, an electronic device and a storage medium, and relates to the fields of face image processing and deep learning. The specific implementation solution is as follows: a face part in a first image is extracted; the face part is input into a pre-trained face super-resolution model to obtain a super-sharp face image; a semantic segmentation image corresponding to the super-sharp face image is acquired; and the face part in the first image is replaced with the super-sharp face image, by utilizing the semantic segmentation image, to obtain a face super-resolution image.

3.

Granted invention patent
Face super-resolution realization method and apparatus, electronic device and storage medium [Valid]

Publication No.: US11710215B2

Publication date: 2023-07-25

Application No.: US17208651

Application date: 2021-03-22

Applicant: Beijing Baidu Netcom Science and Technology Co., Ltd.

Inventor: Tianshu Hu, Jiaming Liu, Zhibin Hong

IPC: G06T3/40, G06T7/174, G06N3/08, G06V40/16, G06V10/26

CPC classification number: G06T3/4046, G06N3/08, G06T3/4053, G06T7/174, G06V10/26, G06V40/165, G06V40/171, G06T2207/20081, G06T2207/20084, G06T2207/30201

Abstract: The present application discloses a face super-resolution realization method and apparatus, an electronic device and a storage medium, and relates to the fields of face image processing and deep learning. The specific implementation solution is as follows: a face part in a first image is extracted; the face part is input into a pre-trained face super-resolution model to obtain a super-sharp face image; a semantic segmentation image corresponding to the super-sharp face image is acquired; and the face part in the first image is replaced with the super-sharp face image, by utilizing the semantic segmentation image, to obtain a face super-resolution image.
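
The final replacement step in this abstract can be illustrated with a minimal sketch. It is not the patented implementation: the function name `replace_face`, the binary mask format, and the assumption that the super-sharp face has already been resized to the crop region are all hypothetical, and images are plain nested lists of pixel values.

```python
def replace_face(image, sharp_face, mask, top, left):
    """Paste sharp_face into image at (top, left), only where mask is truthy.

    image, sharp_face and mask are nested lists (rows of pixel values).
    Assumption: sharp_face and mask have the same shape and fit inside image.
    """
    # Copy the rows so the caller's image is left untouched.
    result = [row[:] for row in image]
    for r, mask_row in enumerate(mask):
        for c, is_face in enumerate(mask_row):
            if is_face:
                # Only pixels labeled as face by the segmentation are replaced.
                result[top + r][left + c] = sharp_face[r][c]
    return result


if __name__ == "__main__":
    background = [[0] * 4 for _ in range(3)]
    face = [[9, 9], [9, 9]]
    face_mask = [[1, 0], [1, 1]]
    for row in replace_face(background, face, face_mask, top=1, left=1):
        print(row)
```

Using the segmentation as a mask keeps background pixels inside the crop rectangle from being overwritten, which is presumably why the abstract routes the replacement through the semantic segmentation image.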

4.

Invention application
METHOD AND APPARATUS FOR GENERATING IMAGE, DEVICE AND MEDIUM [Valid]

Publication No.: US20210232932A1

Publication date: 2021-07-29

Application No.: US17201681

Application date: 2021-03-15

Applicant: BEIJING BAIDU NETCOM SCIENCE AND TECHNOLOGY CO., LTD.

Inventor: Jiaming Liu, Tianshu Hu, Shengyi He, Zhibin Hong

IPC: G06N3/08, G06T11/00, G06N3/04

Abstract: A method and an apparatus for generating an image, a device and a medium are provided. The method includes: acquiring a first random vector set; determining an image category to which at least one random vector in the first random vector set belongs, based on a trained classifier; and inputting a random vector belonging to the image category into a trained image generator to generate a virtual image belonging to the image category. Through the technical solution in the present disclosure, a large number of images that are definitely classified can be automatically generated.
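
The three steps in this abstract (sample random vectors, classify them, feed the vectors of one category to a generator) can be sketched as follows. The `classify` and `generate` functions are hypothetical stand-ins for the trained classifier and trained image generator, which the abstract does not specify; only the control flow is meant to be illustrative.

```python
import random


def classify(vector):
    # Hypothetical stand-in for the trained classifier:
    # the category is decided by the sign of the first component.
    return "category_a" if vector[0] >= 0 else "category_b"


def generate(vector, category):
    # Hypothetical stand-in for the trained image generator:
    # returns a record instead of real pixels.
    return {"category": category, "latent": tuple(vector)}


def generate_labeled_images(num_vectors, dim, target_category, seed=0):
    """Sample random vectors, keep those of target_category, and generate from them."""
    rng = random.Random(seed)
    # Step 1: acquire a first random vector set.
    vectors = [[rng.gauss(0.0, 1.0) for _ in range(dim)] for _ in range(num_vectors)]
    # Step 2: determine the category of each vector with the classifier.
    selected = [v for v in vectors if classify(v) == target_category]
    # Step 3: generate one virtual image per selected vector.
    return [generate(v, target_category) for v in selected]


if __name__ == "__main__":
    images = generate_labeled_images(num_vectors=20, dim=4, target_category="category_a")
    print(len(images), "images generated for category_a")
```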

5.

Granted invention patent
Method and apparatus for processing character image data [Valid]

Publication No.: US11670029B2

Publication date: 2023-06-06

Application No.: US17191444

Application date: 2021-03-03

Applicant: BEIJING BAIDU NETCOM SCIENCE AND TECHNOLOGY CO., LTD.

Inventor: Tianshu Hu, Mingming Ma, Shengyi He, Hanqi Guo, Tonghui Li, Zhibin Hong

IPC: G06T13/40, G06N20/00, G06V40/16

CPC classification number: G06T13/40, G06N20/00, G06V40/168, G06V40/176

Abstract: A method and apparatus for processing character image data are provided. A specific embodiment of the method can include: acquiring a video including character images of a first character as first data, and acquiring a video including character images of a second character as second data, the video including character images of the first character including a plurality of types of movement-state images; replacing a head image extracted from the first data with a head image extracted from the second data; replacing a face extracted from the first data with a face extracted from the second data; and replacing the face in the replaced head image with the replaced face to obtain a target head image.

6.

Granted invention patent
Virtual object image display method and apparatus, electronic device and storage medium [Valid]

Publication No.: US11423907B2

Publication date: 2022-08-23

Application No.: US17204771

Application date: 2021-03-17

Applicant: Beijing Baidu Netcom Science and Technology Co., Ltd.

Inventor: Tianshu Hu, Mingming Ma, Tonghui Li, Zhibin Hong

IPC: G10L15/25, G10L15/06, G10L15/08

Abstract: The application provides a virtual object image display method and apparatus, an electronic device and a storage medium, relates to the field of artificial intelligence, in particular to the fields of computer vision and deep learning, and may be applied to virtual object dialogue scenarios. The specific implementation scheme includes: segmenting acquired voice to obtain voice segments; predicting lip shape sequence information for the voice segments; searching for a corresponding lip shape image sequence based on the lip shape sequence information; performing lip fusion between the lip shape image sequence and a virtual object baseplate to obtain a virtual object image; and displaying the virtual object image. The application improves the ability to obtain a virtual object image.

7.

Invention application
METHOD AND APPARATUS FOR DETERMINING SHAPE OF LIPS OF VIRTUAL CHARACTER, DEVICE AND COMPUTER STORAGE MEDIUM [Valid]

Publication No.: US20220084502A1

Publication date: 2022-03-17

Application No.: US17205624

Application date: 2021-03-18

Applicant: Beijing Baidu Netcom Science and Technology Co., Ltd.

Inventor: Mingming Ma, Tianshu Hu, Zhibin Hong

IPC: G10L15/02, G10L15/25, G10L15/06, G10L25/30

Abstract: The present application discloses a method and apparatus for determining the shape of the lips of a virtual character, a device and a computer storage medium, and relates to artificial intelligence technology, particularly to computer vision and deep learning technologies. An implementation includes: determining a phoneme sequence corresponding to a voice, the phoneme sequence including a phoneme corresponding to each time point; determining lip-shape key point information corresponding to each phoneme in the phoneme sequence; searching a pre-established lip shape library according to each piece of determined lip-shape key point information, so as to obtain a lip shape image of each phoneme; and matching the retrieved lip shape image of each phoneme to its time point to obtain a lip-shape image sequence corresponding to the voice. With the present application, the voice may be synchronized with the shapes of the lips in the images.
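
The lookup described in this abstract can be sketched as a nearest-neighbour search over a lip shape library. This is an illustration under stated assumptions, not the patented method: the library format, the phoneme-to-key-point table, the squared Euclidean distance, and the names `nearest_lip_image` and `lip_image_sequence` are all hypothetical.

```python
def nearest_lip_image(key_points, library):
    """Return the library image whose stored key points are closest to key_points."""
    def squared_distance(a, b):
        return sum((x - y) ** 2 for x, y in zip(a, b))

    best = min(library, key=lambda entry: squared_distance(entry["key_points"], key_points))
    return best["image"]


def lip_image_sequence(phoneme_sequence, phoneme_to_key_points, library):
    """Map each (time_point, phoneme) pair to a (time_point, lip image) pair."""
    return [
        (time_point, nearest_lip_image(phoneme_to_key_points[phoneme], library))
        for time_point, phoneme in phoneme_sequence
    ]


if __name__ == "__main__":
    # Hypothetical two-entry library and key point table.
    library = [
        {"key_points": (0.0, 0.0), "image": "closed.png"},
        {"key_points": (1.0, 0.5), "image": "open.png"},
    ]
    phoneme_to_key_points = {"m": (0.1, 0.0), "a": (0.9, 0.6)}
    sequence = [(0.00, "m"), (0.04, "a")]
    print(lip_image_sequence(sequence, phoneme_to_key_points, library))
```

Keeping the time point next to each retrieved image is what lets the resulting image sequence be played back in step with the voice.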

8.

Invention application
METHOD FOR SYNTHESIZING FIGURE OF VIRTUAL OBJECT, ELECTRONIC DEVICE, AND STORAGE MEDIUM [Valid]

Publication No.: US20210312685A1

Publication date: 2021-10-07

Application No.: US17348452

Application date: 2021-06-15

Applicant: BEIJING BAIDU NETCOM SCIENCE AND TECHNOLOGY CO., LTD.

Inventor: Hanqi Guo, Tianshu Hu, Mingming Ma, Zhibin Hong

IPC: G06T13/40, G06K9/00, G06T7/73, G06T17/20, G06T15/04

Abstract: A method for synthesizing a figure of a virtual object includes: obtaining a figure image of the virtual object, and original face images corresponding to a speech segment; extracting a first face key point of the face of the virtual object, and a second face key point of each of the original face images; processing the first face key point to generate position and posture information of a first three-dimensional (3D) face; processing each second face key point to generate vertex information of a second 3D face; generating a target face image corresponding to each original face image based on the position and posture information of the first 3D face and the vertex information of each second 3D face; and synthesizing a speaking figure segment of the virtual object, corresponding to the speech segment, based on the figure image of the virtual object and each target face image.

9.

Granted invention patent
Method, apparatus, electronic device and storage medium for expression driving [Valid]

Publication No.: US11074437B2

Publication date: 2021-07-27

Application No.: US15930714

Application date: 2020-05-13

Applicant: BEIJING BAIDU NETCOM SCIENCE AND TECHNOLOGY CO., LTD.

Inventor: Shihu Li, Xiangda Yan, Yuanzhang Chang, Zhibin Hong, Tianshu Hu, Kun Yao, Junyu Han, Jingtuo Liu, Shengxian Zhu

IPC: G06K9/00, G06K9/62

Abstract: A method, an electronic device and a storage medium for expression driving are disclosed. The method may include: performing facial key point detection on a driven character in a first image to obtain a first facial key point sequence; and performing the following processing for each second image of a plurality of second images obtained successively: performing facial key point detection on a driving character in the second image to obtain a second facial key point sequence; obtaining a difference between the second facial key point sequence and an expressionless key point sequence which has been determined previously according to an analysis of the second facial key point sequence for a previous second image; and performing expression drive rendering on the driven character based on the difference and the first facial key point sequence. The technical solution may enhance flexibility, interactivity, accuracy, etc.
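
The difference step in this abstract can be illustrated with a small sketch. The abstract only says rendering is "based on the difference and the first facial key point sequence"; adding the per-point offset directly to the driven character's key points is an assumption made here for illustration, as is the function name `drive_expression` and the use of 2D `(x, y)` tuples.

```python
def drive_expression(driven_points, driving_points, neutral_points):
    """Shift each driven key point by the driving character's offset from neutral.

    All three arguments are equal-length lists of (x, y) tuples:
    driven_points  - key points of the driven character (first image)
    driving_points - key points of the driving character (current second image)
    neutral_points - expressionless key points of the driving character
    """
    result = []
    for (dx, dy), (sx, sy), (nx, ny) in zip(driven_points, driving_points, neutral_points):
        # Difference between the current expression and the expressionless face.
        offset_x, offset_y = sx - nx, sy - ny
        # Assumed transfer rule: apply the same offset to the driven character.
        result.append((dx + offset_x, dy + offset_y))
    return result


if __name__ == "__main__":
    driven = [(10.0, 10.0), (20.0, 10.0)]
    driving = [(1.0, 2.0), (3.0, 2.5)]
    neutral = [(1.0, 2.0), (3.0, 2.0)]
    print(drive_expression(driven, driving, neutral))
```

A real system would likely normalize for face size and pose before transferring offsets; that step is omitted here because the abstract does not describe it.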

10.

Invention application
METHOD AND DEVICE FOR DIALOGUE WITH VIRTUAL OBJECT, CLIENT END, AND STORAGE MEDIUM [Valid]

Publication No.: US20210201886A1

Publication date: 2021-07-01

Application No.: US17204167

Application date: 2021-03-17

Applicant: Beijing Baidu Netcom Science and Technology Co., Ltd.

Inventor: Tonghui Li, Tianshu Hu, Mingming Ma, Zhibin Hong

IPC: G10L13/04, G06T13/40, G06F40/56, G10L13/08, G10L15/26

Abstract: This application discloses a method and a device for dialogue with a virtual object, a client end and a storage medium. A specific implementation scheme of the method applied to the client end includes: converting a first voice collected by the client end into a first text content, in a case that the client end is in an offline mode; acquiring a second text content responding to the first text content based on offline natural language processing (NLP) and/or a target database pre-stored by the client end; performing voice synthesis on the second text content to acquire a second voice; simulating a lip shape of the second voice by using the virtual object to acquire a target video in which the virtual object says the second voice; and playing the target video.
