Patent search ap:("Beijing Baidu Netcom Science AND Technology Co. Ltd.") AND inv:"Zhibin Hong"

11.

Invention application
METHOD AND APPARATUS FOR DETERMINING SHAPE OF LIPS OF VIRTUAL CHARACTER, DEVICE AND COMPUTER STORAGE MEDIUM [In force]

Publication No.: US20220084502A1

Publication date: 2022-03-17

Application No.: US17205624

Application date: 2021-03-18

Applicant: Beijing Baidu Netcom Science and Technology Co., Ltd.

Inventor: Mingming Ma, Tianshu Hu, Zhibin Hong

IPC: G10L15/02, G10L15/25, G10L15/06, G10L25/30

Abstract: The present application discloses a method and apparatus for determining the shape of the lips of a virtual character, a device and a computer storage medium, and relates to an artificial intelligence technology, and particularly to computer vision and deep learning technologies. An implementation includes: determining a phoneme sequence corresponding to a voice, the phoneme sequence including a phoneme corresponding to each time point; determining lip-shape key point information corresponding to each phoneme in the phoneme sequence; searching a pre-established lip shape library according to each piece of determined lip-shape key point information, so as to obtain a lip shape image of each phoneme; and corresponding the searched lip shape image of each phoneme with each time point to obtain a lip-shape image sequence corresponding to the voice. With the present application, the voice may be synchronized with the shapes of the lips in the images.
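
The lookup described in this abstract (phoneme per time point, then lip-shape key points, then a library search) can be illustrated with a minimal Python sketch. Everything concrete here is an assumption made for illustration and is not taken from the patent: the table `PHONEME_TO_KEYPOINTS`, the library `LIP_SHAPE_LIBRARY`, the image names, and the use of a nearest-neighbour search by Euclidean distance as the "search" step.

```python
from math import dist

# Hypothetical mapping from phoneme to flattened lip key point coordinates.
PHONEME_TO_KEYPOINTS = {
    "a": (0.0, 1.0, 0.0, -1.0),
    "m": (0.0, 0.1, 0.0, -0.1),
    "o": (0.0, 0.6, 0.0, -0.6),
}

# Hypothetical pre-established lip shape library: (key points, image name).
LIP_SHAPE_LIBRARY = [
    ((0.0, 0.9, 0.0, -0.9), "open_wide.png"),
    ((0.0, 0.0, 0.0, 0.0), "closed.png"),
    ((0.0, 0.5, 0.0, -0.5), "rounded.png"),
]


def nearest_lip_image(keypoints):
    """Return the library image whose key points are closest (assumed metric)."""
    _, image = min(LIP_SHAPE_LIBRARY, key=lambda entry: dist(entry[0], keypoints))
    return image


def lip_image_sequence(phonemes_by_time):
    """Map a list of (time_point, phoneme) pairs to (time_point, image) pairs."""
    return [
        (time_point, nearest_lip_image(PHONEME_TO_KEYPOINTS[phoneme]))
        for time_point, phoneme in phonemes_by_time
    ]


if __name__ == "__main__":
    for time_point, image in lip_image_sequence([(0.00, "m"), (0.04, "a"), (0.08, "o")]):
        print(time_point, image)
```

Keeping the time point alongside each retrieved image is what lets the resulting image sequence stay aligned with the voice.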

12.

Invention application
METHOD FOR SYNTHESIZING FIGURE OF VIRTUAL OBJECT, ELECTRONIC DEVICE, AND STORAGE MEDIUM [In force]

Publication No.: US20210312685A1

Publication date: 2021-10-07

Application No.: US17348452

Application date: 2021-06-15

Applicant: BEIJING BAIDU NETCOM SCIENCE AND TECHNOLOGY CO., LTD.

Inventor: Hanqi Guo, Tianshu Hu, Mingming Ma, Zhibin Hong

IPC: G06T13/40, G06K9/00, G06T7/73, G06T17/20, G06T15/04

Abstract: A method for synthesizing a figure of a virtual object includes: obtaining a figure image of the virtual object, and original face images corresponding to a speech segment; extracting a first face key point of the face of the virtual object, and a second face key point of each of the original face images; processing the first face key point to generate position and posture information of a first three-dimensional (3D) face; processing each second face key point to generate vertex information of a second 3D face; generating a target face image corresponding to each original face image based on the position and posture information of the first 3D face and the vertex information of each second 3D face; and synthesizing a speaking figure segment of the virtual object, corresponding to the speech segment, based on the figure image of the virtual object and each target face image.

13.

Granted invention patent
Method, apparatus, electronic device and storage medium for expression driving [In force]

Publication No.: US11074437B2

Publication date: 2021-07-27

Application No.: US15930714

Application date: 2020-05-13

Applicant: BEIJING BAIDU NETCOM SCIENCE AND TECHNOLOGY CO., LTD.

Inventor: Shihu Li, Xiangda Yan, Yuanzhang Chang, Zhibin Hong, Tianshu Hu, Kun Yao, Junyu Han, Jingtuo Liu, Shengxian Zhu

IPC: G06K9/00, G06K9/62

Abstract: A method, an electronic device and a storage medium for expression driving are disclosed. The method may include: performing facial key point detection on a driven character in a first image to obtain a first facial key point sequence; performing the following processing for each second image of a plurality of second images obtained successively: performing facial key point detection on a driving character in the second image to obtain a second facial key point sequence; obtaining a difference between the second facial key point sequence and an expressionless key point sequence which has been determined previously according to an analysis on the second facial key point sequence for a previous second image, and performing expression drive rendering on the driven character based on the difference and the first facial key point sequence. The technical solution may enhance flexibility, interactivity, accuracy etc.
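
The core arithmetic of this abstract (take the difference between the driving character's key points and its expressionless key points, then apply it to the driven character) can be sketched in Python as below. This is only an illustration under stated assumptions: the function name `drive_keypoints` is hypothetical, key points are assumed to be matching lists of 2D `(x, y)` pairs, and adding the difference directly to the driven character's key points is an assumed way of combining the two. The abstract does not specify how the expressionless sequence is estimated or how rendering is performed, so neither is covered.

```python
def drive_keypoints(driven, driving, neutral):
    """Offset the driven character's key points by the driving expression.

    driven:  key points of the driven character (first image)
    driving: key points of the driving character (current second image)
    neutral: expressionless key points of the driving character
    All three are lists of (x, y) pairs of equal length (an assumption).
    """
    if not (len(driven) == len(driving) == len(neutral)):
        raise ValueError("key point sequences must have the same length")
    return [
        (dx + (sx - nx), dy + (sy - ny))
        for (dx, dy), (sx, sy), (nx, ny) in zip(driven, driving, neutral)
    ]


if __name__ == "__main__":
    driven = [(10.0, 20.0), (30.0, 40.0)]
    driving = [(1.0, 2.5), (3.0, 4.0)]
    neutral = [(1.0, 2.0), (3.0, 4.0)]
    print(drive_keypoints(driven, driving, neutral))
```

Working with a difference from the expressionless state, rather than with the driving key points themselves, keeps the driving character's face shape out of the result, so only the expression is transferred.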

14.

Invention application
METHOD AND DEVICE FOR DIALOGUE WITH VIRTUAL OBJECT, CLIENT END, AND STORAGE MEDIUM [In force]

Publication No.: US20210201886A1

Publication date: 2021-07-01

Application No.: US17204167

Application date: 2021-03-17

Applicant: Beijing Baidu Netcom Science and Technology Co., Ltd.

Inventor: Tonghui Li, Tianshu Hu, Mingming Ma, Zhibin Hong

IPC: G10L13/04, G06T13/40, G06F40/56, G10L13/08, G10L15/26

Abstract: This application discloses a method and a device for dialogue with a virtual object, a client end and a storage medium. A specific implementation scheme of the method applied to the client end includes: converting a first voice collected by the client end into a first text content, in a case that the client end is in an offline mode; acquiring a second text content responding to the first text content based on offline natural language processing (NLP) and/or a target database pre-stored by the client end; performing voice synthesis on the second text content to acquire a second voice; simulating a lip shape of the second voice by using the virtual object to acquire a target video in which the virtual object says the second voice; and playing the target video.
