Patent search ap:("BEIJING BAIDU NETCOM SCIENCE AND TECHNOLOGY CO. Page LTD.") AND inv:"Saisai Zou"

1.

发明授权
Control method and control apparatus for speech interaction 有权

公开(公告)号：US11615784B2

公开(公告)日：2023-03-28

申请号：US17118869

申请日：2020-12-11

Applicant: BEIJING BAIDU NETCOM SCIENCE AND TECHNOLOGY CO., LTD.

Inventor： Cong Gao , Saisai Zou , Jinfeng Bai , Lei Jia

IPC: G10L15/00 , G10L15/08 , G10L15/22 , G10L15/02 , G10L15/14

Abstract: The present disclosure discloses a control method and a control apparatus for speech interaction. The detailed implementation solution of the control method for the speech interaction includes: collecting an audio signal; detecting a wake-up word in the audio signal to obtain a wake-up word result; and playing a prompt tone and/or executing a speech instruction in the audio signal based on the wake-up word result.

2.

发明授权
Speech control method, electronic device, and storage medium 有权

公开(公告)号：US11893988B2

公开(公告)日：2024-02-06

申请号：US17357598

申请日：2021-06-24

Applicant: BEIJING BAIDU NETCOM SCIENCE AND TECHNOLOGY CO., LTD.

Inventor： Song Yang , Saisai Zou , Jieyi Cao , Junyao Shao

IPC: G10L15/22 , G06F16/635 , G10L15/05

CPC classification number: G10L15/22 , G06F16/635 , G10L15/05 , G10L2015/223

Abstract: The disclosure provides a speech control method, a speech control apparatus, an electronic device, and a storage medium. The method includes: acquiring target audio data sent by a client, the target audio data including audio data collected by the client within a target duration before wake-up and audio data collected by the client after wake-up; performing speech recognition on the target audio data; and controlling the client based on an instruction recognized from a second audio segment of the target audio data in response to recognizing a wake-up word from a first audio segment at beginning of the target audio data; in which, the second audio segment is later than the first audio segment or has an overlapping portion with the first audio segment.

3.

发明授权
Control method and control apparatus for speech interaction, storage medium and system 有权

公开(公告)号：US11823662B2

公开(公告)日：2023-11-21

申请号：US17158726

申请日：2021-01-26

Applicant: BEIJING BAIDU NETCOM SCIENCE AND TECHNOLOGY CO., LTD.

Inventor： Cong Gao , Saisai Zou , Jinfeng Bai , Lei Jia

IPC: G10L15/00 , G10L15/08 , G10L15/22 , G10L15/02 , G10L15/14

CPC classification number: G10L15/08 , G10L15/22 , G10L15/02 , G10L15/14 , G10L2015/088

Abstract: The present disclosure discloses a control method and a control apparatus for speech interaction. The detailed implementation solution of the control method for the speech interaction includes: collecting an audio signal; detecting a wake-up word in the audio signal to obtain a wake-up word result; and playing a prompt tone and/or executing a speech instruction in the audio signal based on the wake-up word result.

4.

发明授权
Method, apparatus and device for training network and storage medium 有权

公开(公告)号：US11823660B2

公开(公告)日：2023-11-21

申请号：US17353758

申请日：2021-06-21

Applicant: BEIJING BAIDU NETCOM SCIENCE AND TECHNOLOGY CO., LTD.

Inventor： Li Chen , Saisai Zou

IPC: G10L15/06 , G06N3/08 , G10L15/02 , G10L15/16 , G10L15/18 , G10L15/22 , G10L21/02 , G06N3/044

CPC classification number: G10L15/063 , G06N3/044 , G06N3/08 , G10L15/02 , G10L15/16 , G10L15/1815 , G10L15/22 , G10L21/02 , G10L2015/027

Abstract: Embodiments of the present disclosure disclose a method, apparatus and device for training a network, and a storage medium, relate to the field of artificial intelligence technology such as deep learning and speech analysis. A semantic prediction network comprises: an encoder network and at least one decoder network; and a particular solution is: acquiring a first speech feature of a target speech sample; the target speech sample being a synthesized speech sample or a real speech sample, the synthesized speech sample being attached with a sample syllable label and a semantic label comprising a value of the domain, and the real speech sample being attached with a sample syllable label; and jointly training an initial semantic prediction network and a syllable classification network using the first speech feature of the target speech sample, to obtain a trained semantic prediction network.

Patent Agency Ranking