-
Publication No.: US11615784B2
Publication date: 2023-03-28
Application No.: US17118869
Application date: 2020-12-11
Inventor: Cong Gao, Saisai Zou, Jinfeng Bai, Lei Jia
Abstract: The present disclosure provides a control method and a control apparatus for speech interaction. The control method includes: collecting an audio signal; detecting a wake-up word in the audio signal to obtain a wake-up word result; and, based on the wake-up word result, playing a prompt tone and/or executing a speech instruction in the audio signal.
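The abstract does not say how the wake-up word result maps to the two possible actions. The following is only an illustrative sketch under stated assumptions: a text transcript stands in for the audio signal, and `WakeResult`, `detect_wake_word`, and `decide_actions` are hypothetical names that do not come from the patent. The policy shown (execute a trailing instruction if one is present, otherwise play a prompt tone) is likewise an assumption.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class WakeResult:
    """Hypothetical wake-up word result: detection flag plus any trailing instruction."""
    detected: bool
    instruction: Optional[str]


def detect_wake_word(transcript: str, wake_word: str) -> WakeResult:
    """Check whether the transcript starts with the wake-up word.

    Assumption: a text transcript stands in for the collected audio signal.
    """
    text = transcript.strip().lower()
    prefix = wake_word.lower()
    if not text.startswith(prefix):
        return WakeResult(detected=False, instruction=None)
    # Whatever follows the wake-up word is treated as the speech instruction.
    rest = text[len(prefix):].strip(" ,.")
    return WakeResult(detected=True, instruction=rest or None)


def decide_actions(result: WakeResult) -> List[str]:
    """Assumed policy: execute a trailing instruction, else play a prompt tone."""
    if not result.detected:
        return []
    if result.instruction:
        return ["execute:" + result.instruction]
    return ["play_prompt_tone"]
```

For example, `decide_actions(detect_wake_word("Hello device, play music", "hello device"))` yields a single execute action, while the wake-up word alone yields only the prompt tone.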
-
Publication No.: US11893988B2
Publication date: 2024-02-06
Application No.: US17357598
Application date: 2021-06-24
Inventor: Song Yang, Saisai Zou, Jieyi Cao, Junyao Shao
IPC: G10L15/22, G06F16/635, G10L15/05
CPC classification number: G10L15/22, G06F16/635, G10L15/05, G10L2015/223
Abstract: The disclosure provides a speech control method, a speech control apparatus, an electronic device, and a storage medium. The method includes: acquiring target audio data sent by a client, the target audio data including audio data collected by the client within a target duration before wake-up and audio data collected by the client after wake-up; performing speech recognition on the target audio data; and, in response to recognizing a wake-up word from a first audio segment at the beginning of the target audio data, controlling the client based on an instruction recognized from a second audio segment of the target audio data, where the second audio segment is later than the first audio segment or overlaps with it.
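The two halves of this method can be sketched as follows: the client retains a bounded window of audio from before wake-up, and the recognizer side checks for the wake-up word at the start of the combined audio before acting on what follows. This is a rough illustration, not the patented implementation. `PreWakeBuffer` and `extract_instruction` are hypothetical names, the target duration is expressed as a frame count, a list of recognized words stands in for the speech recognition output, and only the case where the second segment strictly follows the first is modeled (the overlapping case is not).

```python
from collections import deque
from typing import List, Optional, Sequence


class PreWakeBuffer:
    """Client side: keep only the most recent frames collected before wake-up."""

    def __init__(self, max_frames: int) -> None:
        # max_frames stands in for the "target duration" (an assumption).
        self._frames = deque(maxlen=max_frames)

    def push(self, frame) -> None:
        # Older frames fall off automatically once the buffer is full.
        self._frames.append(frame)

    def build_target_audio(self, post_wake_frames: Sequence) -> list:
        # Target audio = retained pre-wake frames followed by post-wake frames.
        return list(self._frames) + list(post_wake_frames)


def extract_instruction(words: List[str], wake_word: List[str]) -> Optional[str]:
    """Recognizer side: return the instruction only if the wake-up word comes first.

    `words` is the recognized word sequence of the target audio, in order.
    """
    n = len(wake_word)
    head = [w.lower() for w in words[:n]]
    if head != [w.lower() for w in wake_word]:
        return None  # no wake-up word in the first segment: do not control the client
    return " ".join(words[n:]) or None
```

With a three-frame buffer that has seen frames 1 through 5, `build_target_audio([6, 7])` returns frames 3 through 7, so the audio sent for recognition still contains the start of an utterance that began before wake-up was signaled.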
-
Publication No.: US11823662B2
Publication date: 2023-11-21
Application No.: US17158726
Application date: 2021-01-26
Inventor: Cong Gao, Saisai Zou, Jinfeng Bai, Lei Jia
CPC classification number: G10L15/08, G10L15/22, G10L15/02, G10L15/14, G10L2015/088
Abstract: The present disclosure provides a control method and a control apparatus for speech interaction. The control method includes: collecting an audio signal; detecting a wake-up word in the audio signal to obtain a wake-up word result; and, based on the wake-up word result, playing a prompt tone and/or executing a speech instruction in the audio signal.
-
Publication No.: US11823660B2
Publication date: 2023-11-21
Application No.: US17353758
Application date: 2021-06-21
Inventor: Li Chen, Saisai Zou
CPC classification number: G10L15/063, G06N3/044, G06N3/08, G10L15/02, G10L15/16, G10L15/1815, G10L15/22, G10L21/02, G10L2015/027
Abstract: Embodiments of the present disclosure provide a method, apparatus, and device for training a network, and a storage medium, and relate to the field of artificial intelligence technology such as deep learning and speech analysis. A semantic prediction network comprises an encoder network and at least one decoder network. A particular solution is: acquiring a first speech feature of a target speech sample, the target speech sample being a synthesized speech sample or a real speech sample, the synthesized speech sample being attached with a sample syllable label and a semantic label comprising a value of the domain, and the real speech sample being attached with a sample syllable label; and jointly training an initial semantic prediction network and a syllable classification network using the first speech feature of the target speech sample, to obtain a trained semantic prediction network.
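The abstract does not state how the two networks' objectives are combined. One plausible reading, shown here purely as an assumption, is a summed loss in which every sample contributes a syllable term and only samples carrying a semantic label (the synthesized ones) contribute a semantic term. `joint_loss`, `cross_entropy`, and `semantic_weight` are hypothetical names, and plain probability lists stand in for network outputs.

```python
import math
from typing import Optional, Sequence


def cross_entropy(probs: Sequence[float], label: int) -> float:
    """Negative log-probability assigned to the correct class."""
    return -math.log(probs[label])


def joint_loss(
    syllable_probs: Sequence[float],
    syllable_label: int,
    semantic_probs: Optional[Sequence[float]] = None,
    semantic_label: Optional[int] = None,
    semantic_weight: float = 1.0,  # hypothetical weighting, not from the patent
) -> float:
    """Assumed joint objective for one sample.

    Real samples carry only a syllable label, so they contribute only the
    syllable term; synthesized samples carry both labels and contribute both.
    """
    loss = cross_entropy(syllable_probs, syllable_label)
    if semantic_label is not None and semantic_probs is not None:
        loss += semantic_weight * cross_entropy(semantic_probs, semantic_label)
    return loss
```

Under this reading, real speech still shapes whatever the two networks share through the syllable term, even though it has no semantic annotation.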
-