-
1.
公开(公告)号:US12087307B2
公开(公告)日:2024-09-10
申请号:US17538604
申请日:2021-11-30
Applicant: SAMSUNG ELECTRONICS CO., LTD.
Inventor: Myungjong Kim , Vijendra Raj Apsingekar , Aviral Anshu , Taeyeon Ki
IPC: G10L17/06 , G10L17/02 , G10L17/18 , G10L21/0272 , G10L21/0308
CPC classification number: G10L17/06 , G10L17/02 , G10L17/18 , G10L21/0272 , G10L21/0308
Abstract: An apparatus for processing speech data may include a processor configured to: separate an input speech into speech signals; identify a bandwidth of each of the speech signals; extract speaker embeddings from the speech signals based on the bandwidth of each of the speech signals, using at least one neural network configured to receive the speech signals and output the speaker embeddings; and cluster the speaker embeddings into one or more speaker clusters, each speaker cluster corresponding to a speaker identity.
-
公开(公告)号:US11580970B2
公开(公告)日:2023-02-14
申请号:US16826713
申请日:2020-03-23
Applicant: Samsung Electronics Co., Ltd.
Inventor: JongHo Shin , Alireza Dirafzoon , Aviral Anshu
Abstract: A method, an electronic device and computer readable medium for dialogue breakdown detection are provided. The method includes obtaining a verbal input from an audio sensor. The method also includes generating a reply to the verbal input. The method additionally includes identifying a local context from the verbal input and a global context from the verbal input, additional verbal inputs previously received by the audio sensor, and previous replies generated in response to the additional verbal inputs. The method further includes identifying a dialogue breakdown in response to determining that the reply does not correspond to the local context and the global context. In addition, the method includes generating sound corresponding to the reply through a speaker when the dialogue breakdown is not identified.
-
公开(公告)号:US20200321002A1
公开(公告)日:2020-10-08
申请号:US16826713
申请日:2020-03-23
Applicant: Samsung Electronics Co., Ltd.
Inventor: JongHo Shin , Alireza Dirafzoon , Aviral Anshu
Abstract: A method, an electronic device and computer readable medium for dialogue breakdown detection are provided. The method includes obtaining a verbal input from an audio sensor. The method also includes generating a reply to the verbal input. The method additionally includes identifying a local context from the verbal input and a global context from the verbal input, additional verbal inputs previously received by the audio sensor, and previous replies generated in response to the additional verbal inputs. The method further includes identifying a dialogue breakdown in response to determining that the reply does not correspond to the local context and the global context. In addition, the method includes generating sound corresponding to the reply through a speaker when the dialogue breakdown is not identified.
-
-