Patent search ap:("Samsung Electronics Co. Ltd.") AND inv:"Vijendra Raj Apsingekar"

11.

Patent application
AUTOMATIC UPDATING OF AUTOMATIC SPEECH RECOGNITION FOR NAMED ENTITIES (Active)

Publication number: US20250149031A1

Publication date: 2025-05-08

Application number: US18816659

Filing date: 2024-08-27

Applicant: Samsung Electronics Co., Ltd.

Inventor: Aditya Jajodia, Akash Sahoo, Patrick Hegarty, Divya Neelagiri, Vijendra Raj Apsingekar

IPC: G10L15/197, G10L13/02, G10L15/06

Abstract: A method includes identifying, using an automated speech recognition (ASR) system, at least one named entity hypothesis from at least one audio input. The method also can include providing, using the ASR system, the identified at least one named entity to a large language model (LLM). The method also can include generating a prompt using an automated prompt generator. The method also can include processing, using the LLM, the identified at least one named entity hypothesis and the prompt to generate updated named entity recognition data. The method also can include providing the updated named entity recognition data back to the ASR system.
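
The loop described in this abstract (ASR hypotheses, generated prompt, LLM, updated entity data returned to the ASR system) might be sketched as follows. This is only an illustration under stated assumptions: `build_prompt`, `parse_reply`, `refresh_entities`, the `heard => corrected` reply format, and the dictionary used as an ASR lexicon are all hypothetical and do not come from the patent, and `fake_llm` is a stand-in for a real model call.

```python
from typing import Callable, Dict, List


def build_prompt(hypotheses: List[str]) -> str:
    """Hypothetical automated prompt generator: lists the ASR hypotheses."""
    lines = "\n".join(f"- {h}" for h in hypotheses)
    return (
        "For each named entity hypothesis below, reply with one line "
        "in the form 'heard => corrected'.\n" + lines
    )


def parse_reply(reply: str) -> Dict[str, str]:
    """Parse the assumed 'heard => corrected' reply format into a mapping."""
    updates: Dict[str, str] = {}
    for line in reply.splitlines():
        if "=>" in line:
            heard, corrected = line.split("=>", 1)
            updates[heard.strip()] = corrected.strip()
    return updates


def refresh_entities(
    hypotheses: List[str],
    llm: Callable[[str], str],
    asr_lexicon: Dict[str, str],
) -> Dict[str, str]:
    """Send hypotheses to the LLM and merge its corrections into the lexicon."""
    prompt = build_prompt(hypotheses)
    updates = parse_reply(llm(prompt))
    asr_lexicon.update(updates)  # "provide the updated data back to the ASR system"
    return asr_lexicon


def fake_llm(prompt: str) -> str:
    """Stand-in for a real LLM call; always returns one fixed correction."""
    return "jon bon jovy => Jon Bon Jovi"
```

Calling `refresh_entities(["jon bon jovy"], fake_llm, {})` would leave the lexicon holding a single corrected entry. Passing the LLM in as a callable keeps the sketch independent of any particular model API.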

12.

Granted patent
System and method for accent-agnostic frame-level wake word detection (Active)

Publication number: US12272357B2

Publication date: 2025-04-08

Application number: US17929280

Filing date: 2022-09-01

Applicant: Samsung Electronics Co., Ltd.

Inventor: Sivakumar Balasubramanian, Gowtham Srinivasan, Srinivasa Rao Ponakala, Vijendra Raj Apsingekar, Anil Sunder Yadav

IPC: G10L15/22, G10L15/06

Abstract: A method includes accessing, using at least one processor of an electronic device, a machine learning model. The machine learning model is a trained student model that is trained using audio samples in a plurality of accent types. The method also includes receiving, using the at least one processor, an audio input from an audio input device. The method further includes providing, using the at least one processor, the audio input to the trained student model. The method also includes receiving, using the at least one processor, an output from the trained student model including frame-level probabilities associated with the audio input. In addition, the method includes instructing, using the at least one processor, at least one action based on the frame-level probabilities associated with the audio input.
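
The abstract does not say how the frame-level probabilities are turned into an action. One plausible rule, shown here purely as an assumption, is to trigger once the probability stays at or above a threshold for several consecutive frames. The function name `detect_wake_word` and the parameters `threshold` and `min_frames` are hypothetical.

```python
from typing import List, Optional


def detect_wake_word(
    frame_probs: List[float],
    threshold: float = 0.8,
    min_frames: int = 3,
) -> Optional[int]:
    """Return the frame index at which the wake word fires, or None.

    Assumed decision rule (not taken from the patent): fire as soon as
    `min_frames` consecutive frames have probability >= `threshold`.
    """
    run = 0  # length of the current streak of confident frames
    for i, p in enumerate(frame_probs):
        run = run + 1 if p >= threshold else 0
        if run >= min_frames:
            return i
    return None
```

For example, `detect_wake_word([0.1, 0.9, 0.85, 0.2, 0.9, 0.95, 0.9, 0.3])` returns `6`: the first streak is broken at frame 3, and the second streak reaches three frames at index 6. Requiring a streak rather than a single confident frame is one simple way to suppress isolated spikes.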

13.

Granted patent
Method and apparatus for performing speaker diarization on mixed-bandwidth speech signals (Active)

Publication number: US12087307B2

Publication date: 2024-09-10

Application number: US17538604

Filing date: 2021-11-30

Applicant: SAMSUNG ELECTRONICS CO., LTD.

Inventor: Myungjong Kim, Vijendra Raj Apsingekar, Aviral Anshu, Taeyeon Ki

IPC: G10L17/06, G10L17/02, G10L17/18, G10L21/0272, G10L21/0308

CPC classification number: G10L17/06, G10L17/02, G10L17/18, G10L21/0272, G10L21/0308

Abstract: An apparatus for processing speech data may include a processor configured to: separate an input speech into speech signals; identify a bandwidth of each of the speech signals; extract speaker embeddings from the speech signals based on the bandwidth of each of the speech signals, using at least one neural network configured to receive the speech signals and output the speaker embeddings; and cluster the speaker embeddings into one or more speaker clusters, each speaker cluster corresponding to a speaker identity.
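
A rough sketch of the pipeline in this abstract, under several assumptions: bandwidth is inferred from the sample rate (the patent may identify it differently), the neural embedding extractors are replaced by caller-supplied functions, and clustering is a simple greedy cosine-similarity scheme. The names `bandwidth_of`, `cluster`, and `diarize` are hypothetical.

```python
import math
from typing import Callable, Dict, List, Tuple

Vector = List[float]


def bandwidth_of(sample_rate_hz: int) -> str:
    # Assumed rule: 8 kHz or below counts as narrowband, anything else wideband.
    return "narrowband" if sample_rate_hz <= 8000 else "wideband"


def cosine(a: Vector, b: Vector) -> float:
    dot = sum(x * y for x, y in zip(a, b))
    na = math.sqrt(sum(x * x for x in a))
    nb = math.sqrt(sum(y * y for y in b))
    return dot / (na * nb) if na and nb else 0.0


def cluster(embeddings: List[Vector], threshold: float = 0.8) -> List[int]:
    """Greedy clustering: join the most similar centroid if its similarity
    is at least `threshold`, otherwise open a new speaker cluster."""
    centroids: List[Vector] = []
    counts: List[int] = []
    labels: List[int] = []
    for e in embeddings:
        best, best_sim = -1, threshold
        for k, c in enumerate(centroids):
            sim = cosine(e, c)
            if sim >= best_sim:
                best, best_sim = k, sim
        if best < 0:
            centroids.append(list(e))
            counts.append(1)
            best = len(centroids) - 1
        else:
            n = counts[best]
            # Update the centroid as a running mean of its members.
            centroids[best] = [(c * n + x) / (n + 1) for c, x in zip(centroids[best], e)]
            counts[best] = n + 1
        labels.append(best)
    return labels


def diarize(
    signals: List[Tuple[int, Vector]],
    extractors: Dict[str, Callable[[Vector], Vector]],
    threshold: float = 0.8,
) -> List[int]:
    """Pick an extractor per signal based on its bandwidth, then cluster."""
    embeddings = [extractors[bandwidth_of(rate)](samples) for rate, samples in signals]
    return cluster(embeddings, threshold)
```

With identity functions standing in for both extractors, `diarize([(8000, [1.0, 0.0]), (16000, [0.9, 0.1]), (16000, [0.0, 1.0])], {"narrowband": list, "wideband": list})` returns `[0, 0, 1]`: the first two signals land in one speaker cluster and the third opens a second.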

14.

Published application
SYSTEM AND METHOD FOR SPEAKER VERIFICATION FOR VOICE ASSISTANT (Pending, published)

Publication number: US20230419962A1

Publication date: 2023-12-28

Application number: US18047609

Filing date: 2022-10-18

Applicant: Samsung Electronics Co., Ltd.

Inventor: Myungjong Kim, Taeyeon Ki, Cindy Sushen Tseng, Srinivasa Rao Ponakala, Vijendra Raj Apsingekar

IPC: G10L15/22, G10L15/08

CPC classification number: G10L15/22, G10L2015/088, G10L15/08

Abstract: A method includes obtaining audio data and identifying an utterance of a wake word or phrase in the audio data. The method also includes generating an embedding vector based on the utterance from the audio data and accessing a set of previously-generated vectors representing previous utterances of the wake word or phrase. The method further includes performing clustering on the embedding vector and the set of previously-generated vectors to identify a cluster including the embedding vector, where the identified cluster is associated with a speaker. The method also includes updating a speaker vector associated with the speaker based on the embedding vector and determining, using a speaker verification model, a similarity score between the updated speaker vector and the embedding vector. In addition, the method includes determining, based on the similarity score, whether a speaker providing the utterance matches the speaker associated with the identified cluster.
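
The steps in this abstract (assign the new embedding to a cluster, update that speaker's vector, score the embedding against it, decide) could look roughly like the sketch below. It makes several assumptions: nearest-centroid assignment stands in for the clustering step, the speaker vector is the mean of the cluster, and plain cosine similarity replaces the speaker verification model. The name `verify` and the `threshold` value are hypothetical.

```python
import math
from typing import Dict, List, Tuple

Vector = List[float]


def cosine(a: Vector, b: Vector) -> float:
    dot = sum(x * y for x, y in zip(a, b))
    na = math.sqrt(sum(x * x for x in a))
    nb = math.sqrt(sum(y * y for y in b))
    return dot / (na * nb) if na and nb else 0.0


def mean(vectors: List[Vector]) -> Vector:
    n = len(vectors)
    return [sum(col) / n for col in zip(*vectors)]


def verify(
    embedding: Vector,
    clusters: Dict[str, List[Vector]],
    threshold: float = 0.7,
) -> Tuple[str, float, bool]:
    """Return (speaker, similarity score, accepted).

    Note: this mutates `clusters` by appending the new embedding to the
    chosen speaker's list of previous wake-word vectors.
    """
    # 1. Stand-in for clustering: pick the speaker with the closest centroid.
    speaker = max(clusters, key=lambda s: cosine(embedding, mean(clusters[s])))
    # 2. Update the speaker vector with the new embedding.
    clusters[speaker].append(embedding)
    speaker_vector = mean(clusters[speaker])
    # 3. Score the embedding against the updated speaker vector.
    score = cosine(speaker_vector, embedding)
    # 4. Accept if the score clears the (assumed) threshold.
    return speaker, score, score >= threshold
```

With `clusters = {"alice": [[1.0, 0.0], [0.9, 0.1]], "bob": [[0.0, 1.0]]}`, calling `verify([0.95, 0.05], clusters)` picks `"alice"` and accepts, since the new embedding points in almost exactly the same direction as her updated speaker vector.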

15.

Patent application
METHOD AND SYSTEM FOR DEVICE FEATURE ANALYSIS TO IMPROVE USER EXPERIENCE (Active)

Publication number: US20230117535A1

Publication date: 2023-04-20

Application number: US17502838

Filing date: 2021-10-15

Applicant: SAMSUNG ELECTRONICS CO., LTD.

Inventor: Vijendra Raj Apsingekar, Myungjong Kim, Anil Yadav

IPC: G10L15/02, G10L25/30, G06F40/279, G10L15/26

Abstract: A method and system are provided. The method includes receiving an audio input, in response to the audio input being unrecognized by an audio recognition model, identifying contextual information, determining whether the contextual information corresponds to the audio input, and in response to determining that the contextual information corresponds to the audio input, causing training of a neural network associated with the audio recognition model based on the contextual information and the audio input.
