-
Publication No.: US20190206389A1
Publication date: 2019-07-04
Application No.: US15989366
Filing date: 2018-05-25
Applicant: Samsung Electronics Co., Ltd.
Inventor: Ki Soo KWON , Minyoung MUN , SangHyun YOO
CPC classification number: G10L15/075 , G10L15/063 , G10L15/22 , G10L2015/0638 , G10L2015/225 , G10L2015/228
Abstract: A method and apparatus for personalizing a speech recognition model is disclosed. The apparatus may obtain feedback data that is a result of recognizing a first speech input of a user using a trained speech recognition model, determine whether to update the speech recognition model based on the obtained feedback data, and selectively update, dependent on the determining, the speech recognition model based on the feedback data.
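The selective-update flow in this abstract can be sketched roughly as follows. The abstract does not state the criterion used to decide on an update, so this sketch assumes one: feedback counts only when the user corrected the recognized text, and an update fires once a small batch of such feedback has accumulated. The names `Feedback`, `Personalizer`, `user_correction`, and `min_samples` are hypothetical, and the update itself is a placeholder counter rather than real model training.

```python
from dataclasses import dataclass, field
from typing import List, Optional


@dataclass
class Feedback:
    recognized_text: str                   # output of the trained recognition model
    user_correction: Optional[str] = None  # hypothetical signal: text the user fixed


@dataclass
class Personalizer:
    min_samples: int = 2                   # assumed batch size before an update
    pending: List[Feedback] = field(default_factory=list)
    update_count: int = 0

    def is_useful(self, fb: Feedback) -> bool:
        # Assumed criterion: only corrected recognitions carry a training signal.
        return fb.user_correction is not None and fb.user_correction != fb.recognized_text

    def observe(self, fb: Feedback) -> bool:
        """Record feedback; return True only if the model was updated."""
        if not self.is_useful(fb):
            return False
        self.pending.append(fb)
        if len(self.pending) < self.min_samples:
            return False
        self.update_count += 1             # placeholder for real fine-tuning
        self.pending.clear()
        return True
```

A correct recognition is ignored, the first correction is queued, and the second correction triggers the (placeholder) update.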
-
Publication No.: US20190088251A1
Publication date: 2019-03-21
Application No.: US15916512
Filing date: 2018-03-09
Applicant: Samsung Electronics Co., Ltd.
Inventor: Minyoung MUN , SangHyun YOO , Young Sang CHOI , Ki Soo KWON , Hodong LEE
IPC: G10L15/06 , G10L15/08 , G10L15/187 , G10L15/28
Abstract: A speech signal recognition method, apparatus, and system. The speech signal recognition method may include obtaining by or from a terminal an output of a personalization layer, with respect to a speech signal provided by a user of the terminal, having been implemented by input of the speech signal to the personalization layer, the personalization layer being previously trained based on speech features of the user, implementing a global model by providing the obtained output of the personalization layer to the global model, the global model being configured to output a phonemic signal indicating a phoneme included in the speech signal through the global model being previously trained based on speech features common to a plurality of users, and re-training the personalization layer based on the phonemic signal output from the global model, where the personalization layer and the global model collectively represent an acoustic model.
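As a toy illustration of the split described in this abstract, the sketch below puts a small per-user personalization layer in front of a frozen global model and re-trains only the personalization layer from the global model's own phoneme output. Everything concrete here is an assumption made for illustration: the layer is a per-feature scale and bias, the global model is a fixed linear map with softmax, the weights in `GLOBAL_W` are made up, and only the bias is updated by one gradient step on a cross-entropy loss against the predicted phoneme.

```python
import math

# Frozen global model: 3 phonemes x 2 features (illustrative weights only).
GLOBAL_W = [[1.0, 0.0], [0.0, 1.0], [-1.0, -1.0]]


def personalize(x, scale, bias):
    """Per-user layer: element-wise scale and shift of the speech features."""
    return [s * xi + b for xi, s, b in zip(x, scale, bias)]


def global_model(h):
    """Shared model: softmax over phoneme logits."""
    logits = [sum(w * hi for w, hi in zip(row, h)) for row in GLOBAL_W]
    top = max(logits)
    exps = [math.exp(v - top) for v in logits]
    total = sum(exps)
    return [e / total for e in exps]


def retrain_step(x, scale, bias, lr=0.1):
    """One gradient step on the personalization bias; GLOBAL_W stays fixed."""
    probs = global_model(personalize(x, scale, bias))
    target = probs.index(max(probs))       # phoneme emitted by the global model
    err = [p - (1.0 if k == target else 0.0) for k, p in enumerate(probs)]
    grad = [sum(err[k] * GLOBAL_W[k][j] for k in range(len(err)))
            for j in range(len(bias))]
    new_bias = [b - lr * g for b, g in zip(bias, grad)]
    return new_bias, target
```

Keeping the global weights frozen means the per-user state is just the small `scale` and `bias` vectors, which matches the abstract's idea of a terminal-side layer feeding a shared model.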
-
Publication No.: US20240144653A1
Publication date: 2024-05-02
Application No.: US18310075
Filing date: 2023-05-01
Applicant: Samsung Electronics Co., Ltd.
Inventor: Hyunsoo KIM , Seijoon KIM , Minyoung MUN , Seungkeun YOON
IPC: G06V10/77 , G06V10/50 , G06V10/74 , G06V10/774 , G06V10/82
CPC classification number: G06V10/7715 , G06V10/50 , G06V10/761 , G06V10/774 , G06V10/82
Abstract: A processor-implemented method includes: determining distances between an input vector and center vectors comprised in a plurality of output nodes comprised in a trained codebook; and outputting a first feature vector of the input vector based on the distances between the center vectors and the input vector, wherein the trained codebook is trained by: determining a distance between a training input vector and the center vector for each of the output nodes; determining, among the plurality of output nodes, a best matched unit (BMU) in which a distance between the training input vector and the center vector of the BMU is minimized; and training the codebook by updating the center vector of the BMU, based on the distance between the training input vector and the center vector of the BMU.
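The codebook training loop in this abstract resembles classic vector quantization, and a minimal sketch under that reading looks like this. The abstract only says the BMU's center vector is updated "based on the distance"; moving the center a fraction `lr` of the way toward the input is an assumed update rule, and returning the raw distance vector as the feature vector is likewise an assumption. The function names are hypothetical.

```python
import math


def distances(x, centers):
    """Euclidean distance from x to every output node's center vector."""
    return [math.dist(x, c) for c in centers]


def train_step(x, centers, lr=0.5):
    """Find the best matched unit (BMU) and pull its center toward x."""
    d = distances(x, centers)
    bmu = d.index(min(d))                  # node with the smallest distance
    centers[bmu] = [c + lr * (xi - c) for c, xi in zip(centers[bmu], x)]
    return bmu


def encode(x, centers):
    """Feature vector for x: here simply its distances to all centers."""
    return distances(x, centers)
```

For example, with centers `[[0, 0], [10, 10]]` and input `[2, 0]`, node 0 is the BMU and its center moves halfway to the input; the other center is untouched.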
-
Publication No.: US20240078785A1
Publication date: 2024-03-07
Application No.: US18116602
Filing date: 2023-03-02
Applicant: SAMSUNG ELECTRONICS CO., LTD.
Inventor: Minyoung MUN , Seijoon KIM
IPC: G06V10/74 , G06V10/764
CPC classification number: G06V10/761 , G06V10/764
Abstract: A method includes generating an anchor image embedding vector for an anchor image using an image representation model; determining first similarities between the anchor image and negative samples of the anchor image using first image embedding vectors for the negative samples and the generated anchor image embedding vector; determining second similarities between the anchor image and positive samples of the anchor image using second image embedding vectors for the positive samples and the generated anchor image embedding vector; obtaining one of a vector corresponding to a label of the anchor image and third similarities between the label of the anchor image and labels of the negative samples; and determining a loss value for the anchor image based on the determined first similarities, the determined second similarities, and one of the obtained third similarities and a fourth similarity.
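One way to picture how the similarities might combine into a loss is the sketch below. It is not the formula from the application, which the abstract does not give. It assumes cosine similarity between embeddings, an InfoNCE-style contrastive loss, and a weighting in which a negative sample whose label is similar to the anchor's label is penalized less. The function names, the `temperature` parameter, and the `(1 - label_sim)` weighting are all assumptions.

```python
import math


def cosine(a, b):
    """Cosine similarity between two embedding vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)


def label_aware_loss(anchor, positives, negatives, label_sims, temperature=0.1):
    """Contrastive loss where label similarity down-weights each negative.

    label_sims[i] in [0, 1] is the assumed similarity between the anchor's
    label and the label of negatives[i].
    """
    pos_terms = [math.exp(cosine(anchor, p) / temperature) for p in positives]
    neg_sum = sum((1.0 - s) * math.exp(cosine(anchor, n) / temperature)
                  for n, s in zip(negatives, label_sims))
    return -sum(math.log(p / (p + neg_sum)) for p in pos_terms) / len(pos_terms)
```

Under this weighting, a negative with an identical label contributes nothing, so visually different images of closely related classes are not pushed apart as hard as unrelated ones.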
-
Publication No.: US20210287663A1
Publication date: 2021-09-16
Application No.: US17337571
Filing date: 2021-06-03
Applicant: Samsung Electronics Co., Ltd.
Inventor: Ki Soo KWON , Minyoung MUN , SangHyun YOO
Abstract: A method and apparatus for personalizing a speech recognition model is disclosed. The apparatus may obtain feedback data that is a result of recognizing a first speech input of a user using a trained speech recognition model, determine whether to update the speech recognition model based on the obtained feedback data, and selectively update, dependent on the determining, the speech recognition model based on the feedback data.
-
Publication No.: US20180211652A1
Publication date: 2018-07-26
Application No.: US15841528
Filing date: 2017-12-14
Applicant: Samsung Electronics Co., Ltd.
Inventor: Minyoung MUN , Hoshik LEE , Young Sang CHOI
IPC: G10L15/187 , G10L15/14 , G10L15/22 , G10L15/02
Abstract: A speech recognition method includes generating pieces of candidate text data from a speech signal of a user, determining a decoding condition corresponding to an utterance type of the user, and determining target text data among the pieces of candidate text data by performing decoding based on the determined decoding condition.
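A minimal sketch of utterance-type-dependent decoding, under stated assumptions: each candidate carries an acoustic score and a language-model score (log-probabilities), and the "decoding condition" is reduced to a single language-model weight per utterance type. The table `DECODING_CONDITIONS`, its type names, and its values are hypothetical; the abstract does not list the actual conditions.

```python
# Hypothetical decoding conditions keyed by utterance type.
DECODING_CONDITIONS = {
    "read": {"lm_weight": 1.0},            # careful speech: trust the language model
    "conversational": {"lm_weight": 0.3},  # casual speech: trust the acoustics more
}


def decode(candidates, utterance_type):
    """Pick the target text among candidates under the type's decoding condition."""
    cond = DECODING_CONDITIONS[utterance_type]

    def score(c):
        return c["acoustic"] + cond["lm_weight"] * c["lm"]

    return max(candidates, key=score)["text"]
```

With two candidates, one favored by the language model and one by the acoustics, the same candidate list yields different target text depending on the utterance type.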
-
Publication No.: US20180046618A1
Publication date: 2018-02-15
Application No.: US15450333
Filing date: 2017-03-06
Applicant: SAMSUNG ELECTRONICS CO., LTD.
Inventor: Hodong LEE , Youngki PARK , Hwidong NA , Minyoung MUN , Inchul SONG
CPC classification number: G06F17/2836 , G06F16/3344 , G06F17/2854 , G06F17/2863 , G06F17/289 , G06N3/0445 , G06N3/0454 , G06N3/063 , G06N3/08
Abstract: A translation method and apparatus may respectively perform or include: using one or more processors, plural different translation processes, in parallel, for a source sentence in a first language, including encoding, to generate respective feature vectors, the source sentence in each of two or more translation processes of the plural translation processes or the source sentence and a variation of the source sentence in respective translation processes of the plural translation processes, and decoding each of the respective feature vectors to generate respective plural candidate sentences in a second language; and selecting a final sentence in the second language from the respective plural candidate sentences in the second language.
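The overall shape of this method (run several translation processes in parallel, then select one final sentence) can be sketched with toy stand-ins. The encoder and decoder networks are replaced here by hypothetical lexicon-lookup translators that return a candidate sentence and a score; the score (fraction of words found in the lexicon) and the pick-the-highest-score selection rule are assumptions, not the application's method.

```python
from concurrent.futures import ThreadPoolExecutor


def make_translator(lexicon):
    """Build a toy translation process: word lookup plus a coverage score."""
    def translate(sentence):
        words = sentence.split()
        out = [lexicon.get(w, w) for w in words]
        score = sum(w in lexicon for w in words) / max(len(words), 1)
        return " ".join(out), score
    return translate


def translate_parallel(source, translators, variations=()):
    """Run every translator on the source (and any variations) in parallel,
    then select the highest-scoring candidate as the final sentence."""
    jobs = [(t, s) for t in translators for s in (source, *variations)]
    with ThreadPoolExecutor() as pool:
        candidates = list(pool.map(lambda job: job[0](job[1]), jobs))
    best_sentence, _ = max(candidates, key=lambda c: c[1])
    return best_sentence
```

The `variations` argument mirrors the abstract's option of feeding a variation of the source sentence to some of the processes; how such a variation is produced is left open here.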
-
Publication No.: US20160012820A1
Publication date: 2016-01-14
Application No.: US14558479
Filing date: 2014-12-02
Applicant: SAMSUNG ELECTRONICS CO., LTD
Inventor: Minyoung MUN , YoungSang CHOI
CPC classification number: G10L15/32 , G10L15/1822 , G10L2015/223
Abstract: A multilevel speech recognition method and an apparatus performing the method are disclosed. The method includes receiving a first speech command from a user through a speech interface, and extracting a keyword from the first speech command. The method also includes providing a candidate application group of a category providing a service associated with the keyword, and processing a second speech command from the user associated with an application selected from the candidate application group.
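The two-level flow described here can be sketched with plain text standing in for recognized speech. The category table `CATEGORIES`, the application names, and the keyword-matching rule (first word that names a known category) are hypothetical; the abstract does not specify how keywords are extracted or how applications are grouped.

```python
# Hypothetical mapping from service category to candidate applications.
CATEGORIES = {
    "music": ["PlayerA", "PlayerB"],
    "navigation": ["MapsA"],
}


def extract_keyword(command):
    """First level: find a word in the command that names a known category."""
    for word in command.lower().split():
        if word in CATEGORIES:
            return word
    return None


def candidate_apps(first_command):
    """Return the candidate application group for the extracted keyword."""
    return CATEGORIES.get(extract_keyword(first_command), [])


def handle_second_command(selected_app, second_command):
    """Second level: route the follow-up command to the selected application."""
    return f"{selected_app}: {second_command}"
```

A first command such as "Play some music" yields the music group; once the user selects an application from it, the second command is processed in that application's context.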