-
公开(公告)号:US10825445B2
公开(公告)日:2020-11-03
申请号:US15819924
申请日:2017-11-21
Applicant: SAMSUNG ELECTRONICS CO., LTD.
Inventor: Inchul Song , Sang Hyun Yoo
IPC: G10L15/06 , G10L15/16 , G10L25/30 , G10L15/187
Abstract: A training method of an acoustic model includes constructing window-level input speech data based on a speech sequence; inputting the window-level input speech data to an acoustic model; calculating a sequence level-error based on an output of the acoustic model; acquiring window-level errors based on the sequence level-error; and updating the acoustic model based on the window-level errors.
-
公开(公告)号:US10902216B2
公开(公告)日:2021-01-26
申请号:US15450333
申请日:2017-03-06
Applicant: SAMSUNG ELECTRONICS CO., LTD.
Inventor: Hodong Lee , Youngki Park , Hwidong Na , Minyoung Mun , Inchul Song
Abstract: A translation method and apparatus may respectively perform or include: using one or more processors, plural different translation processes, in parallel, for a source sentence in a first language, including encoding, to generate respective feature vectors, the source sentence in each of two or more translation processes of the plural translation processes or the source sentence and a variation of the source sentence in respective translation processes of the plural translation processes, and decoding each of the respective feature vectors to generate respective plural candidate sentences in a second language; and selecting a final sentence in the second language from the respective plural candidate sentences in the second language.
-
3.
公开(公告)号:US10714077B2
公开(公告)日:2020-07-14
申请号:US15187428
申请日:2016-06-20
Applicant: Samsung Electronics Co., Ltd.
Inventor: Inchul Song , Young Sang Choi
Abstract: An apparatus for calculating acoustic score, a method of calculating acoustic score, an apparatus for speech recognition, a method of speech recognition, and an electronic device including the same are provided. An apparatus for calculating acoustic score includes a preprocessor configured to sequentially extract audio frames into windows and a score calculator configured to calculate an acoustic score of a window by using a deep neural network (DNN)-based acoustic model.
-
公开(公告)号:US10599781B2
公开(公告)日:2020-03-24
申请号:US15254347
申请日:2016-09-01
Applicant: Samsung Electronics Co., Ltd.
Inventor: Hwidong Na , Inchul Song , Hoshik Lee
Abstract: An apparatus and method for evaluating quality of an automatic translation is disclosed. An apparatus for evaluating quality of automatic translation includes a converter which converts an automatic translation and a reference translation of an original text to a first distributed representation and a second distributed representation, respectively, using a distributed representation model and a quality evaluator which evaluates quality of automatic translation data based on similarity between the first distributed representation and the second distributed representation.
-
公开(公告)号:US11282501B2
公开(公告)日:2022-03-22
申请号:US16656700
申请日:2019-10-18
Applicant: Samsung Electronics Co., Ltd. , Universite De Montreal
Inventor: Sanghyun Yoo , Yoshua Bengio , Inchul Song
Abstract: A speech recognition method and apparatus, including implementation and/or training, are disclosed. The speech recognition method includes obtaining a speech signal, and performing a recognition of the speech signal, including generating a dialect parameter, for the speech signal, from input dialect data using a parameter generation model, applying the dialect parameter to a trained speech recognition model to generate a dialect speech recognition model, and generating a speech recognition result from the speech signal by implementing, with respect to the speech signal, the dialect speech recognition model. The speech recognition method and apparatus may perform speech recognition and/or training of the speech recognition model and the parameter generation model.
-
公开(公告)号:US10957308B2
公开(公告)日:2021-03-23
申请号:US16118807
申请日:2018-08-31
Applicant: Samsung Electronics Co., Ltd.
Inventor: Ki Soo Kwon , Inchul Song , YoungSang Choi
Abstract: Provided is a method and device to personalize a speech recognition model, the device that personalizes a speech recognition model by identifying a language group corresponding to a user, and generating a personalized speech recognition model by applying a group scale matrix corresponding to the identified language group to at least a layer of a speech recognition model.
-
公开(公告)号:US11348572B2
公开(公告)日:2022-05-31
申请号:US16038343
申请日:2018-07-18
Applicant: Samsung Electronics Co., Ltd. , Universite de Montreal
Inventor: Inchul Song , Junyoung Chung , Taesup Kim , Sanghyun Yoo
IPC: G10L15/16 , G10L15/08 , G06N3/08 , G10L15/06 , G10L15/187
Abstract: A speech recognition method includes obtaining an acoustic sequence divided into a plurality of frames, and determining pronunciations in the acoustic sequence by predicting a duration of a same pronunciation in the acoustic sequence and skipping a pronunciation prediction for a frame corresponding to the duration.
-
公开(公告)号:US10930268B2
公开(公告)日:2021-02-23
申请号:US16244397
申请日:2019-01-10
Applicant: Samsung Electronics Co., Ltd.
Inventor: Sang Hyun Yoo , Minyoung Mun , Inchul Song
Abstract: Disclosed is a speech recognition method and apparatus, wherein the apparatus acquires first outputs from sub-models in a recognition model based on a speech signal, acquires a second output including values corresponding to the sub-models from a classification model based on the speech signal, and recognizes the speech signal based on the first outputs and the second output.
-
公开(公告)号:US10529319B2
公开(公告)日:2020-01-07
申请号:US15855456
申请日:2017-12-27
Applicant: SAMSUNG ELECTRONICS CO., LTD.
Inventor: Inchul Song , Sang Hyun Yoo
IPC: G10L15/16 , G10L15/02 , G10L15/22 , G06N3/02 , G10L15/07 , G06N3/04 , G06N3/08 , G10L15/14 , G06N7/00
Abstract: A user adaptive speech recognition method and apparatus are provided. A speech recognition method includes extracting an identity vector representing an individual characteristic of a user from speech data, implementing a sub-neural network by inputting a sub-input vector including at least the identity vector to the sub-neural network, determining a scaling factor based on a result of the implementing of the sub-neural network, implementing a main neural network, configured to perform a recognition operation, by applying the determined scaling factor to the main neural network and inputting the speech data to the main neural network to which the determined scaling factor is applied, and indicating a recognition result of the implementation of the main neural network.
-
-
-
-
-
-
-
-