-
Publication number: US20250095666A1
Publication date: 2025-03-20
Application number: US18884978
Application date: 2024-09-13
Applicant: SAMSUNG ELECTRONICS CO., LTD.
Inventor: Ching-Hua LEE , Chouchang YANG , Rakshith Sharma SRINIVASA , Yashas Malur SAIDUTTA , Jaejin CHO , Yilin SHEN , Hongxia JIN
IPC: G10L21/0208 , G10L15/06 , G10L21/0216
Abstract: A method for generating a customized speech enhancement (SE) model includes obtaining noisy-clean speech data from a source domain; obtaining noisy speech data from a target domain; obtaining raw speech data; using the noisy-clean speech data, the noisy speech data, and the raw speech data, training the customized SE model based on at least one of self-supervised representation-based adaptation (SSRA), ensemble mapping, or self-supervised adaptation loss; generating the customized SE model by denoising the noisy speech data using the trained customized SE model; and providing the customized SE model to a user device to use the denoised noisy speech data.
-
Publication number: US20240414448A1
Publication date: 2024-12-12
Application number: US18515732
Application date: 2023-11-21
Applicant: SAMSUNG ELECTRONICS CO., LTD.
Inventor: Karim AHMED , Yi WEI , Vasili RAMANISHKA , Yilin SHEN , Hongxia JIN
IPC: H04N23/84 , G06T3/4015 , G06T5/60 , G06T5/70
Abstract: Provided is a U-shaped network for image restoration. The U-shaped network is lightweight, is based on a transformer block, and is suitable for on-device deployment, such as in a smartphone. The U-shaped network uses the transformer block to implement encoder, decoder, and bottleneck functions. Decoders are connected to respective encoders using skip connections based on element-wise addition, which avoids the dimension expansion of concatenation. The transformer block uses a configuration of scaling and pool mixing to process input image data without the need for self-attention computations, which permits reductions in memory, latency, and computational demand while maintaining good output image quality.
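The contrast between addition-based and concatenation-based skip connections can be illustrated with a minimal Python sketch. It treats a feature as a flat list of per-channel values; the function names `skip_add` and `skip_concat` are hypothetical and are not taken from the patent.

```python
def skip_add(decoder_feat, encoder_feat):
    """Fuse two features by element-wise addition; the channel count is unchanged."""
    if len(decoder_feat) != len(encoder_feat):
        raise ValueError("features must have the same number of channels")
    return [d + e for d, e in zip(decoder_feat, encoder_feat)]


def skip_concat(decoder_feat, encoder_feat):
    """Fuse two features by concatenation; the channel count is the sum of both."""
    return decoder_feat + encoder_feat


# Illustrative values only (hypothetical 4-channel features).
dec = [0.5, -1.0, 2.0, 0.0]
enc = [1.0, 1.0, -2.0, 3.0]

added = skip_add(dec, enc)            # still 4 channels
concatenated = skip_concat(dec, enc)  # 8 channels

print(len(added), len(concatenated))
```

With addition, the layer that follows the skip connection sees the same channel count as the decoder input, so it needs no extra weights to absorb a wider tensor.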
-
Publication number: US20220398459A1
Publication date: 2022-12-15
Application number: US17835457
Application date: 2022-06-08
Applicant: SAMSUNG ELECTRONICS CO., LTD.
Inventor: Yen-Chang HSU , Yilin SHEN , Hongxia JIN
Abstract: A method of training a student model includes providing an input to a teacher model that is larger than the student model, where a layer of the teacher model outputs a first output vector; providing the input to the student model, where a layer of the student model outputs a second output vector; determining an importance value associated with each dimension of the first output vector based on gradients from the teacher model; and updating at least one parameter of the student model to minimize a difference between the second output vector and the first output vector based on the importance values.
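One possible reading of the importance-weighted difference is sketched below in Python. Several points are assumptions rather than details from the abstract: the teacher and student vectors are taken to have the same dimensionality, importance is derived from normalized absolute gradient magnitudes, and the difference is a weighted squared error. The names `importance_from_gradients` and `weighted_distill_loss` are hypothetical.

```python
def importance_from_gradients(grads):
    """Assumed scheme: normalize absolute gradient magnitudes to sum to 1."""
    mags = [abs(g) for g in grads]
    total = sum(mags)
    if total == 0:
        # No gradient signal: fall back to uniform importance.
        return [1.0 / len(grads)] * len(grads)
    return [m / total for m in mags]


def weighted_distill_loss(teacher_vec, student_vec, importance):
    """Importance-weighted squared error between student and teacher outputs."""
    return sum(
        w * (s - t) ** 2
        for w, s, t in zip(importance, student_vec, teacher_vec)
    )


# Illustrative values only.
teacher_out = [1.0, 2.0, 3.0]     # first output vector (teacher layer)
student_out = [1.0, 1.0, 5.0]     # second output vector (student layer)
teacher_grads = [0.0, 3.0, -1.0]  # gradients w.r.t. each teacher dimension

importance = importance_from_gradients(teacher_grads)
loss = weighted_distill_loss(teacher_out, student_out, importance)
print(importance, loss)
```

In a real training loop the student parameters would be updated by gradient descent on this loss; the sketch only shows how the importance values reweight each dimension's error.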
-
Publication number: US20250094820A1
Publication date: 2025-03-20
Application number: US18824166
Application date: 2024-09-04
Applicant: SAMSUNG ELECTRONICS CO., LTD.
Inventor: Sudipta PAUL , Lingyu ZHANG , Yilin SHEN , Hongxia JIN
IPC: G06N3/096
Abstract: A method for enabling an improved device control capability of a small language model (SLM) transferable to a hub device configured to be operable by a user in an environment is disclosed. The method includes fine-tuning the SLM based on a data set including base plans and contrastive plans; generating computer codes corresponding to the fine-tuned SLM; and transferring the generated computer codes to the hub device to be connected with a group of electronic devices in the environment.
-
Publication number: US20220121947A1
Publication date: 2022-04-21
Application number: US17465439
Application date: 2021-09-02
Applicant: SAMSUNG ELECTRONICS CO., LTD.
Inventor: Qian LOU , Yilin SHEN , Hongxia JIN
Abstract: A method of a server device is provided. The method includes retrieving a prediction input and a prediction setting, replacing at least one non-linear activation channel in a neural network with at least one replacement channel based on the received prediction setting, generating a prediction based on the received prediction input using the neural network with the at least one replacement channel, and outputting the generated prediction.
-
Publication number: US20250166339A1
Publication date: 2025-05-22
Application number: US18955003
Application date: 2024-11-21
Applicant: SAMSUNG ELECTRONICS CO., LTD.
Inventor: Wenbo LI , Yilin SHEN , Hongxia JIN
Abstract: A method includes generating a pair of features of a current frame by inputting the current frame into a first cross-collaborative consistency learning (C3L) model, the current frame comprising a red-green-blue (RGB) image and a thermal image; generating a pair of denoised features by inputting the pair of features of the current frame and one or more pairs of features of past frames into a denoised memory read (DMR) model; generating an updated pair of denoised features by inputting the pair of denoised features into a second C3L model, the updated pair of denoised features comprising an updated RGB image feature and an updated thermal feature; and generating a segmentation mask by inputting the updated pair of features into a segmentation head.
-
Publication number: US20240119077A1
Publication date: 2024-04-11
Application number: US18368353
Application date: 2023-09-14
Applicant: SAMSUNG ELECTRONICS CO., LTD.
Inventor: Shangqian GAO , Burak UZKENT , Yilin SHEN , Hongxia JIN
IPC: G06F16/33 , G06F16/583 , G06N3/0985
CPC classification number: G06F16/334 , G06F16/5846 , G06N3/0985
Abstract: A method of performing a multimodal task by using a multimodal model that includes a text encoder and a vision encoder may include obtaining a text feature from a query via the text encoder; obtaining an image feature from one or more input images via the vision encoder; and outputting a response to the query based on similarity between the text feature and the image feature, wherein weight vectors of the text encoder and the vision encoder are pruned and shared according to a sharing vector and a pruning vector that are generated by a hypernetwork, and wherein the hypernetwork and the multimodal model are jointly trained to minimize at least one of a difference between the weight vectors in the text encoder and the vision encoder, a difference between the weight vectors in different layers of the text encoder, and a number of parameters in the multimodal model.
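The similarity-based response step can be sketched as follows. The abstract does not name a similarity measure, so cosine similarity is an assumption here, and the feature vectors are made-up stand-ins for encoder outputs. The names `cosine_similarity` and `best_match` are hypothetical.

```python
import math


def cosine_similarity(a, b):
    """Cosine similarity of two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def best_match(text_feature, image_features):
    """Return the index of the most similar image feature and all scores."""
    scores = [cosine_similarity(text_feature, f) for f in image_features]
    best = max(range(len(scores)), key=scores.__getitem__)
    return best, scores


# Illustrative 2-D features only; real encoders emit far larger vectors.
text_feature = [1.0, 0.0]
image_features = [[0.0, 1.0], [2.0, 0.0], [1.0, 1.0]]

best_index, scores = best_match(text_feature, image_features)
print(best_index, scores)
```

Cosine similarity ignores vector magnitude, which is why the second image feature, `[2.0, 0.0]`, matches the query direction exactly despite its larger norm.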
-
Publication number: US20180077168A1
Publication date: 2018-03-15
Application number: US15264116
Application date: 2016-09-13
Applicant: Samsung Electronics Co., Ltd.
Inventor: Yilin SHEN , Pengfei HU , Hongxia JIN
CPC classification number: H04L63/107 , G06F1/3206 , G06F1/3212 , G06F1/3215 , G06F1/325 , G06F1/3296 , H04L43/106 , H04L43/16 , H04L63/0428 , H04L63/0492 , H04L63/06 , H04L63/083 , H04L63/0876 , H04W4/021 , H04W4/80 , H04W12/00503 , H04W12/02 , H04W12/04 , H04W12/06 , Y02D10/174
Abstract: An accurate distance between two devices can be determined in a continuous and secure manner using modulated audible signals containing time-based information. This calculated distance can be used to lock and unlock one of the two devices such that if one of the devices, such as a smart phone or smart watch, is beyond a pre-configured distance from the other device, such as a laptop or tablet, the other device locks and may display a message to the user. The modulated messages contain time difference data of audible signal emission and receiving times, which are used by each device to calculate an accurate estimate of the distance between the two devices.
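A distance estimate from emission and receiving times can be sketched with a generic two-way acoustic ranging formula. This is an assumption for illustration, not the procedure claimed in the patent: each device measures a time difference on its own clock, so no clock synchronization is needed, and the one-way time of flight is half of the round trip minus the responder's turnaround delay. The speed of sound value, the function names, and the timestamps are all hypothetical.

```python
SPEED_OF_SOUND_M_PER_S = 343.0  # approximate value in dry air at about 20 degrees C


def estimate_distance(a_emit, a_recv, b_recv, b_emit, speed=SPEED_OF_SOUND_M_PER_S):
    """Estimate distance in meters from two locally measured time differences.

    a_emit, a_recv: device A's clock when it emits and later hears B's reply.
    b_recv, b_emit: device B's clock when it hears A and later emits its reply.
    """
    round_trip = a_recv - a_emit   # measured entirely on A's clock
    turnaround = b_emit - b_recv   # measured entirely on B's clock
    time_of_flight = (round_trip - turnaround) / 2.0
    if time_of_flight < 0:
        raise ValueError("inconsistent timestamps: negative time of flight")
    return speed * time_of_flight


def should_lock(distance_m, threshold_m):
    """Lock when the paired device is beyond the pre-configured distance."""
    return distance_m > threshold_m


# Illustrative timestamps in seconds; the two clocks are deliberately unrelated.
distance = estimate_distance(a_emit=10.0, a_recv=10.12, b_recv=500.0, b_emit=500.1)
locked = should_lock(distance, threshold_m=3.0)
print(round(distance, 2), locked)
```

Because only differences taken on the same clock enter the formula, the large offset between the two devices' clocks in the example has no effect on the result.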