-
Publication No.: US20220156287A1
Publication Date: 2022-05-19
Application No.: US17112975
Filing Date: 2020-12-04
Applicant: Samsung Electronics Co., Ltd.
Inventor: HUI ZHANG, JOO HWAN LEE, YIQUN ZHANG, ARMIN HAJ ABOUTALEBI, XIAODONG ZHAO, PRAVEEN KRISHNAMOORTHY, ANDREW CHANG, YANG SEOK KI
Abstract: A method of processing data in a system having a host and a storage node may include performing a shuffle operation on data stored at the storage node, wherein the shuffle operation may include performing a shuffle write operation, and performing a shuffle read operation, wherein at least a portion of the shuffle operation is performed by an accelerator at the storage node. A method for partitioning data may include sampling, at a device, data from one or more partitions based on a number of samples, transferring the sampled data from the device to a host, determining, at the host, one or more splitters based on the sampled data, communicating the one or more splitters from the host to the device, and partitioning, at the device, data for the one or more partitions based on the one or more splitters.
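The partitioning flow in this abstract (sample at the device, choose splitters at the host, partition at the device) can be illustrated with a minimal Python sketch. It is not the patented implementation: the function names, the evenly spaced splitter selection, and the use of plain lists in place of a storage device and host are all assumptions made for illustration.

```python
import bisect
import random


def sample_partition(partition, num_samples, rng):
    """Device side (hypothetical): draw up to num_samples records."""
    k = min(num_samples, len(partition))
    return rng.sample(partition, k)


def choose_splitters(samples, num_output_partitions):
    """Host side (hypothetical): evenly spaced splitters from sorted samples."""
    ordered = sorted(samples)
    if not ordered or num_output_partitions < 2:
        return []
    step = len(ordered) / num_output_partitions
    return [ordered[int(i * step)] for i in range(1, num_output_partitions)]


def partition_by_splitters(partition, splitters):
    """Device side (hypothetical): route each record to a range bucket."""
    buckets = [[] for _ in range(len(splitters) + 1)]
    for value in partition:
        # bisect_right returns j such that splitters[j-1] <= value < splitters[j]
        buckets[bisect.bisect_right(splitters, value)].append(value)
    return buckets


rng = random.Random(0)
partitions = [[rng.randrange(1000) for _ in range(200)] for _ in range(3)]

# 1. Each device samples its partition; only the samples go to the host.
samples = [s for p in partitions for s in sample_partition(p, 20, rng)]
# 2. The host derives splitters and sends them back to the devices.
splitters = choose_splitters(samples, 4)
# 3. Each device partitions its own data locally using the splitters.
buckets = [partition_by_splitters(p, splitters) for p in partitions]
```

In this sketch only the sampled records and the splitters cross the device/host boundary, which is the point of the sampling step described in the abstract; the full data never leaves the device that holds it.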
-
Publication No.: US20200234146A1
Publication Date: 2020-07-23
Application No.: US16442447
Filing Date: 2019-06-14
Applicant: Samsung Electronics Co., Ltd.
Inventor: JOO HWAN LEE, YANG SEOK KI, BEHNAM POURGHASSEMI
Abstract: Computing resources are optimally allocated for a multipath neural network using a multipath neural network analyzer that includes an interface and a processing device. The interface receives a multipath neural network that includes two or more paths. A first path includes one or more layers. A first layer of the first path corresponds to a first kernel that runs on a compute unit that includes two or more cores. The processing device allocates to the first kernel a minimum number of cores of the compute unit and a maximum number of cores of the compute unit. The minimum number of cores of the compute unit is allocated based on the first kernel being run concurrently with at least one other kernel on the compute unit and the maximum number of cores of the compute unit is allocated based on the first kernel being run alone on the compute unit.
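The minimum/maximum core allocation in this abstract can be sketched as follows. The abstract does not say how the two bounds are computed, so the rule used here is an assumption: the maximum is what the kernel could use when running alone, and the minimum is a proportional share when it competes with concurrent kernels. `Kernel`, `requested_cores`, and `allocate_core_bounds` are hypothetical names.

```python
from dataclasses import dataclass


@dataclass
class Kernel:
    name: str
    requested_cores: int  # hypothetical: cores the kernel could use if alone


def allocate_core_bounds(kernel, concurrent, total_cores):
    """Return (min_cores, max_cores) for a kernel on one compute unit.

    max_cores: the kernel runs alone on the compute unit.
    min_cores: the kernel runs concurrently with the kernels in `concurrent`
    (assumed policy: proportional share of the cores, at least one core).
    """
    max_cores = min(kernel.requested_cores, total_cores)
    demand = kernel.requested_cores + sum(k.requested_cores for k in concurrent)
    if demand <= total_cores:
        # No contention: the concurrent case needs no reduction.
        return max_cores, max_cores
    share = total_cores * kernel.requested_cores // demand
    return max(1, min(share, max_cores)), max_cores


conv = Kernel("conv", requested_cores=8)
pool = Kernel("pool", requested_cores=8)
bounds_shared = allocate_core_bounds(conv, [pool], total_cores=8)
bounds_alone = allocate_core_bounds(conv, [], total_cores=8)
```

Under this assumed policy, two kernels that each want all 8 cores get a minimum of 4 and a maximum of 8, while a kernel with no concurrent peers gets equal bounds.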
-
Publication No.: US20220231698A1
Publication Date: 2022-07-21
Application No.: US17357953
Filing Date: 2021-06-24
Applicant: Samsung Electronics Co., Ltd.
Inventor: Sahand SALAMAT, JOO HWAN LEE, ARMIN HAJ ABOUTALEBI, PRAVEEN KRISHNAMOORTHY, XIAODONG ZHAO, HUI ZHANG, YANG SEOK KI
Abstract: An accelerator is disclosed. The accelerator may include a memory that may store a dictionary table. An address generator may be configured to generate an address in the dictionary table based on an encoded value, which may have an encoded width. An output filter may be configured to filter a decoded value from the dictionary table based on the encoded value, the encoded width, and a decoded width of the decoded data. The accelerator may be configured to support at least two different encoded widths.
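A software analogue of the dictionary decoder in this abstract can be sketched in Python. This is a simplified illustration under assumptions, not the disclosed hardware: the 8-byte row width (`ROW_BYTES`), the packing of several decoded values per row, the little-endian bit packing, and all function names are hypothetical. In this sketch the encoded width is used only when unpacking the input stream, whereas the abstract describes the output filter as also depending on it.

```python
ROW_BYTES = 8  # hypothetical width of one dictionary-table memory row


def build_table(values, decoded_width):
    """Pack decoded values back to back into fixed-width rows."""
    assert ROW_BYTES % decoded_width == 0, "values must not straddle rows"
    raw = b"".join(v.to_bytes(decoded_width, "little") for v in values)
    raw += bytes(-len(raw) % ROW_BYTES)  # zero-pad the last row
    return [raw[i:i + ROW_BYTES] for i in range(0, len(raw), ROW_BYTES)]


def generate_address(encoded, decoded_width):
    """Address generator: row that holds the entry for this encoded value."""
    return (encoded * decoded_width) // ROW_BYTES


def output_filter(row, encoded, decoded_width):
    """Output filter: select the decoded value from within the fetched row."""
    offset = (encoded * decoded_width) % ROW_BYTES
    return int.from_bytes(row[offset:offset + decoded_width], "little")


def pack(codes, encoded_width):
    """Helper for the example: bit-pack codes at the given encoded width."""
    bits = sum(code << (i * encoded_width) for i, code in enumerate(codes))
    return bits.to_bytes((len(codes) * encoded_width + 7) // 8, "little")


def unpack(stream, encoded_width, count):
    bits = int.from_bytes(stream, "little")
    mask = (1 << encoded_width) - 1
    return [(bits >> (i * encoded_width)) & mask for i in range(count)]


def decode(stream, encoded_width, count, table, decoded_width):
    out = []
    for code in unpack(stream, encoded_width, count):
        row = table[generate_address(code, decoded_width)]
        out.append(output_filter(row, code, decoded_width))
    return out


table = build_table([100, 200, 300, 400, 500], decoded_width=2)
codes = [4, 0, 2, 1]
# The same table serves two different encoded widths (3-bit and 4-bit codes).
decoded_3bit = decode(pack(codes, 3), 3, len(codes), table, 2)
decoded_4bit = decode(pack(codes, 4), 4, len(codes), table, 2)
```

Both calls return the same decoded values, which illustrates the abstract's claim of supporting at least two encoded widths with one dictionary table.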
-
-