-
1.
公开(公告)号:US20230418673A1
公开(公告)日:2023-12-28
申请号:US18166685
申请日:2023-02-09
Inventor: Myoungsoo JUNG , Junhyeok JANG , Miryeong KWON , Donghyun GOUK , Hanyeoreum BAE
CPC classification number: G06F9/5027 , G06F9/54 , G06F9/4881
Abstract: Provided is an apparatus for accelerating a graph neural network for efficient parallel processing of massive graph datasets, including a streaming multiprocess (SM) scheduler and a computation unit, wherein the SM scheduler obtains a subgraph and an embedding table per layer, determines a number of SMs to be allocated for processing embeddings of a destination-vertex based on a feature dimension and a maximum number of threads in each of the SMs, and allocates the determined number of SMs to each of all destination-vertices included in the subgraph, and the computation unit obtains, by each of the SMs, embeddings of a destination-vertex allocated to each SM, obtains, by each SM, embeddings of at least one or more neighbor-vertices of the destination-vertex using the subgraph, and performs, by each SM, a user-designated operation using the embeddings of the destination-vertex and the embeddings of the neighbor-vertices.
-
公开(公告)号:US20250061077A1
公开(公告)日:2025-02-20
申请号:US18797821
申请日:2024-08-08
Inventor: Myoungsoo JUNG , Miryeong Kwon , Junhyeok JANG , Seungjun LEE , Hanjin CHOI , Hanyeoreum BAE
Abstract: A memory expander is disclosed. The memory expander includes a memory, a memory controller configured to control the memory, a compute express link (CXL) engine configured to acquire a CXL flit from a host device connected to the memory expander and configured to acquire a calculation request for pieces of data stored in the memory by performing conversion on the CXL flit, and a domain-specific accelerator configured to perform a calculation in response to the calculation request.
-