Method and apparatus for managing unified virtual memory by preventing migration of deep learning model's weight from processor memory to coprocessor memory

    公开(公告)号:US12189527B2

    公开(公告)日:2025-01-07

    申请号:US18343099

    申请日:2023-06-28

    Abstract: A method and apparatus for managing a unified virtual memory (UVM) are provided. The UVM is backed by a main processor memory and a coprocessor memory, and the method includes: checking properties of data blocks of the UVM used to execute a deep learning model; based on a first of the data blocks storing weight data of the deep learning model, storing the first data block in the main processor memory among the main processor memory and the coprocessor memory; and performing an operation of the deep learning model based on the first data block using a coprocessor while directly loading at least a portion of the first data block from the main processor memory into a cache memory of the coprocessor without migration of the first data block from the main processor memory to the coprocessor memory.

    Active scheduling method and computing apparatus

    公开(公告)号:US12014215B2

    公开(公告)日:2024-06-18

    申请号:US17327600

    申请日:2021-05-21

    CPC classification number: G06F9/5038 G06F9/3877 G06F9/505 G06F9/542

    Abstract: An active scheduling method performed with a master processor and a plurality of slave processors. The method includes determining whether a job to be performed has a dependency by referencing a job queue; in a case in which it is determined that the job to be performed has a dependency, updating a state of the job to be performed in a table in which information of each of a plurality of jobs is recorded; analyzing a state of a job preceding the job to be performed based on the table; and in a case in which the job preceding the job to be performed is determined to have been completed, performing the job to be performed by retrieving the job to be performed from the job queue.

    Apparatus and method with remote page access

    公开(公告)号:US12299292B2

    公开(公告)日:2025-05-13

    申请号:US18089839

    申请日:2022-12-28

    Abstract: An apparatus includes a memory configured to store data, and a processor. The processor configured to determine whether an access to the data is a local memory access; determine, based on a result of the determination of whether the access to the data is the local memory access, whether a page fault of the access occurred; determine, based on a result of the determination of whether the page fault occurred, whether the access is a remote access outside a socket; and perform, based on a result of the determination of whether the access is the remote access, the access to the data by copying the data onto a local memory.

Patent Agency Ranking