GENERIC CONCURRENCY RESTRICTION
    1.
    发明公开

    公开(公告)号:US20240345901A1

    公开(公告)日:2024-10-17

    申请号:US18757299

    申请日:2024-06-27

    发明人: David Dice Alex Kogan

    IPC分类号: G06F9/52 G06F9/50

    CPC分类号: G06F9/52 G06F9/5022 G06F9/524

    摘要: Generic Concurrency Restriction (GCR) may divide a set of threads waiting to acquire a lock into two sets: an active set currently able to contend for the lock, and a passive set waiting for an opportunity to join the active set and contend for the lock. The number of threads in the active set may be limited to a predefined maximum or even a single thread. Generic Concurrency Restriction may be implemented as a wrapper around an existing lock implementation. Generic Concurrency Restriction may, in some embodiments, be unfair (e.g., to some threads) over the short term, but may improve the overall throughput of the underlying multithreaded application via passivation of a portion of the waiting threads.

    MEMORY MANAGEMENT IN A MULTI-PROCESSOR ENVIRONMENT

    公开(公告)号:US20240345881A1

    公开(公告)日:2024-10-17

    申请号:US18751415

    申请日:2024-06-24

    申请人: Next Silicon Ltd

    IPC分类号: G06F9/50 G06F12/02

    摘要: There is provided a memory, comprising: issuing an allocation operation for allocation of a region of a memory by a first process of a plurality of first processes executed in parallel on a first processor, sending a message to a second processor indicating the allocation of the region of the pool of the memory, issuing a free operation for release of the allocated region of the pool of the memory by a second process of a plurality of second processes executed in parallel on a second processor, and releasing, by the first processor, the allocated region of the pool of the memory as indicated in the free operation, wherein a same region of memory is allocated by the first process and released by the second process, wherein the first processes are concurrently attempting to issue the allocation operation and the second processes are concurrently attempting to issue the free operation.

    METHOD AND SYSTEM FOR ALLOCATING ON-CHIP MEMORY OF NEURAL PROCESSING UNIT

    公开(公告)号:US20240320044A1

    公开(公告)日:2024-09-26

    申请号:US18673214

    申请日:2024-05-23

    申请人: REBELLIONS INC.

    发明人: Minhoo Kang

    IPC分类号: G06F9/50

    摘要: A method for allocating on-chip memory of a neural processing unit is performed by one or more processors, and includes deallocating an allocated chunk in an on-chip memory area, which is finished with the use of the memory, and converting it into a cached chunk, receiving an on-chip memory allocation request for specific data, determining whether there is a cached chunk of one or more cached chunks that is allocable for the specific data, based on a comparison between a size of the specific data and the size of the one or more cached chunks, and based on a result of determining whether there is the cached chunk that is allocable for the specific data, allocating the specific data to a specific cached chunk of the one or more cached chunks, or allocating the specific data to at least a portion of the free chunk.

    Method and system for allocating on-chip memory of neural processing unit

    公开(公告)号:US12026552B1

    公开(公告)日:2024-07-02

    申请号:US18389676

    申请日:2023-12-19

    申请人: REBELLIONS INC.

    发明人: Minhoo Kang

    IPC分类号: G06F9/46 G06F9/50

    摘要: A method for allocating on-chip memory of a neural processing unit is performed by one or more processors, and includes deallocating an allocated chunk in an on-chip memory area, which is finished with the use of the memory, and converting it into a cached chunk, receiving an on-chip memory allocation request for specific data, determining whether there is a cached chunk of one or more cached chunks that is allocable for the specific data, based on a comparison between a size of the specific data and the size of the one or more cached chunks, and based on a result of determining whether there is the cached chunk that is allocable for the specific data, allocating the specific data to a specific cached chunk of the one or more cached chunks, or allocating the specific data to at least a portion of the free chunk.