EFFICIENT TENSOR REMATERIALIZATION FOR NEURAL NETWORKS

    公开(公告)号:US20240249128A1

    公开(公告)日:2024-07-25

    申请号:US18353637

    申请日:2023-07-17

    CPC classification number: G06N3/063

    Abstract: A processor-implemented method for rematerialization for an artificial neural network (ANN) includes receiving a graph representing the ANN. The graph includes multiple nodes connected by edges and each node represents an operation. Retention intervals for the nodes are determined based on a precedence constraint for the nodes. The retention intervals correspond to a time interval for retaining each node output in a local memory. One of the nodes to recompute is determined based on the retention intervals.

    EFFICIENT OPTIMIZATION OF TENSOR REMATERIALIZATION AND PAGING FOR NEURAL NETWORKS

    公开(公告)号:US20240386237A1

    公开(公告)日:2024-11-21

    申请号:US18495609

    申请日:2023-10-26

    Abstract: A processor-implemented method includes receiving a graph representing an artificial neural network (ANN). The graph includes multiple nodes connected by edges and each node represents an operation. Retention intervals are determined for the multiple node outputs based on rematerialization constraints and paging constraints. The retention intervals correspond to a time interval for retaining each node output in at least one local memory. A sequence of tasks for executing the multiple nodes of the graph representing the ANN is determined based on the retention intervals.

    NEURAL DIRECTED ACYCLIC GRAPH (DAG) SCHEDULING VIA ONE-SHOT PRIORITY SAMPLING

    公开(公告)号:US20240119301A1

    公开(公告)日:2024-04-11

    申请号:US18464996

    申请日:2023-09-11

    CPC classification number: G06N3/092

    Abstract: A processor-implemented method includes sampling, according to a priority sampling policy, a set of node priorities from a computation graph. Each node priority of the set of node priorities may be associated with a respective node on the computation graph. Additionally, each node may represent an operation of a task performed by an artificial neural network. The method also includes converting, via a list scheduling function, the node priorities to a schedule that associates each node of the computation graph with a processor of a group of processors of a device associated with the artificial neural network, the schedule associated with a makespan. The method further includes performing the task in accordance with the schedule.

Patent Agency Ranking