Scheduling for locality of reference to memory

    公开(公告)号:US11625269B1

    公开(公告)日:2023-04-11

    申请号:US17301343

    申请日:2021-03-31

    摘要: A technique for scheduling instructions includes obtaining a set of instructions that operate on memory objects, and determining the dependencies of the memory objects. The memory objects are then sorted into a sequence of memory objects based on the dependencies of the memory objects, and the set of instructions are scheduled into a sequence of instructions according to the sequence of memory objects. Sorting memory objects allows instructions that operate on the same memory object to be kept together. This helps minimize spilling conditions because intervening instructions that do not operate on the same memory object can be avoided.

    System for routing machine learning model inferences

    公开(公告)号:US11170309B1

    公开(公告)日:2021-11-09

    申请号:US15821564

    申请日:2017-11-22

    摘要: A machine learning model inference routing system in a machine learning service is described herein. The machine learning model inference routing system includes load balancer(s), network traffic router(s), an endpoint registry, and a feedback processing system that collectively allow the machine learning model inference routing system to adjust the routing of inferences based on machine learning model accuracy, demand, and/or the like. In addition, the arrangement of components in the machine learning model inference routing system enables the machine learning service to perform shadow testing, support ensemble machine learning models, and/or improve existing machine learning models using feedback data.

    Scheduling for locality of reference to memory

    公开(公告)号:US12131188B1

    公开(公告)日:2024-10-29

    申请号:US18192081

    申请日:2023-03-29

    IPC分类号: G06F9/48 G06F8/41 G06N3/063

    摘要: A technique for scheduling instructions includes obtaining a set of instructions that operate on memory objects, and determining the dependencies of the memory objects. The memory objects are then sorted into a sequence of memory objects based on the dependencies of the memory objects, and the set of instructions are scheduled into a sequence of instructions according to the sequence of memory objects. Sorting memory objects allows instructions that operate on the same memory object to be kept together. This helps minimize spilling conditions because intervening instructions that do not operate on the same memory object can be avoided.