USING CONTAINER AND MODEL INFORMATION TO SELECT CONTAINERS FOR EXECUTING MODELS

    公开(公告)号:US20220237506A1

    公开(公告)日:2022-07-28

    申请号:US17159805

    申请日:2021-01-27

    Abstract: Using container and model information to select containers for executing models is described. A system receives a request from an application and identifies a version of a machine-learning model associated with the request. The system identifies model information associated with machine learning models corresponding to a cluster of available serving containers associated with the version of the machine-learning model. The system uses the model information to select a serving container from the cluster of available serving containers. If the machine-learning model is not loaded in the serving container, the system loads the machine-learning model in the serving container. If the machine-learning model is loaded in the serving container, the system executes, in the serving container, the machine-learning model on behalf of the request. The system responds to the request based on executing the machine-learning model on behalf of the request.

    MULTI-MODEL SCORING IN A MULTI-TENANT SYSTEM

    公开(公告)号:US20220414548A1

    公开(公告)日:2022-12-29

    申请号:US17357419

    申请日:2021-06-24

    Abstract: Methods and systems for multi-model scoring in a multi-tenant system are presented. A request for a machine learning application is received from a tenant application. A tenant identifier that identifies one of the multiple tenants is determined. Based on the tenant identifier and a type of the machine learning application, a first and a second machine learning models are determined. The first machine learning model was generated based on a first training data set associated with the tenant identifier. The second machine learning model that was generated based on a second training data set associated with the tenant identifier. A flow of operations that includes running the first and second machine learning models with data related to the request is executed to obtain a scoring result. The scoring result is returned to the tenant application in response to the request.

Patent Agency Ranking