-
Publication No.: US20220391747A1
Publication Date: 2022-12-08
Application No.: US17337387
Filing Date: 2021-06-02
Applicant: salesforce.com, inc.
Inventor: Seyedshahin Ashrafzadeh, Yuliya L. Feldman, Alexandr Nikitin, Manoj Agarwal, Chirag Rajan, Swaminathan Sundaramurthy
IPC: G06N20/00
Abstract: A method by a router component in a multi-tenant on-demand serving infrastructure to route scoring requests to scoring containers. The method includes receiving a scoring request, determining a machine learning application associated with the scoring request, determining whether a router instance for the machine learning application exists, and responsive to a determination that a router instance for the machine learning application does not exist, obtaining a configuration object for the machine learning application and instantiating the router instance for the machine learning application based on the configuration object for the machine learning application. The method further includes invoking the router instance for the machine learning application to route the scoring request associated with the machine learning application to a scoring container that provides scoring functionality for the machine learning application.
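The lazy instantiation step in this abstract can be illustrated with a minimal sketch. It is not the patented implementation: the class names, the `load_config` callback, and the `container_url` configuration field are all hypothetical, and routing is reduced to returning the target URL.

```python
from typing import Callable, Dict


class RouterInstance:
    """Hypothetical per-application router built from a configuration object."""

    def __init__(self, config: dict):
        # Assumed config field: the URL of the scoring container for this app.
        self.container_url = config["container_url"]

    def route(self, request: dict) -> str:
        # A real router would forward the request; this sketch only reports the target.
        return self.container_url


class RouterComponent:
    """Keeps one router instance per machine learning application."""

    def __init__(self, load_config: Callable[[str], dict]):
        self._load_config = load_config  # hypothetical config lookup
        self._instances: Dict[str, RouterInstance] = {}

    def handle(self, request: dict) -> str:
        app = request["app"]  # application associated with the scoring request
        if app not in self._instances:
            # No router instance yet: obtain the config object and instantiate one.
            self._instances[app] = RouterInstance(self._load_config(app))
        return self._instances[app].route(request)
```

In this sketch the configuration object is fetched only on the first request for an application; later requests reuse the existing router instance.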
-
Publication No.: US20220382539A1
Publication Date: 2022-12-01
Application No.: US17334617
Filing Date: 2021-05-28
Applicant: salesforce.com, inc.
Inventor: Vaibhav Gumashta, Alexandr Nikitin, Yuliya L. Feldman, Seyedshahin Ashrafzadeh, Manoj Agarwal
Abstract: Machine learning version management method for a prediction service includes receiving a prediction request, determining application metadata for the request that defines routing logic and a machine learning framework version, determining model metadata for the request that defines at least one model and at least one model version, forwarding the prediction request to the at least one model with the at least one model version, and returning a prediction from the at least one model to a requestor.
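As a rough illustration of the two metadata lookups described here, the sketch below resolves application metadata and model metadata, then forwards the request to each listed model version. The dictionaries, keys, and model names are invented for the example, and the routing logic that the abstract attributes to application metadata is omitted.

```python
from typing import Callable, Dict, List, Tuple

# Hypothetical metadata stores; names and fields are illustrative only.
APP_METADATA: Dict[str, dict] = {"churn": {"framework_version": "1.4"}}
MODEL_METADATA: Dict[str, List[Tuple[str, str]]] = {"churn": [("churn-model", "v3")]}
MODELS: Dict[Tuple[str, str], Callable[[dict], float]] = {
    ("churn-model", "v3"): lambda features: 0.5 * features["x"],
}


def predict(request: dict) -> dict:
    # Application metadata: here only the machine learning framework version.
    app_meta = APP_METADATA[request["app"]]
    # Model metadata: which model(s) and version(s) serve this request.
    model_refs = MODEL_METADATA[request["app"]]
    predictions = {
        f"{name}:{version}": MODELS[(name, version)](request["features"])
        for name, version in model_refs
    }
    # Return the prediction(s) to the requestor along with the resolved version.
    return {"framework_version": app_meta["framework_version"],
            "predictions": predictions}
```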
-
Publication No.: US12204892B2
Publication Date: 2025-01-21
Application No.: US17337388
Filing Date: 2021-06-02
Applicant: Salesforce.com, Inc.
Inventor: Seyedshahin Ashrafzadeh, Yuliya L Feldman, Alexandr Nikitin, Manoj Agarwal, Chirag Rajan, Swaminathan Sundaramurthy
IPC: G06F9/44, G06F8/60, G06F8/71, G06F9/455, G06F11/14, G06F11/30, G06F11/32, G06F11/34, G06N20/00, G06F8/10, G06F11/36
Abstract: A method by one or more electronic devices to provision an infrastructure for a machine learning application in a multi-tenant on-demand serving infrastructure. The method includes storing a plurality of templates, wherein each of the plurality of templates indicates a scoring interface, a web server, a definition of a continuous integration pipeline, and a definition of a continuous deployment pipeline, receiving a request to provision the infrastructure for the machine learning application using a specified template from the plurality of templates, and provisioning the infrastructure for the machine learning application using the specified template to create a version control system repository, a continuous integration pipeline, and a continuous deployment pipeline.
-
Publication No.: US20220318647A1
Publication Date: 2022-10-06
Application No.: US17217406
Filing Date: 2021-03-30
Applicant: salesforce.com, inc.
Inventor: Seyedshahin Ashrafzadeh, Yuliya Feldman, Manoj Agarwal, Chirag Rajan, Swaminathan Sundaramurthy, Endri Deliu
Abstract: A method and system for a single framework for both streaming and on-demand inference that includes receiving a request from a tenant application for a machine-learning serving infrastructure, where the request identifies features of tenant data and a machine-learning model, subscribing to events for the identified features, initiating the machine-learning model for the request, and generating a prediction using the machine-learning model on the identified features.
-
Publication No.: US20220237506A1
Publication Date: 2022-07-28
Application No.: US17159805
Filing Date: 2021-01-27
Applicant: salesforce.com, inc.
Inventor: Yuliya L. Feldman, Seyedshahin Ashrafzadeh, Alexandr Nikitin, Manoj Agarwal
Abstract: Using container and model information to select containers for executing models is described. A system receives a request from an application and identifies a version of a machine-learning model associated with the request. The system identifies model information associated with machine learning models corresponding to a cluster of available serving containers associated with the version of the machine-learning model. The system uses the model information to select a serving container from the cluster of available serving containers. If the machine-learning model is not loaded in the serving container, the system loads the machine-learning model in the serving container. If the machine-learning model is loaded in the serving container, the system executes, in the serving container, the machine-learning model on behalf of the request. The system responds to the request based on executing the machine-learning model on behalf of the request.
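One way to picture the select-then-load-then-execute sequence is the sketch below. The selection rule (prefer a container that already holds the model, otherwise the one with the fewest loaded models) is an assumption made for illustration; the abstract only says that model information is used to select a container. All names are hypothetical.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, List

Model = Callable[[dict], dict]


@dataclass
class ServingContainer:
    name: str
    # Model information for this container: model key -> loaded model.
    loaded: Dict[str, Model] = field(default_factory=dict)


def select_container(containers: List[ServingContainer], model_key: str) -> ServingContainer:
    # Assumed policy: containers that already hold the model sort first
    # (False < True), then containers with fewer loaded models.
    return min(containers, key=lambda c: (model_key not in c.loaded, len(c.loaded)))


def serve(containers: List[ServingContainer], model_key: str,
          load_model: Callable[[str], Model], request: dict) -> dict:
    container = select_container(containers, model_key)
    if model_key not in container.loaded:
        # Model not loaded in the selected container: load it first.
        container.loaded[model_key] = load_model(model_key)
    # Execute the model on behalf of the request and respond with its result.
    return container.loaded[model_key](request)
```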
-
Publication No.: US20220391748A1
Publication Date: 2022-12-08
Application No.: US17337389
Filing Date: 2021-06-02
Applicant: salesforce.com, inc.
Inventor: Alexandr Nikitin, Vaibhav Gumashta, Manoj Agarwal, Swaminathan Sundaramurthy
Abstract: A method of a base scorer in a scoring service container includes sending a model identifier to a model loader of an application specific scorer in the scoring service container, receiving a model object from the model loader in response to sending the model identifier, sending a request for a scoring from a client application to a scoring function of the application specific scorer, receiving the scoring from the application specific scorer, and returning the scoring to the client application.
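The division of labor between a base scorer and an application-specific scorer might look like the following sketch. The interface, the class names, and the in-memory caching of model objects are assumptions for illustration and are not taken from the publication.

```python
from typing import Any, Dict, Protocol


class AppSpecificScorer(Protocol):
    """Hypothetical interface an application-specific scorer would implement."""

    def load_model(self, model_id: str) -> Any: ...

    def score(self, model: Any, request: dict) -> Any: ...


class BaseScorer:
    """Generic part of the scoring container; delegates app-specific work."""

    def __init__(self, app_scorer: AppSpecificScorer):
        self._app_scorer = app_scorer
        self._models: Dict[str, Any] = {}  # assumed cache of model objects

    def handle(self, model_id: str, request: dict) -> Any:
        if model_id not in self._models:
            # Send the model identifier to the model loader, keep the model object.
            self._models[model_id] = self._app_scorer.load_model(model_id)
        # Pass the client's request to the scoring function and return the scoring.
        return self._app_scorer.score(self._models[model_id], request)


class LinearScorer:
    """Toy application-specific scorer used only to exercise BaseScorer."""

    def load_model(self, model_id: str) -> dict:
        return {"weight": 2.0}

    def score(self, model: dict, request: dict) -> float:
        return model["weight"] * request["x"]
```

A call such as `BaseScorer(LinearScorer()).handle("m1", {"x": 3})` exercises both steps: the model is loaded once through the loader, then scored through the application-specific function.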
-
Publication No.: US20220391199A1
Publication Date: 2022-12-08
Application No.: US17337388
Filing Date: 2021-06-02
Applicant: salesforce.com, inc.
Inventor: Seyedshahin Ashrafzadeh, Yuliya L. Feldman, Alexandr Nikitin, Manoj Agarwal, Chirag Rajan, Swaminathan Sundaramurthy
Abstract: A method by one or more electronic devices to provision an infrastructure for a machine learning application in a multi-tenant on-demand serving infrastructure. The method includes storing a plurality of templates, wherein each of the plurality of templates indicates a scoring interface, a web server, a definition of a continuous integration pipeline, and a definition of a continuous deployment pipeline, receiving a request to provision the infrastructure for the machine learning application using a specified template from the plurality of templates, and provisioning the infrastructure for the machine learning application using the specified template to create a version control system repository, a continuous integration pipeline, and a continuous deployment pipeline.
-
Publication No.: US20220414548A1
Publication Date: 2022-12-29
Application No.: US17357419
Filing Date: 2021-06-24
Applicant: salesforce.com, inc.
Inventor: Seyedshahin Ashrafzadeh, Alexandr Nikitin, Vaibhav Gumashta, Yuliya L. Feldman, Chirag Rajan, Manoj Agarwal, Swaminathan Sundaramurthy
Abstract: Methods and systems for multi-model scoring in a multi-tenant system are presented. A request for a machine learning application is received from a tenant application. A tenant identifier that identifies one of the multiple tenants is determined. Based on the tenant identifier and a type of the machine learning application, a first and a second machine learning model are determined. The first machine learning model was generated based on a first training data set associated with the tenant identifier. The second machine learning model was generated based on a second training data set associated with the tenant identifier. A flow of operations that includes running the first and second machine learning models with data related to the request is executed to obtain a scoring result. The scoring result is returned to the tenant application in response to the request.
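A toy version of the multi-model flow is sketched below. The registry, the tenant and application names, and the choice to average the two model outputs are assumptions; the abstract does not say how the flow combines the models.

```python
from typing import Callable, Dict, Tuple

Model = Callable[[dict], float]

# Hypothetical registry: (tenant id, application type) -> two tenant-specific models.
REGISTRY: Dict[Tuple[str, str], Tuple[Model, Model]] = {
    ("tenant-1", "lead-scoring"): (
        lambda data: data["a"] * 0.5,   # stands in for a model trained on data set 1
        lambda data: data["b"] * 0.25,  # stands in for a model trained on data set 2
    ),
}


def score(tenant_id: str, app_type: str, data: dict) -> float:
    # Determine both models from the tenant identifier and application type.
    first, second = REGISTRY[(tenant_id, app_type)]
    # Assumed flow of operations: run both models and average their outputs.
    return (first(data) + second(data)) / 2
```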
-
Publication No.: US20220391239A1
Publication Date: 2022-12-08
Application No.: US17337377
Filing Date: 2021-06-02
Applicant: salesforce.com, inc.
Inventor: Yuliya L. Feldman, Alexandr Nikitin, Manoj Agarwal, Chirag Rajan
IPC: G06F9/455, H04L29/06, G06F11/36, G06N20/00, G06F16/955
Abstract: A method by one or more electronic devices for spinning up a scoring container on demand. The method includes receiving, from an orchestrator component via an application programming interface (API), a request to spin up the scoring container, wherein the scoring container is configured to provide scoring functionality, spinning up the scoring container responsive to receiving the request to spin up the scoring container, and providing, to the orchestrator component via the API, a response to the request to spin up the scoring container, wherein the response includes a uniform resource locator (URL) to use to submit scoring requests to the scoring container.
-
Publication No.: US20220382601A1
Publication Date: 2022-12-01
Application No.: US17334592
Filing Date: 2021-05-28
Applicant: salesforce.com, inc.
Inventor: Yuliya L. Feldman, Seyedshahin Ashrafzadeh, Alexandr Nikitin, Manoj Agarwal
Abstract: A machine learning serving infrastructure implementing a method of receiving or detecting an update of container metrics including resource usage and serviced requests per model or per container, processing the container metrics per model or per container to determine recent resource usage and serviced requests per model or per container, and rebalancing distribution of models to a plurality of containers to decrease a detected load imbalance between containers or a stressed container in the plurality of containers.
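The rebalancing idea can be sketched with a simple greedy heuristic. This is one possible approach under stated assumptions, not the method claimed in the publication: load is measured only as recent serviced requests per model, and the function name, the `max_moves` limit, and the move-selection rule are all hypothetical.

```python
from typing import Dict


def rebalance(placement: Dict[str, Dict[str, int]],
              max_moves: int = 10) -> Dict[str, Dict[str, int]]:
    """placement maps container -> {model: recent serviced-request count}."""
    result = {c: dict(models) for c, models in placement.items()}  # leave input intact
    if not result:
        return result
    for _ in range(max_moves):
        load = {c: sum(models.values()) for c, models in result.items()}
        hot = max(load, key=load.get)   # most loaded (stressed) container
        cold = min(load, key=load.get)  # least loaded container
        gap = load[hot] - load[cold]
        # Moving a model with load n changes the hot/cold gap to |gap - 2n|,
        # so only models with 0 < n < gap reduce the imbalance.
        candidates = [m for m, n in result[hot].items() if 0 < n < gap]
        if not candidates:
            break
        # Pick the model whose load is closest to half the gap.
        model = min(candidates, key=lambda m: abs(result[hot][m] - gap / 2))
        result[cold][model] = result[hot].pop(model)
    return result
```

A production system would also weigh resource usage such as memory and CPU per container, which the abstract lists among the container metrics; the sketch ignores those.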