- 专利标题: ASYNCHRONOUS PREDICTION IN MACHINE LEARNING MODEL SERVING
-
申请号: US17945960申请日: 2022-09-15
-
公开(公告)号: US20240095109A1公开(公告)日: 2024-03-21
- 发明人: Lawrence Lam , Di Zhao
- 申请人: Twitter, Inc.
- 申请人地址: US CA San Francisco
- 专利权人: Twitter, Inc.
- 当前专利权人: Twitter, Inc.
- 当前专利权人地址: US CA San Francisco
- 主分类号: G06F9/54
- IPC分类号: G06F9/54 ; G06N20/00
摘要:
Example computer-implemented methods, media, and systems for serving machine learning (ML) models using an asynchronous input/output (I/O) mechanism are disclosed. One example method includes receiving a first request for running a ML model to provide a first prediction. A first green thread is generated responsive to the first request and executed on an operating system (OS) thread to send a first asynchronous remote procedure call (RPC) to a multiple producer single consumer (MPSC) channel. A second request for running the ML model to provide a second prediction is received. A second green thread is generated responsive to the second request and executed on the OS thread to send a second RPC to the MPSC channel. The first and the second asynchronous RPCs are scheduled using a first and a second blocking threads respectively, which are used by the ML model to generate the first prediction and the second prediction.
信息查询