-
1.
公开(公告)号:US20220414503A1
公开(公告)日:2022-12-29
申请号:US17569393
申请日:2022-01-05
Inventor: Jongse PARK , Wonik SEO , Sanghoon CHA , Yeonjae KIM , Jaehyuk HUH
Abstract: Disclosed is an SLO-aware artificial intelligence inference scheduler technology in a heterogeneous processor-based edge system. A scheduling method for a machine learning (ML) inference task, which is performed by a scheduling system, may include receiving inference task requests of multiple ML models with respect to an edge system composed of heterogeneous processors and operating heterogeneous processor resources of the edge system based on a service-level objective (SLO)-aware-based scheduling policy in response to the received inference task requests.