APPARATUS AND METHOD FOR MANAGING GIANT MODEL

    Publication (Announcement) No.: US20250148362A1

    Publication (Announcement) Date: 2025-05-08

    Application No.: US18678634

    Filing Date: 2024-05-30

    Abstract: Disclosed herein are an apparatus and a method for managing a giant model. The apparatus includes a memory in which at least one program is recorded and a processor that executes the program. The program may lightweight a first model into a second model in consideration of hardware resources, generate partitioning information for the first model based on a result of analyzing the second model, and perform training or inference for the first model based on the generated partitioning information.
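
    The three steps named in the abstract (lightweighting, analysis-based partitioning, then training or inference) can be illustrated with a minimal Python sketch. It is not the patented implementation: the abstract does not say how lightweighting or analysis is performed, so the uniform shrink factor, the use of parameter counts as the analysis result, the greedy contiguous split, and every name (Layer, lightweight, analyze, make_partitions) are assumptions made for illustration only.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class Layer:
    """Hypothetical stand-in for one block of a model."""
    name: str
    params: int  # parameter count, used here as a proxy for memory cost


def lightweight(first_model: List[Layer], device_capacity: int) -> List[Layer]:
    """Assumed lightweighting: shrink every layer by one common integer
    factor so the second model roughly fits a single device's capacity."""
    total = sum(layer.params for layer in first_model)
    factor = max(1, -(-total // device_capacity))  # ceiling division
    return [Layer(l.name, max(1, l.params // factor)) for l in first_model]


def analyze(second_model: List[Layer]) -> List[int]:
    """Assumed analysis: report a per-layer cost for the second model.
    A real system would profile it; here the parameter count is returned."""
    return [layer.params for layer in second_model]


def make_partitions(costs: List[int], num_devices: int) -> List[List[int]]:
    """Greedy contiguous split of layer indices into at most num_devices
    stages, aiming at an equal share of the total cost per stage."""
    total = sum(costs)
    stages: List[List[int]] = [[]]
    running = 0
    for idx, cost in enumerate(costs):
        # Cumulative cost the stages opened so far are allowed to hold.
        boundary = total * len(stages) / num_devices
        if stages[-1] and running + cost > boundary and len(stages) < num_devices:
            stages.append([])
        stages[-1].append(idx)
        running += cost
    return stages


if __name__ == "__main__":
    # Hypothetical first model: four equally sized blocks, two devices.
    first = [Layer(f"block{i}", 400) for i in range(4)]
    second = lightweight(first, device_capacity=800)
    plan = make_partitions(analyze(second), num_devices=2)
    # The plan indexes layers of the first model, which would then be
    # trained or served stage by stage on separate devices.
    print(plan)
```

    The layer indices in the plan refer to the first model because the sketch keeps a one-to-one layer correspondence between the two models. That correspondence is also an assumption; the abstract only says that partitioning information for the first model is derived from analysis of the second.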
