Model Weight Obtaining Method and Related System

    公开(公告)号:US20240281641A1

    公开(公告)日:2024-08-22

    申请号:US18653096

    申请日:2024-05-02

    CPC classification number: G06N3/045

    Abstract: A model weight obtaining method includes obtaining structure information of a first neural network model; searching, based on the structure information of the first neural network model, a weight library that stores a plurality of groups of historical weights to obtain a reference weight, where the reference weight is a weight of a second neural network model having a structure similar to that of the first neural network model in the plurality of groups of historical weights; and converting the reference weight to obtain a weight of the first neural network model. In the method, a weight of a neural network model having a structure similar to that of a to-be-trained neural network model is searched for in a weight library, and the weight is converted, to obtain a weight that can be used by the to-be-trained neural network model.

    AI model optimization method and apparatus

    公开(公告)号:US12032571B2

    公开(公告)日:2024-07-09

    申请号:US17694970

    申请日:2022-03-15

    CPC classification number: G06F16/2453 G06N20/00

    Abstract: In a method for AI model optimization, an optimization device receives an original AI model and search configuration information that comprises a plurality of search items each indicating its search categories for performing optimization information search on the original AI model. The device obtains a plurality of search operators corresponding to the plurality of search items, and arranges the search operators in an operation sequence based on the search configuration information. The device then executes the search operators in the arranged operation sequence on the original AI model to obtain an optimized AI model. In the execution of the operation sequence, each search operator, except for the first search operator in the operation sequence, is executed utilizing operation results of a preceding search operator in the operation sequence, the operation results including generated network structures and search space information.

    Neural Network Optimization Method and Apparatus

    公开(公告)号:US20240095529A1

    公开(公告)日:2024-03-21

    申请号:US18521152

    申请日:2023-11-28

    CPC classification number: G06N3/08

    Abstract: A neural network optimization method includes receiving a model file of a to-be-optimized neural network; obtaining a search space of a target neural network architecture based on the model file of the to-be-optimized neural network, where the search space includes a value range of each attribute of each neuron in the target neural network architecture; obtaining the target neural network architecture based on the search space; training the target neural network architecture based on the model file of the to-be-optimized neural network, to obtain a model file of a target neural network; and providing the model file of the target neural network to a user.

    Neural Architecture Search Method and Apparatus, Device, and Medium

    公开(公告)号:US20220414426A1

    公开(公告)日:2022-12-29

    申请号:US17902206

    申请日:2022-09-02

    Abstract: This application provides a neural architecture search method, applied to a search system. The search system includes a generator and a searcher. The method includes: The generator generates a plurality of neural network architectures based on a search space; the searcher obtains evaluation indicator values of a plurality of child models obtained based on the plurality of neural network architectures on first hardware; and the searcher determines, based on the neural network architectures corresponding to the plurality of child models and the evaluation indicator values of the plurality of child models on the first hardware, a first target neural network architecture that meets a preset condition. In this way, different initial child model training processes are decoupled, and a neural architecture search process is decoupled from an initial child model training process, so that search duration is reduced and search efficiency is improved.

    AI MODEL OPTIMIZATION METHOD AND APPARATUS

    公开(公告)号:US20220197901A1

    公开(公告)日:2022-06-23

    申请号:US17694970

    申请日:2022-03-15

    Abstract: In a method for AI model optimization, an optimization device receives an original AI model and search configuration information that comprises a plurality of search items each indicating its search categories for performing optimization information search on the original AI model. The device obtains a plurality of search operators corresponding to the plurality of search items, and arranges the search operators in an operation sequence based on the search configuration information. The device then executes the search operators in the arranged operation sequence on the original AI model to obtain an optimized AI model. In the execution of the operation sequence, each search operator, except for the first search operator in the operation sequence, is executed utilizing operation results of a preceding search operator in the operation sequence, the operation results including generated network structures and search space information.

Patent Agency Ranking