Continual neural network learning via explicit structure learning

    公开(公告)号:US11645509B2

    公开(公告)日:2023-05-09

    申请号:US16176419

    申请日:2018-10-31

    CPC classification number: G06N3/08 G06N3/04

    Abstract: Embodiments for training a neural network using sequential tasks are provided. A plurality of sequential tasks are received. For each task in the plurality of tasks a copy of the neural network that includes a plurality of layers is generated. From the copy of the neural network a task specific neural network is generated by performing an architectural search on the plurality of layers in the copy of the neural network. The architectural search identifies a plurality of candidate choices in the layers of the task specific neural network. Parameters in the task specific neural network that correspond to the plurality of candidate choices and that maximize architectural weights at each layer are identified. The parameters are retrained and merged with the neural network. The neural network trained on the plurality of sequential tasks is a trained neural network.

Patent Agency Ranking