EFFICIENT NEURAL NETWORKS WITH ELABORATE MATRIX STRUCTURES IN MACHINE LEARNING ENVIRONMENTS

    公开(公告)号:US20200234137A1

    公开(公告)日:2020-07-23

    申请号:US16632145

    申请日:2017-08-18

    Abstract: A mechanism is described for facilitating slimming of neural networks in machine learning environments. A method of embodiments, as described herein, includes learning a first neural network associated with machine learning processes to be performed by a processor of a computing device, where learning includes analyzing a plurality of channels associated with one or more layers of the first neural network. The method may further include computing a plurality of scaling factors to be associated with the plurality of channels such that each channel is assigned a scaling factor, wherein each scaling factor to indicate relevance of a corresponding channel within the first neural network. The method may further include pruning the first neural network into a second neural network by removing one or more channels of the plurality of channels having low relevance as indicated by one or more scaling factors of the plurality of scaling factors assigned to the one or more channels.

    Efficient neural networks with elaborate matrix structures in machine learning environments

    公开(公告)号:US12165065B2

    公开(公告)日:2024-12-10

    申请号:US16632145

    申请日:2017-08-18

    Abstract: A mechanism is described for facilitating slimming of neural networks in machine learning environments. A method includes learning a first neural network associated with machine learning processes to be performed by a processor of a computing device, where learning includes analyzing a plurality of channels associated with one or more layers of the first neural network. The method may further include computing a plurality of scaling factors to be associated with the plurality of channels such that each channel is assigned a scaling factor, wherein each scaling factor to indicate relevance of a corresponding channel within the first neural network. The method may further include pruning the first neural network into a second neural network by removing one or more channels of the plurality of channels having low relevance as indicated by one or more scaling factors of the plurality of scaling factors assigned to the one or more channels.

    EFFICIENT NEURAL NETWORKS WITH ELABORATE MATRIX STRUCTURES IN MACHINE LEARNING ENVIRONMENTS

    公开(公告)号:US20250053814A1

    公开(公告)日:2025-02-13

    申请号:US18805370

    申请日:2024-08-14

    Abstract: A mechanism is described for facilitating slimming of neural networks in machine learning environments. A method of embodiments, as described herein, includes learning a first neural network associated with machine learning processes to be performed by a processor of a computing device, where learning includes analyzing a plurality of channels associated with one or more layers of the first neural network. The method may further include computing a plurality of scaling factors to be associated with the plurality of channels such that each channel is assigned a scaling factor, wherein each scaling factor to indicate relevance of a corresponding channel within the first neural network. The method may further include pruning the first neural network into a second neural network by removing one or more channels of the plurality of channels having low relevance as indicated by one or more scaling factors of the plurality of scaling factors assigned to the one or more channels.

Patent Agency Ranking