LOW-POWER CHANGE-BASED NEURAL NETWORK INFERENCE FOR IMAGE PROCESSING

    公开(公告)号:US20230377321A1

    公开(公告)日:2023-11-23

    申请号:US17664262

    申请日:2022-05-20

    CPC classification number: G06V10/82 G06V20/46 G06V20/70 G06V10/42

    Abstract: One or more aspects of the present disclosure enable high accuracy computer vision and image processing techniques with decreased system resource requirements (e.g., with decreased computational load, shallower neural network designs, etc.). As described in more detail herein, one or more aspects of the described techniques may leverage key layers (e.g., certain key layers of a neural network) and compressed tensor comparisons to efficiently exploit temporal redundancy in videos and other slow changing signals (e.g., to efficiently reduce neural network inference computational burden, with only minor increase in data transfer power consumption). For example, key layers of a neural network may be identified, and temporal/spatial redundancy across frames may be efficiently leveraged such that only a computation region in a subsequent frame n+1 is re-computed in layers between identified key layers, while remaining feature-map calculations may be disabled in the layers between the identified key layers.

    Region of interest selection for object detection

    公开(公告)号:US11461992B2

    公开(公告)日:2022-10-04

    申请号:US17095883

    申请日:2020-11-12

    Abstract: An object detection system may generate regions of interest (ROIs) from an input image that can be processed by a wide range of object detectors. According to the techniques described herein, an image is processed by a light-weight neural network (e.g., a heatmap network) that outputs object center and object scale heat-maps. The heatmaps are processed to define ROIs that are likely to include objects. Overlapping ROIs are then merged to reduce the aggregate size of the ROIs, and merged ROIs are downscaled to a reduced set of pre-defined resolutions. Fully-convolutional, high-accuracy object detectors may then operate on the downscaled ROIs to output accurate detections at a fraction of the computations by operating on a reduced image. For example, fully-convolutional, high-accuracy object detectors may operate on a subset of the entire image (e.g., cropped images based on ROIs) thus reducing computations otherwise performed over the entire image.

    NEURAL NETWORK LAYER FOLDING
    4.
    发明申请

    公开(公告)号:US20220327386A1

    公开(公告)日:2022-10-13

    申请号:US17399374

    申请日:2021-08-11

    Abstract: The present disclosure describes neural network reduction techniques for decreasing the number of neurons or layers in a neural network. Embodiments of the method, apparatus, non-transitory computer readable medium, and system are configured to receive a trained neural network and replace certain non-linear activation units with an identity function. Next, linear blocks may then be folded to form a single block in places where the non-linear activation units were replaced by an identity function. Such techniques may reduce the number of layers in the neural network, which may optimize power and computation efficiency of the neural network architecture (e.g., without unduly influencing the accuracy of the network model).

    REGION OF INTEREST SELECTION FOR OBJECT DETECTION

    公开(公告)号:US20220147751A1

    公开(公告)日:2022-05-12

    申请号:US17095883

    申请日:2020-11-12

    Abstract: An object detection system may generate regions of interest (ROIs) from an input image that can be processed by a wide range of object detectors. According to the techniques described herein, an image is processed by a light-weight neural network (e.g., a heatmap network) that outputs object center and object scale heat-maps. The heatmaps are processed to define ROIs that are likely to include objects. Overlapping ROIs are then merged to reduce the aggregate size of the ROIs, and merged ROIs are downscaled to a reduced set of pre-defined resolutions. Fully-convolutional, high-accuracy object detectors may then operate on the downscaled ROIs to output accurate detections at a fraction of the computations by operating on a reduced image. For example, fully-convolutional, high-accuracy object detectors may operate on a subset of the entire image (e.g., cropped images based on ROIs) thus reducing computations otherwise performed over the entire image.

Patent Agency Ranking