-
公开(公告)号:US11461992B2
公开(公告)日:2022-10-04
申请号:US17095883
申请日:2020-11-12
Applicant: SAMSUNG ELECTRONICS CO., LTD.
Inventor: Ran Vitek , Alexandra Dana , Maor Shutman , Matan Shoef , Yotam Perlitz , Tomer Peleg , Netanel Stein , Roy Josef Jevnisek
Abstract: An object detection system may generate regions of interest (ROIs) from an input image that can be processed by a wide range of object detectors. According to the techniques described herein, an image is processed by a light-weight neural network (e.g., a heatmap network) that outputs object center and object scale heat-maps. The heatmaps are processed to define ROIs that are likely to include objects. Overlapping ROIs are then merged to reduce the aggregate size of the ROIs, and merged ROIs are downscaled to a reduced set of pre-defined resolutions. Fully-convolutional, high-accuracy object detectors may then operate on the downscaled ROIs to output accurate detections at a fraction of the computations by operating on a reduced image. For example, fully-convolutional, high-accuracy object detectors may operate on a subset of the entire image (e.g., cropped images based on ROIs) thus reducing computations otherwise performed over the entire image.
-
公开(公告)号:US20220147751A1
公开(公告)日:2022-05-12
申请号:US17095883
申请日:2020-11-12
Applicant: Samsung Electronics Co., LTD.
Inventor: Ran Vitek , Alexandra Dana , Maor Shutman , Matan Shoef , Yotam Perlitz , Tomer Peleg , Netanel Stein , Roy Josef Jevnisek
Abstract: An object detection system may generate regions of interest (ROIs) from an input image that can be processed by a wide range of object detectors. According to the techniques described herein, an image is processed by a light-weight neural network (e.g., a heatmap network) that outputs object center and object scale heat-maps. The heatmaps are processed to define ROIs that are likely to include objects. Overlapping ROIs are then merged to reduce the aggregate size of the ROIs, and merged ROIs are downscaled to a reduced set of pre-defined resolutions. Fully-convolutional, high-accuracy object detectors may then operate on the downscaled ROIs to output accurate detections at a fraction of the computations by operating on a reduced image. For example, fully-convolutional, high-accuracy object detectors may operate on a subset of the entire image (e.g., cropped images based on ROIs) thus reducing computations otherwise performed over the entire image.
-
公开(公告)号:US20230368520A1
公开(公告)日:2023-11-16
申请号:US17663035
申请日:2022-05-12
Applicant: SAMSUNG ELECTRONICS CO., LTD.
Inventor: Ishay Goldin , Netanel Stein , Alexandra Dana , Alon Intrater , David Tsidkiahu , Nathan Levy , Omer Shabtai , Ran Vitek , Tal Heller , Yaron Ukrainitz , Yotam Platner , Zuf Pilosof
CPC classification number: G06V10/96 , G06T3/4046 , G06V20/49 , G06V10/82 , G06V20/41 , G06V20/70 , G06V10/774
Abstract: Techniques and apparatuses enabling high accuracy video object detection using reduced system resource requirements (e.g., reduced computational load, shallower neural network designs, etc.) are described. For example, a search domain of an object detection scheme (e.g., a target object class, a target object size, a target object rotation angle, etc.) may be separated into subdomains (e.g., such as subdomains of object classes, subdomains of object sizes, subdomains object rotation angles, etc.). Specialized, subdomain-level object detection/segmentation tasks may then be separated across sequential video frames. As such, different subdomain-level processing techniques (e.g., via specialized neural networks) may be implemented across different frames of a video sequence. Moreover, redundancy information of consecutive video frames may be leveraged, such that specialized object detection tasks combined with visual object tracking across consecutive frames may enable more efficient (e.g., more accurate, less computationally intensive, etc.) full domain object detection and object segmentation schemes.
-
-