-
1.
Publication No.: US20220114739A1
Publication date: 2022-04-14
Application No.: US17558588
Filing date: 2021-12-21
Inventors: Yu Gao, Xing Wang, Rui Ma, Chao Shen, Minghua Chen, Jie Liang, Jianbing Wu
Abstract: Embodiments described herein provide various examples of real-time visual object tracking. In another aspect, a process for performing a local re-identification of a target object that was detected earlier in a video but later lost during tracking is disclosed. This process begins by receiving a current video frame of the video and a predicted location of the target object. The process then places a current search window in the current video frame centered on or in the vicinity of the predicted location of the target object. Next, the process extracts a feature map from an image patch within the current search window. The process further retrieves a set of stored feature maps computed at a set of previously-determined locations of the target object from a set of previously-processed video frames in the video. The process next computes a set of correlation maps between the feature map and each of the set of stored feature maps. The process then attempts to re-identify the target object locally in the current video frame based on the set of computed correlation maps.
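The re-identification steps in this abstract can be illustrated with a minimal sketch. It is not the patented implementation: feature maps are assumed to be plain 2D lists, each correlation map is computed as an unnormalized sliding dot product, and the function names and the `threshold` parameter are hypothetical.

```python
from typing import List, Optional, Tuple

Map2D = List[List[float]]


def correlation_map(search: Map2D, template: Map2D) -> Map2D:
    """Slide `template` over `search` and return the map of dot-product scores."""
    sh, sw = len(search), len(search[0])
    th, tw = len(template), len(template[0])
    out = []
    for y in range(sh - th + 1):
        row = []
        for x in range(sw - tw + 1):
            score = sum(
                search[y + dy][x + dx] * template[dy][dx]
                for dy in range(th)
                for dx in range(tw)
            )
            row.append(score)
        out.append(row)
    return out


def reidentify(
    search: Map2D, stored: List[Map2D], threshold: float
) -> Optional[Tuple[int, int]]:
    """Return (row, col) of the strongest peak over all correlation maps.

    Returns None when no peak reaches `threshold` (an assumed acceptance rule).
    """
    best_score, best_pos = float("-inf"), None
    for template in stored:
        cmap = correlation_map(search, template)
        for y, row in enumerate(cmap):
            for x, score in enumerate(row):
                if score > best_score:
                    best_score, best_pos = score, (y, x)
    return best_pos if best_score >= threshold else None
```

A production tracker would more likely normalize the scores and compute the correlation in the frequency domain; the nested loops here only make the data flow explicit.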
-
2.
Publication No.: US10467458B2
Publication date: 2019-11-05
Application No.: US15789957
Filing date: 2017-10-20
Inventors: Xing Wang, Mehdi Seyfi, Minghua Chen, Him Wai Ng, Jie Liang
Abstract: Embodiments described herein provide various examples of a joint face-detection and head-pose-angle-estimation system based on using a small-scale hardware CNN module such as the built-in CNN module in the HiSilicon Hi3519 system-on-chip. In some embodiments, the disclosed joint face-detection and head-pose-angle-estimation system is configured to jointly perform multiple tasks of detecting most or all faces in a sequence of video frames, generating pose-angle estimations for the detected faces, tracking detected faces of a same person across the sequence of video frames, and generating a “best-pose” estimation for the person being tracked. The disclosed joint face-detection and pose-angle-estimation system can be implemented on resource-limited embedded systems such as smart camera systems that are only integrated with one or more small-scale CNN modules. The proposed system, in conjunction with a subimage-based technique, has made it possible to perform multiple face detection and face recognition tasks on high-resolution input images with small-scale low-cost CNN modules.
-
3.
Publication No.: US10268947B2
Publication date: 2019-04-23
Application No.: US15657109
Filing date: 2017-07-21
Inventors: Xing Wang, Mehdi Seyfi, Minghua Chen, Him Wai Ng, Jie Liang
IPC classification: G06K9/00, G06N3/04, G06T7/11, G06T7/246, G06K9/20, G06K9/62, G06N3/06, G06N3/08, G06T5/50, G06K9/46
Abstract: Embodiments described herein provide various examples of a face detection system based on using a small-scale hardware convolutional neural network (CNN) module configured into a multi-task cascaded CNN. In some embodiments, a subimage-based CNN system can be configured to be equivalent to a large-scale CNN that processes the entire input image without partitioning, such that the output of the subimage-based CNN system can be exactly identical to the output of the large-scale CNN. Based on this observation, some embodiments of this patent disclosure make use of the subimage-based CNN system and technique on one or more stages of a cascaded CNN or a multitask cascaded CNN (MTCNN) so that a larger input image to a given stage of the cascaded CNN or the MTCNN can be partitioned into a set of subimages of a smaller size. As a result, each stage of the cascaded CNN or the MTCNN can use the same small-scale hardware CNN module that is associated with a maximum input image size constraint.
-
4.
Publication No.: US20180150740A1
Publication date: 2018-05-31
Application No.: US15441194
Filing date: 2017-02-23
Inventors: Xing Wang, Him Wai Ng, Jie Liang
CPC classification: G06N3/04, G06K9/00228, G06K9/00268, G06K9/00275, G06K9/00288, G06K9/00402, G06K9/2054, G06K9/4628, G06K9/6232, G06K9/6256, G06K9/6274, G06K9/6289, G06K2009/00322, G06K2209/15, G06N3/0454, G06N3/06, G06N3/08, G06T5/50, G06T7/11, G06T7/248, G06T2200/28, G06T2207/20021, G06T2207/20084, G06T2207/20221, G06T2207/20224, G06T2207/30201, G06T2210/12
Abstract: Embodiments of a convolutional neural network (CNN) system based on using resolution-limited small-scale CNN modules are disclosed. In some embodiments, a CNN system includes: a receiving module for receiving an input image of a first image size, wherein the receiving module can be used to partition the input image into a set of subimages of a second image size; a first processing stage that includes a first hardware CNN module configured with a maximum input image size, wherein the first hardware CNN module is configured to sequentially receive the set of subimages and sequentially process the received subimages to generate a set of outputs; a merging module for merging the sets of outputs into a set of merged feature maps; and a second processing stage for receiving the set of merged feature maps and processing them to generate an output including at least one prediction on the input image.
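The partition, process, and merge flow of the first stage can be sketched as follows. This is an illustration under stated assumptions, not the disclosed hardware design: `small_module` is a hypothetical stand-in that applies 2x2 max pooling and enforces an input-size limit, images are 2D lists, the tile size is assumed to be even and to divide the image evenly, and the second processing stage is omitted.

```python
from typing import List

Image = List[List[float]]


def partition(image: Image, tile: int) -> List[List[Image]]:
    """Split `image` into a grid of non-overlapping tile x tile subimages."""
    h, w = len(image), len(image[0])
    if h % tile or w % tile:
        raise ValueError("image size must be a multiple of the tile size")
    return [
        [[row[x:x + tile] for row in image[y:y + tile]] for x in range(0, w, tile)]
        for y in range(0, h, tile)
    ]


def small_module(sub: Image, max_size: int) -> Image:
    """Stand-in for a size-limited CNN module: 2x2 max pooling (even sizes only)."""
    if len(sub) > max_size or len(sub[0]) > max_size:
        raise ValueError("subimage exceeds the module's maximum input size")
    return [
        [
            max(sub[y][x], sub[y][x + 1], sub[y + 1][x], sub[y + 1][x + 1])
            for x in range(0, len(sub[0]), 2)
        ]
        for y in range(0, len(sub), 2)
    ]


def merge(outputs: List[List[Image]]) -> Image:
    """Stitch per-subimage outputs back together in their original grid order."""
    merged = []
    for tile_row in outputs:
        for y in range(len(tile_row[0])):
            merged.append([v for out in tile_row for v in out[y]])
    return merged


def first_stage(image: Image, tile: int, max_size: int) -> Image:
    """Process each subimage with the small module, then merge the outputs."""
    grid = partition(image, tile)
    outputs = [[small_module(sub, max_size) for sub in row] for row in grid]
    return merge(outputs)
```

With pooling as the stand-in operation, the merged result matches what an unconstrained module would produce on the whole image, because no pooling window straddles a tile boundary. Real convolution layers need more careful tile sizing to preserve that property.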
-
5.
Publication No.: US20200205697A1
Publication date: 2020-07-02
Application No.: US16731025
Filing date: 2019-12-30
Inventors: Jiannan Zheng, Chao Shen, Dong Zhang, Jie Liang
Abstract: Various embodiments of a video-based fall risk assessment system are disclosed. During operation, this fall risk assessment system can receive a sequence of video frames including a person being monitored for fall risk assessment. The system next generates a sequence of action labels for the sequence of video frames by, for each video frame in the sequence of video frames: estimating a pose of the person within the video frame; and classifying the estimated pose as a given action among a set of predetermined actions. Next, the system identifies a subset of action labels within the sequence of action labels. The system next extracts a set of gait features for the person from a subset of video frames within the sequence of video frames corresponding to the subset of action labels. Subsequently, the system analyzes the set of extracted gait features to generate a fall risk assessment for the person. In some embodiments, the sequence of video frames is captured during a predetermined time period, such as an hour, a day, or a week.
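The label-selection and feature-extraction steps can be sketched as below. The abstract does not name the actions or the gait features, so everything specific here is an assumption made for illustration: the per-frame labels are taken as given, "walking" is the selected action, and the single feature is the average horizontal hip speed.

```python
from typing import List, Sequence


def select_frames(labels: Sequence[str], wanted: str) -> List[int]:
    """Return the indices of the frames whose action label equals `wanted`."""
    return [i for i, label in enumerate(labels) if label == wanted]


def mean_hip_speed(hip_x: Sequence[float], frames: List[int], fps: float) -> float:
    """Toy gait feature: mean horizontal hip speed over adjacent selected frames.

    `hip_x` holds one hypothetical hip x-coordinate per video frame; only pairs
    of consecutive frame indices contribute, so gaps between runs are skipped.
    """
    pairs = [(a, b) for a, b in zip(frames, frames[1:]) if b == a + 1]
    if not pairs:
        return 0.0
    total = sum(abs(hip_x[b] - hip_x[a]) for a, b in pairs)
    return total * fps / len(pairs)


# Example: five frames, three of them labeled as walking.
labels = ["sitting", "walking", "walking", "walking", "standing"]
hip_x = [0.0, 0.0, 0.1, 0.3, 0.3]
walking = select_frames(labels, "walking")
speed = mean_hip_speed(hip_x, walking, fps=10.0)
```

A full system would feed several such features into a risk model; that analysis step is outside this sketch.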
-
6.
Publication No.: US20190205620A1
Publication date: 2019-07-04
Application No.: US15859652
Filing date: 2017-12-31
Inventors: Zili Yi, Xing Wang, Him Wai Ng, Sami Ma, Jie Liang
CPC classification: G06K9/00288, G06K9/00228, G06K9/00268, G06K9/00926, G06K9/4628, G06K9/6212, G06K9/6215, G06K9/6257, G06N5/022, G06N20/00
Abstract: Embodiments described herein provide various examples of a face-image training data preparation system for performing large-scale face-image training data acquisition, pre-processing, cleaning, balancing, and post-processing. The disclosed training data preparation system can collect a very large set of loosely-labeled images of different people from the public domain, and then generate a raw training dataset including a set of incorrectly-labeled face images. The disclosed training data preparation system can then perform cleaning and balancing operations on the raw training dataset to generate a high-quality face-image training dataset free of the incorrectly-labeled face images. The processed high-quality face-image training dataset can be subsequently used to train deep-neural-network-based face recognition systems to achieve high performance in various face recognition applications. Compared to conventional face recognition systems and techniques, the disclosed training data preparation system and technique provide a fully-automatic, highly-deterministic and high-quality training data preparation procedure which does not rely heavily on assumptions.
-
7.
Publication No.: US20190026538A1
Publication date: 2019-01-24
Application No.: US15789957
Filing date: 2017-10-20
Inventors: Xing Wang, Mehdi Seyfi, Minghua Chen, Him Wai Ng, Jie Liang
CPC classification: G06K9/00261, G06K9/00228, G06K9/00268, G06K9/00288, G06K9/3233, G06K9/4628, G06K9/4642, G06K9/4652, G06K9/627, G06T7/20, G06T7/74, G06T2207/10016, G06T2207/20081, G06T2207/20084, G06T2207/30201
Abstract: Embodiments described herein provide various examples of a joint face-detection and head-pose-angle-estimation system based on using a small-scale hardware CNN module such as the built-in CNN module in the HiSilicon Hi3519 system-on-chip. In some embodiments, the disclosed joint face-detection and head-pose-angle-estimation system is configured to jointly perform multiple tasks of detecting most or all faces in a sequence of video frames, generating pose-angle estimations for the detected faces, tracking detected faces of a same person across the sequence of video frames, and generating a “best-pose” estimation for the person being tracked. The disclosed joint face-detection and pose-angle-estimation system can be implemented on resource-limited embedded systems such as smart camera systems that are only integrated with one or more small-scale CNN modules. The proposed system, in conjunction with a subimage-based technique, has made it possible to perform multiple face detection and face recognition tasks on high-resolution input images with small-scale low-cost CNN modules.
-
8.
Publication No.: US20180150681A1
Publication date: 2018-05-31
Application No.: US15657109
Filing date: 2017-07-21
Inventors: Xing Wang, Mehdi Seyfi, Minghua Chen, Him Wai Ng, Jie Liang
CPC classification: G06N3/04, G06K9/00228, G06K9/00268, G06K9/00275, G06K9/00288, G06K9/00402, G06K9/2054, G06K9/4628, G06K9/6232, G06K9/6256, G06K9/6274, G06K9/6289, G06K2009/00322, G06K2209/15, G06N3/0454, G06N3/06, G06N3/08, G06T5/50, G06T7/11, G06T7/248, G06T2200/28, G06T2207/20021, G06T2207/20084, G06T2207/20221, G06T2207/20224, G06T2207/30201, G06T2210/12
Abstract: Embodiments described herein provide various examples of a face detection system based on using a small-scale hardware convolutional neural network (CNN) module configured into a multi-task cascaded CNN. In some embodiments, a subimage-based CNN system can be configured to be equivalent to a large-scale CNN that processes the entire input image without partitioning, such that the output of the subimage-based CNN system can be exactly identical to the output of the large-scale CNN. Based on this observation, some embodiments of this patent disclosure make use of the subimage-based CNN system and technique on one or more stages of a cascaded CNN or a multitask cascaded CNN (MTCNN) so that a larger input image to a given stage of the cascaded CNN or the MTCNN can be partitioned into a set of subimages of a smaller size. As a result, each stage of the cascaded CNN or the MTCNN can use the same small-scale hardware CNN module that is associated with a maximum input image size constraint.
-
9.
Publication No.: US11645765B2
Publication date: 2023-05-09
Application No.: US17558588
Filing date: 2021-12-21
Inventors: Yu Gao, Xing Wang, Rui Ma, Chao Shen, Minghua Chen, Jie Liang, Jianbing Wu
CPC classification: G06T7/262, B64C39/024, G06V10/255, B64C2201/127, G06T2207/20056
Abstract: Embodiments described herein provide various examples of real-time visual object tracking. In another aspect, a process for performing a local re-identification of a target object that was detected earlier in a video but later lost during tracking is disclosed. This process begins by receiving a current video frame of the video and a predicted location of the target object. The process then places a current search window in the current video frame centered on or in the vicinity of the predicted location of the target object. Next, the process extracts a feature map from an image patch within the current search window. The process further retrieves a set of stored feature maps computed at a set of previously-determined locations of the target object from a set of previously-processed video frames in the video. The process next computes a set of correlation maps between the feature map and each of the set of stored feature maps. The process then attempts to re-identify the target object locally in the current video frame based on the set of computed correlation maps.
-
10.
Publication No.: US20190304120A1
Publication date: 2019-10-03
Application No.: US15943978
Filing date: 2018-04-03
Inventors: Rui Ma, Chao Shen, Yu Gao, Ye Lu, Minghua Chen, Jie Liang, Jianbing Wu
Abstract: Embodiments described herein provide various examples of an automatic obstacle avoidance system for unmanned vehicles using embedded stereo vision techniques. In one aspect, a UAV capable of performing autonomous obstacle detection and avoidance is disclosed. This UAV includes: a stereo vision camera set coupled to the one or more processors and the memory to capture a sequence of stereo images; and a stereo vision module configured to: receive a pair of stereo images captured by a pair of stereo vision cameras; perform a border cropping operation on the pair of stereo images to obtain a pair of cropped stereo images; perform a subsampling operation on the pair of cropped stereo images to obtain a pair of subsampled stereo images; and perform a dense stereo matching operation on the pair of subsampled stereo images to generate a dense three-dimensional (3D) point map of a space corresponding to the pair of stereo images.
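The two preprocessing operations named in this abstract, border cropping and subsampling, can be sketched as follows. The abstract gives no parameters, so the `border` and `step` arguments and the function names are hypothetical, images are assumed to be 2D lists of pixel values, and the dense stereo matching step is left out.

```python
from typing import List, Tuple

Image = List[List[float]]


def crop_border(image: Image, border: int) -> Image:
    """Remove `border` pixels from every side of the image."""
    h, w = len(image), len(image[0])
    if border < 0 or 2 * border >= h or 2 * border >= w:
        raise ValueError("border must be non-negative and leave a non-empty image")
    return [row[border:w - border] for row in image[border:h - border]]


def subsample(image: Image, step: int) -> Image:
    """Keep every `step`-th pixel along both axes."""
    if step < 1:
        raise ValueError("step must be at least 1")
    return [row[::step] for row in image[::step]]


def preprocess_pair(
    left: Image, right: Image, border: int, step: int
) -> Tuple[Image, Image]:
    """Apply the same crop and subsampling to both images of a stereo pair."""
    return (
        subsample(crop_border(left, border), step),
        subsample(crop_border(right, border), step),
    )
```

Applying identical parameters to both images keeps the pair aligned, which a later matching step would depend on. Subsampling by `step` also cuts the pixel count by roughly `step` squared, which is the likely motivation on an embedded platform.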
-