-
公开(公告)号:US20180018535A1
公开(公告)日:2018-01-18
申请号:US15300474
申请日:2015-12-10
Applicant: Intel Corporation
Inventor: Jianguo Li , Yurong Chen , Jianwei Luo
CPC classification number: G06K9/4628 , G06K9/6269 , G06K9/66 , G06N3/04 , G06N3/0454 , G06N3/08
Abstract: A processing device for performing visual recognition using deep learning attributes and method for performing the same are described. In one embodiment, a processing device comprises: an interface to receive an input image; and a recognition unit coupled to the interface and operable to perform visual object recognition on the input image, where the recognition unit has an extractor to extract region proposals from the input image, a convolutional neural network (CNN) to compute features for each extracted region proposal, the CNN being operable to create a soft-max layer output, a cross region pooling unit operable to perform pooling of the soft-max layer output to create a set of attributes of the input image, and an image classifier operable to perform image classification based on the attributes of the input image.
-
公开(公告)号:US09971953B2
公开(公告)日:2018-05-15
申请号:US15300474
申请日:2015-12-10
Applicant: Intel Corporation
Inventor: Jianguo Li , Yurong Chen , Jianwei Luo
CPC classification number: G06K9/4628 , G06K9/6269 , G06K9/66 , G06N3/04 , G06N3/0454 , G06N3/08
Abstract: A processing device for performing visual recognition using deep learning attributes and method for performing the same are described. In one embodiment, a processing device comprises: an interface to receive an input image; and a recognition unit coupled to the interface and operable to perform visual object recognition on the input image, where the recognition unit has an extractor to extract region proposals from the input image, a convolutional neural network (CNN) to compute features for each extracted region proposal, the CNN being operable to create a soft-max layer output, a cross region pooling unit operable to perform pooling of the soft-max layer output to create a set of attributes of the input image, and an image classifier operable to perform image classification based on the attributes of the input image.
-