-
Publication number: US20240045925A1
Publication date: 2024-02-08
Application number: US18266744
Application date: 2022-02-02
Applicant: CARNEGIE MELLON UNIVERSITY
Inventor: Marios Savvides , Chenchen Zhu , Fangyi Chen , Uzair Ahmed , Ran Tao
IPC: G06F18/2136 , G06N3/04 , G06N5/02
CPC classification number: G06F18/2136 , G06N3/04 , G06N5/02
Abstract: Disclosed herein is an improved few-shot detector which utilizes a dynamic semantic network which takes as input a language feature and generates trainable parameters for a visual network. The visual network takes a visual feature as input and generates a classification and localization of an object.
-
Publication number: US12026226B2
Publication date: 2024-07-02
Application number: US17408674
Application date: 2021-08-23
Applicant: CARNEGIE MELLON UNIVERSITY
Inventor: Marios Savvides , Chenchen Zhu , Fangyi Chen , Uzair Ahmed , Ran Tao
IPC: G06F18/2136 , G06N3/04 , G06N5/02
CPC classification number: G06F18/2136 , G06N3/04 , G06N5/02
Abstract: Disclosed herein is an improved few-shot detector which utilizes semantic relation reasoning to learn novel objects from both visual information and the semantic relation of base class objects. Specifically, a semantic space is constructed using word embeddings. Guided by the word embeddings of the classes, the detector is trained to project the objects from the visual space to the semantic space and to align their image representations with the corresponding class embeddings.
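The projection-and-alignment idea in this abstract can be illustrated with a minimal sketch. It assumes a plain linear projection and dot-product similarity, neither of which is stated in the abstract; the names `project`, `classify`, `projection`, and `class_embeddings` are hypothetical and the numbers are toy values.

```python
from typing import Dict, List, Tuple

Vector = List[float]


def project(projection: List[Vector], visual_feature: Vector) -> Vector:
    """Map a visual feature into the semantic space (assumed linear map)."""
    return [sum(w * x for w, x in zip(row, visual_feature)) for row in projection]


def classify(
    visual_feature: Vector,
    projection: List[Vector],
    class_embeddings: Dict[str, Vector],
) -> Tuple[str, Dict[str, float]]:
    """Score each class by the dot product between the projected feature
    and that class's word embedding, then return the best-scoring class."""
    semantic = project(projection, visual_feature)
    scores = {
        name: sum(a * b for a, b in zip(semantic, embedding))
        for name, embedding in class_embeddings.items()
    }
    best = max(scores, key=scores.get)
    return best, scores


# Toy example: 3-d visual feature projected into a 2-d semantic space.
projection = [[1.0, 0.0, 0.0], [0.0, 1.0, 0.0]]
class_embeddings = {"cat": [1.0, 0.0], "dog": [0.0, 1.0]}
label, scores = classify([0.2, 0.9, 0.5], projection, class_embeddings)
```

In a trained detector the projection would be learned so that object features land near the embedding of their class; here it is fixed by hand purely for illustration.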
-
Publication number: US11954175B2
Publication date: 2024-04-09
Application number: US17386879
Application date: 2021-07-28
Applicant: CARNEGIE MELLON UNIVERSITY
Inventor: Fangyi Chen , Chenchen Zhu , Zhiqiang Shen , Han Zhang , Marios Savvides
IPC: G06F18/214 , G06F18/213 , G06F18/2431 , G06N5/04 , G06N20/00
CPC classification number: G06F18/2148 , G06F18/213 , G06F18/2431 , G06N5/04 , G06N20/00
Abstract: Disclosed herein is an improvement to prior art feature pyramids for general object detection that inserts a simple norm calibration (NC) operation between the feature pyramids and detection head to alleviate and balance the norm bias caused by the feature pyramid network (FPN) and which leverages an enhanced multi-feature selective strategy (MS) during training to assign the ground-truth to one or more levels of the feature pyramid.
-
Publication number: US20240046621A1
Publication date: 2024-02-08
Application number: US18491059
Application date: 2023-10-20
Applicant: CARNEGIE MELLON UNIVERSITY
Inventor: Marios Savvides , Fangyi Chen , Han Zhang , ChenChen Zhu
IPC: G06V10/774 , G06V10/77 , G06V10/82 , G06V10/766 , G06V10/764 , G06V10/776 , G06V30/18 , G06V30/19
CPC classification number: G06V10/774 , G06V10/7715 , G06V10/82 , G06V10/766 , G06V10/764 , G06V10/776 , G06V30/1801 , G06V30/19093 , G06V30/19147
Abstract: Disclosed herein are designs for two baselines to detect products in a retail setting. A novel detector, referred to herein as RetailDet, detects quadrilateral products. To match products using visual texts on 2D space, text features are encoded with spatial positional encoding and the Hungarian Algorithm that calculates optimal assignment plans between varying text sequences is used.
-
Publication number: US20240046503A1
Publication date: 2024-02-08
Application number: US18266737
Application date: 2022-01-31
Applicant: CARNEGIE MELLON UNIVERSITY
Inventor: Fangyi Chen , Shayeree Sarkar
IPC: G06T7/70 , G06V10/774
CPC classification number: G06T7/70 , G06V10/774 , G06V2201/07
Abstract: Disclosed herein is an improved method for identifying images containing objects-of-interest from a large set of images. The method comprises mixing two or more of the images to create a grouped image and exposing the grouped image to an object detector trained on grouped images to make an initial determination that the grouped image was formed from at least one image containing an object-of-interest. The images which formed the grouped image are then exposed to regular object detectors to determine a classification of the object-of-interest.
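The two-stage screening described in this abstract can be sketched as follows. This is only an illustration under stated assumptions: images are flattened lists of pixel intensities, "mixing" is taken to be pixel-wise averaging (the abstract does not say how images are mixed), and `group_detector` and `regular_detector` are hypothetical callables standing in for trained models.

```python
from typing import Callable, Dict, List, Optional, Sequence

Image = List[float]  # flattened pixel intensities; a stand-in for a real image


def mix_images(images: Sequence[Image]) -> Image:
    """Blend images into one grouped image (assumption: pixel-wise mean)."""
    count = len(images)
    return [sum(pixels) / count for pixels in zip(*images)]


def screen(
    images: Sequence[Image],
    group_size: int,
    group_detector: Callable[[Image], bool],
    regular_detector: Callable[[Image], Optional[str]],
) -> Dict[int, str]:
    """Return {image index: class label} for images containing an object."""
    hits: Dict[int, str] = {}
    for start in range(0, len(images), group_size):
        group = images[start:start + group_size]
        # Stage 1: one detector pass on the grouped image.
        if not group_detector(mix_images(group)):
            continue  # no member of this group needs individual inspection
        # Stage 2: regular detector on each image that formed the group.
        for offset, image in enumerate(group):
            label = regular_detector(image)
            if label is not None:
                hits[start + offset] = label
    return hits


# Toy detectors based on peak intensity, for illustration only.
images = [[0.0, 0.0], [0.0, 0.0], [1.0, 0.0], [0.0, 0.0]]
result = screen(
    images,
    group_size=2,
    group_detector=lambda img: max(img) > 0.2,
    regular_detector=lambda img: "object" if max(img) > 0.5 else None,
)
```

The saving comes from stage 1: when objects-of-interest are rare, most groups are rejected with a single detector call instead of one call per image.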
-
Publication number: US12266156B2
Publication date: 2025-04-01
Application number: US17670737
Application date: 2022-02-14
Applicant: CARNEGIE MELLON UNIVERSITY
Inventor: Marios Savvides , Zhiqiang Shen , Fangyi Chen , Han Zhang
IPC: G06V10/774
Abstract: Disclosed herein is a system and method for improving the accuracy of an object detector when trained with a dataset having a significant number of missing annotations. The method uses a novel Background Recalibration Loss (BRL) which adjusts the gradient direction according to its own activation to reduce the adverse effect of error signals by replacing the negative branch of the focal loss with a mirror of the positive branch when the activation is below a confusion threshold.
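One possible reading of the loss described in this abstract is sketched below for a single background (negative) sample. The assumptions are loud ones: "activation" is taken to mean the model's confidence in the background label, `p_t = 1 - p`, where `p` is the predicted foreground probability; the threshold and `gamma` values are placeholders; and the function names are hypothetical. The patent's exact formulation may differ.

```python
import math


def focal_negative(p: float, gamma: float = 2.0) -> float:
    """Standard focal-loss negative branch; p is the predicted foreground
    probability and must lie strictly between 0 and 1."""
    return -(p ** gamma) * math.log(1.0 - p)


def focal_positive(p: float, gamma: float = 2.0) -> float:
    """Standard focal-loss positive branch for foreground probability p."""
    return -((1.0 - p) ** gamma) * math.log(p)


def brl_negative(p: float, threshold: float = 0.5, gamma: float = 2.0) -> float:
    """Sketch of a recalibrated loss for a sample labelled as background.

    Assumption: when background confidence p_t = 1 - p falls below the
    confusion threshold, the sample may be an unannotated object, so the
    negative branch is swapped for the mirrored positive branch.
    """
    p_t = 1.0 - p
    if p_t >= threshold:
        return focal_negative(p, gamma)
    return focal_positive(p, gamma)


confident_background = brl_negative(0.1)  # ordinary negative branch applies
likely_missing_label = brl_negative(0.9)  # mirrored positive branch applies
```

Under this reading, a region the detector strongly believes is an object is no longer penalized heavily for disagreeing with a background label, which is the situation that arises when the annotation is simply missing.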
-
Publication number: US12189714B2
Publication date: 2025-01-07
Application number: US18266744
Application date: 2022-02-02
Applicant: CARNEGIE MELLON UNIVERSITY
Inventor: Marios Savvides , Chenchen Zhu , Fangyi Chen , Uzair Ahmed , Ran Tao
IPC: G06F18/2136 , G06N3/04 , G06N5/02
Abstract: Disclosed herein is an improved few-shot detector which utilizes a dynamic semantic network which takes as input a language feature and generates trainable parameters for a visual network. The visual network takes a visual feature as input and generates a classification and localization of an object.
-
Publication number: US20220058425A1
Publication date: 2022-02-24
Application number: US17408778
Application date: 2021-08-23
Applicant: CARNEGIE MELLON UNIVERSITY
Inventor: Marios Savvides , Chenchen Zhu , Fangyi Chen , Uzair Ahmed , Ran Tao
Abstract: Disclosed herein is a system and method of identifying new products on a retail shelf using a feature extractor trained to extract features from images of products on the shelf and output identifying information regarding the product in the product image. The extracted features are compared to extracted features in a product library and a best-fit is obtained. A new product is identified if the distance between the features of the product on the shelf and the features of the best-fit product from the product library is above a predetermined threshold.
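The best-fit-plus-threshold test in this abstract can be sketched in a few lines. The sketch assumes Euclidean distance and a dictionary-based product library, neither of which is specified in the abstract; `match_or_flag_new`, the library contents, and the threshold value are hypothetical.

```python
import math
from typing import Dict, List, Tuple

Vector = List[float]


def euclidean(a: Vector, b: Vector) -> float:
    """Euclidean distance between two feature vectors of equal length."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def match_or_flag_new(
    query: Vector, library: Dict[str, Vector], threshold: float
) -> Tuple[str, float, bool]:
    """Find the best-fit library product for a shelf feature vector.

    Returns (best-fit name, distance to it, is_new), where is_new is True
    when even the best fit is farther away than the threshold.
    """
    best_name, best_distance = min(
        ((name, euclidean(query, feature)) for name, feature in library.items()),
        key=lambda item: item[1],
    )
    return best_name, best_distance, best_distance > threshold


# Toy library with 2-d features, for illustration only.
library = {"cola": [1.0, 0.0], "chips": [0.0, 1.0]}
known = match_or_flag_new([0.9, 0.1], library, threshold=0.5)
unknown = match_or_flag_new([5.0, 4.0], library, threshold=0.5)
```

The threshold trades off false alarms against missed new products, so in practice it would be tuned on held-out data rather than fixed by hand.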
-
Publication number: US12131497B2
Publication date: 2024-10-29
Application number: US18266737
Application date: 2022-01-31
Applicant: CARNEGIE MELLON UNIVERSITY
Inventor: Fangyi Chen , Shayeree Sarkar , Marios Savvides
IPC: G06V10/774 , G06T7/70
CPC classification number: G06T7/70 , G06V10/774 , G06V2201/07
Abstract: Disclosed herein is an improved method for identifying images containing objects-of-interest from a large set of images. The method comprises mixing two or more of the images to create a grouped image and exposing the grouped image to an object detector trained on grouped images to make an initial determination that the grouped image was formed from at least one image containing an object-of-interest. The images which formed the grouped image are then exposed to regular object detectors to determine a classification of the object-of-interest.
-
Publication number: US20240355085A1
Publication date: 2024-10-24
Application number: US18587200
Application date: 2024-02-26
Applicant: CARNEGIE MELLON UNIVERSITY
Inventor: Marios Savvides , Chenchen Zhu , Fangyi Chen , Uzair Ahmed , Ran Tao
IPC: G06V10/44 , G06F18/214 , G06T7/73 , G06V10/25 , G06V20/20
CPC classification number: G06V10/443 , G06F18/214 , G06T7/73 , G06V10/25 , G06V20/20
Abstract: Disclosed herein is a system and method for matching products detected in an image of a shelf. The match or non-match of the products is then used to make a determination that the products are correctly positioned on the shelf or if the positioning of the products represents a plug or spread situation.