-
公开(公告)号:US10860836B1
公开(公告)日:2020-12-08
申请号:US16192433
申请日:2018-11-15
Applicant: Amazon Technologies, Inc.
Inventor: Ambrish Tyagi , Amit Kumar Agrawal , Siddhartha Chandra , Visesh Uday Kumar Chari , Shashank Tripathi , James Rehg
Abstract: Techniques are generally described for object detection in image data. First image data comprising a first plurality of pixel values representing an object and a second plurality of pixel values representing a background may be received. First foreground image data and first background image data may be generated from the first image data. A first feature vector representing the first plurality of pixel values may be generated. A second feature vector representing a first plurality of pixel values of second background image data may be generated. A first machine learning model may determine a first operation to perform on the first foreground image data. A transformed representation of the first foreground image data may be generated by performing the first operation on the first foreground image data. Composite image data may be generated by compositing the transformed representation of the first foreground image data with the second background image data.
-
公开(公告)号:US11887252B1
公开(公告)日:2024-01-30
申请号:US17412072
申请日:2021-08-25
Applicant: Amazon Technologies, Inc.
Inventor: Siddhartha Chandra , Visesh Uday Kumar Chari , Prakash Ramu , Antonio Criminisi , F Noam Sorek , Apoorv Chaudhri
IPC: G06T15/00 , G06T17/00 , G06T19/20 , G06T15/04 , G06V40/10 , G06V40/16 , G06F18/22 , G06F18/214 , G06N3/045
CPC classification number: G06T17/00 , G06F18/214 , G06F18/22 , G06N3/045 , G06T15/04 , G06T19/20 , G06V40/103 , G06V40/168 , G06T2219/2004 , G06T2219/2012 , G06T2219/2021
Abstract: Described are systems and methods directed to generation and subsequent update of a dimensionally accurate body model of a body, such as a human body, based on two-dimensional (“2D”) images of at least a portion of that body and/or face images of a face of the body. A user may use a 2D camera, such as a digital camera typically included in many of today's portable devices (e.g., cell phones, tablets, laptops, etc.) to produce body images that are used to generate a body model of the body of the user. Subsequently, the body model may be updated based on a face image of the face of the user, without requiring the user to provide another body image.
-
公开(公告)号:US10909349B1
公开(公告)日:2021-02-02
申请号:US16450499
申请日:2019-06-24
Applicant: Amazon Technologies, Inc.
Inventor: Shashank Tripathi , Visesh Chari , Ambrish Tyagi , Amit Kumar Agrawal , James Rehg , Siddhartha Chandra
Abstract: Techniques are generally described for object detection in image data. First image data comprising a three-dimensional model representing an object may be received. First background image data comprising a first plurality of pixel values may be received. A first feature vector representing the three-dimensional model may be generated. A second feature vector representing the first plurality of pixel values of the first background image data may be generated. A first machine learning model may generate a transformed representation of the three-dimensional model using the first feature vector. First foreground image data comprising a two-dimensional representation of the transformed representation of the three-dimensional model may be generated. A frame of composite image data may be generated by combining the first foreground image data with the first background image data.
-
公开(公告)号:US11903730B1
公开(公告)日:2024-02-20
申请号:US16582840
申请日:2019-09-25
Applicant: Amazon Technologies, Inc.
Inventor: Apoorv Chaudhri , Siddhartha Chandra , Prakash Ramu , Amit Kumar Agrawal , Sigal Raab , Anantharanga Prithviraj , Ram Sever , Ita Lifshitz , Ayush Sharma , Anna Shtengel , Gal Levi , Rajesh Gautam
IPC: A61B5/00 , G06T7/11 , G06T7/194 , G06T3/40 , G06T7/00 , G06N3/04 , G06N3/08 , G16H30/40 , G06T7/60
CPC classification number: A61B5/4872 , A61B5/0077 , A61B5/7264 , A61B5/7278 , A61B5/742 , G06N3/04 , G06N3/08 , G06T3/40 , G06T7/0014 , G06T7/11 , G06T7/194 , G06T7/60 , G16H30/40 , A61B2560/0431 , A61B2576/00 , G06T2207/20084 , G06T2207/30196
Abstract: Described are systems and methods that use one or more two-dimensional (“2D”) body images of a body to determine body fat measurements of that body. For example, a standard 2D camera of a portable device, such as a cell phone, tablet, laptop, etc., may be used to generate one or more 2D body images of a user. Those 2D body images, or image, may be processed using the disclosed implementations to determine a body fat measurement of the body represented in the image.
-
公开(公告)号:US11631260B1
公开(公告)日:2023-04-18
申请号:US17132738
申请日:2020-12-23
Applicant: Amazon Technologies, Inc.
Inventor: Shashank Tripathi , Visesh Chari , Ambrish Tyagi , Amit Kumar Agrawal , James Rehg , Siddhartha Chandra
Abstract: Techniques are generally described for object detection in image data. First image data comprising a three-dimensional model representing an object may be received. First background image data comprising a first plurality of pixel values may be received. A first feature vector representing the three-dimensional model may be generated. A second feature vector representing the first plurality of pixel values of the first background image data may be generated. A first machine learning model may generate a transformed representation of the three-dimensional model using the first feature vector. First foreground image data comprising a two-dimensional representation of the transformed representation of the three-dimensional model may be generated. A frame of composite image data may be generated by combining the first foreground image data with the first background image data.
-
公开(公告)号:US11450008B1
公开(公告)日:2022-09-20
申请号:US16803363
申请日:2020-02-27
Applicant: Amazon Technologies, Inc.
Inventor: Ambrish Tyagi , Siddhartha Chandra , Amit Kumar Agrawal , Viveka Kulharia
Abstract: Devices and techniques are generally described for weakly-supervised object segmentation in image data. In various examples, a first frame of image data may be received. The first frame may include a first bounding box surrounding a first set of pixels, wherein first subset of pixels of the first set of pixels represent a first object of a first class and wherein second subset of pixels of the first set of pixels represent background image data. Cross-entropy loss may be determined for the first set of pixels. In some examples, a spatial attention map may be determined for the first set of pixels. In further examples, parameters of a convolutional neural network may be determined by modulating the cross-entropy loss for the first set of pixels using the spatial attention map. The convolutional neural network may be used to generate a segmentation map.
-
-
-
-
-