Patent search ap:("Amazon Technologies Page Inc.") AND inv:"Hao Wu"

1.

发明授权
Content moderation using object detection and image classification 有权

公开(公告)号：US11423265B1

公开(公告)日：2022-08-23

申请号：US16917721

申请日：2020-06-30

Applicant: Amazon Technologies, Inc.

Inventor： Hao Chen , Hao Wu , Hao Li , Michael Quang Thai Lam , Xinyu Li , Kaustav Kundu , Meng Wang , Joseph P Tighe , Rahul Bhotika

IPC: G06K9/62 , G06T7/10 , G06T11/20 , G06N20/00

Abstract: Methods, systems, and computer-readable media for content moderation using object detection and image classification are disclosed. A content moderation system performs object detection on an input image using one or more object detectors. The object detection finds one or more elements in the input image. The content moderation system performs classification based at least in part on the input image using one or more image classifiers. The classification determines one or more values indicative of one or more content types in the input image. The content moderation system determines one or more scores for one or more content labels corresponding to the one or more content types. At least one of the scores represents a finding of one or more of the content types in the input image. The content moderation system generates output indicating the finding of the one or more of the content types.

2.

发明授权
Image text localization 有权

公开(公告)号：US10572760B1

公开(公告)日：2020-02-25

申请号：US15810991

申请日：2017-11-13

Applicant: Amazon Technologies, Inc.

Inventor： Hao Wu , Jonathan Wu , Meng Wang , Wei Xia

IPC: G06K9/00 , G06K9/32 , G06K9/18 , G06N3/08 , G06K9/62 , G06T11/60

Abstract: A method and system for analyzing text in an image is disclosed. A text localization and classification system accesses an annotated image comprising a plurality of text location identifiers for a given item of text. A neural network predicts the location of the given item of text using at least a first location identifier and a second location identifier. Optionally, the first location identifier comprises a first shape and the second location identifier comprises a second shape. A first loss is generated using a first loss function, the first loss corresponding to the predicated location using the first location identifier. A second loss is generated using a second loss function, the second loss corresponding to the predicated location using the second location identifier. The neural network is enhanced with backpropagation using the first loss and the second loss.

3.

发明授权
Pose-based personal equipment detection 有权

公开(公告)号：US11501573B1

公开(公告)日：2022-11-15

申请号：US16894709

申请日：2020-06-05

Applicant: Amazon Technologies, Inc.

Inventor： Joseph P. Tighe , Meng Wang , Hao Wu , Manchen Wang

IPC: G06V40/20 , G06K9/62 , G06T7/00 , G06T7/11 , G06Q10/08

Abstract: Personal equipment detection may utilize pose-based detection. Input image data may be evaluated to detect persons in the image data. For detected persons, regions of the persons may be determined. Personal equipment may be detected for the detected persons in the image data and compared with the regions of the persons to determine whether the detected personal equipment is properly placed on the person.

4.

发明授权
Semantic ordering of image text 有权

公开(公告)号：US10706322B1

公开(公告)日：2020-07-07

申请号：US15821629

申请日：2017-11-22

Applicant: Amazon Technologies, Inc.

Inventor： Shuo Yang , Hao Wu , Jonathan Wu , Meng Wang

IPC: G06K9/34 , G06K9/62 , G06T7/70 , G06K9/72

Abstract: Embodiments of the present disclosure provide systems and processes for automatically determining a layout of text within an image that makes sense from a semantic perspective. In certain embodiments, the systems disclosed herein receive bounding box information relating to one or more bounding boxes that surround text within the image. The systems compare the received bounding box information to determine a clustering of bounding boxes that have an above threshold probability of including words that when read in order make sense semantically. For example, systems herein can determine whether words in a cluster correspond to a line of text.

Patent Agency Ranking