-
公开(公告)号:US20240233440A1
公开(公告)日:2024-07-11
申请号:US18153166
申请日:2023-01-11
Applicant: Accenture Global Solutions Limited
Inventor: David Nguyen , Hailing Zhou , Nan Ke
Abstract: Implementations include actions of receiving an image, providing a set of features for the image, determining a set of HOIs including one or more HOIs that are potentially represented in the image, providing sets of feature scores by, for each HOI in the set of HOIs, determining, by a first ML model, a set of feature scores for respective features in the set of features, generating, by a second ML model, sets of weights based on the set of HOIs, providing a set of final scores by, for each HOI in the set of HOIs, determining a final score based on a respective set of weights and the set of feature scores, each final score corresponding to a respective HOI in the set of HOIs, and selecting an output HOI for the image from the set of HOIs based on the set of final scores.
-
公开(公告)号:US20240233439A1
公开(公告)日:2024-07-11
申请号:US18152627
申请日:2023-01-10
Applicant: Accenture Global Solutions Limited
Inventor: David Nguyen , Hailing Zhou , Nan Ke
IPC: G06V40/20 , G06F40/30 , G06V10/74 , G06V10/774 , G06V20/70
CPC classification number: G06V40/20 , G06F40/30 , G06V10/761 , G06V10/774 , G06V20/70
Abstract: Implementations include actions of receiving an image; extracting a visual HOI and a set of visual embeddings, the visual HOI indicating a subject and an object; obtaining, using a vector library, a set of semantic HOIs and sets of semantic embeddings based on the subject, the object and a set of verbs included in the vector library, each set of semantic embeddings corresponding to a semantic HOI; processing, by a compositional model, the set of visual embeddings to provide a set of transition visual embeddings; processing the sets of semantic embeddings to provide respective sets of transition semantic embeddings; determining a set of scores based on the set of transition visual embeddings and the sets of transition semantic embeddings, each score representing a degree of similarity between the visual HOI and a semantic HOI; and determining at least one predicted HOI represented within the image based on the scores.
-
公开(公告)号:US12073027B2
公开(公告)日:2024-08-27
申请号:US18084885
申请日:2022-12-20
Applicant: Accenture Global Solutions Limited
Inventor: Keyu Qi , Hailing Zhou , Nan Ke , David Nguyen , Binghao Tang
CPC classification number: G06F3/017 , G06V10/761 , G06V10/82 , G06V20/52
Abstract: Implementations are directed to receiving a first set of images included in a first video captured by a camera that monitors a human performing a task; processing the first set of images using a first machine learning (ML) model to determine whether the first set of images depicts a gesture that is included in a predefined set of gestures; in response to determining that the first set of images depicts a gesture included in a predefined set of gestures, processing a second set of images included in the first video using a second ML model to determine a first gesture type of the gesture; comparing the first gesture type with a first expected gesture type to determine whether performance of the task conforms to a standard operating procedure (SOP) for the task; and providing feedback representative of a comparison result in a user interface.
-
公开(公告)号:US20240201789A1
公开(公告)日:2024-06-20
申请号:US18084885
申请日:2022-12-20
Applicant: Accenture Global Solutions Limited
Inventor: Keyu Qi , Hailing Zhou , Nan Ke , David Nguyen , Binghao Tang
CPC classification number: G06F3/017 , G06V10/761 , G06V10/82 , G06V20/52
Abstract: Implementations are directed to receiving a first set of images included in a first video captured by a camera that monitors a human performing a task; processing the first set of images using a first machine learning (ML) model to determine whether the first set of images depicts a gesture that is included in a predefined set of gestures; in response to determining that the first set of images depicts a gesture included in a predefined set of gestures, processing a second set of images included in the first video using a second ML model to determine a first gesture type of the gesture; comparing the first gesture type with a first expected gesture type to determine whether performance of the task conforms to a standard operating procedure (SOP) for the task; and providing feedback representative of a comparison result in a user interface.
-
-
-