Patent search ap:("ADOBE INC.") AND inv:"Sriyash Poddar" Page 1

1.

发明授权
Optimal sequential decision making with changing action space 有权

公开(公告)号：US12111884B2

公开(公告)日：2024-10-08

申请号：US17659983

申请日：2022-04-20

Applicant: ADOBE INC.

Inventor： Tanay Anand , Pinkesh Badjatiya , Sriyash Poddar , Jayakumar Subramanian , Georgios Theocharous , Balaji Krishnamurthy

IPC: G06F18/2137 , G06N3/088

CPC classification number: G06F18/2137 , G06N3/088

Abstract: Systems and methods for machine learning are described. Embodiments of the present disclosure receive state information that describes a state of a decision making agent in an environment; compute an action vector from an action embedding space based on the state information using a policy neural network of the decision making agent, wherein the policy neural network is trained using reinforcement learning based on a topology loss that constrains changes in a mapping between an action set and the action embedding space; and perform an action that modifies the state of the decision making agent in the environment based on the action vector, wherein the action is selected based on the mapping.

2.

发明公开
OPTIMAL SEQUENTIAL DECISION MAKING WITH CHANGING ACTION SPACE 审中-公开

公开(公告)号：US20230342425A1

公开(公告)日：2023-10-26

申请号：US17659983

申请日：2022-04-20

Applicant: ADOBE INC.

Inventor： Tanay Anand , Pinkesh Badjatiya , Sriyash Poddar , Jayakumar Subramanian , Georgios Theocharous , Balaji Krishnamurthy

IPC: G06K9/62 , G06N3/08

CPC classification number: G06K9/6251 , G06N3/088

Abstract: Systems and methods for machine learning are described. Embodiments of the present disclosure receive state information that describes a state of a decision making agent in an environment; compute an action vector from an action embedding space based on the state information using a policy neural network of the decision making agent, wherein the policy neural network is trained using reinforcement learning based on a topology loss that constrains changes in a mapping between an action set and the action embedding space; and perform an action that modifies the state of the decision making agent in the environment based on the action vector, wherein the action is selected based on the mapping.

Patent Agency Ranking