-
Publication No.: US12111884B2
Publication date: 2024-10-08
Application No.: US17659983
Filing date: 2022-04-20
Applicant: ADOBE INC.
Inventor: Tanay Anand, Pinkesh Badjatiya, Sriyash Poddar, Jayakumar Subramanian, Georgios Theocharous, Balaji Krishnamurthy
IPC: G06F18/2137, G06N3/088
CPC classification number: G06F18/2137, G06N3/088
Abstract: Systems and methods for machine learning are described. Embodiments of the present disclosure receive state information that describes a state of a decision making agent in an environment; compute an action vector from an action embedding space based on the state information using a policy neural network of the decision making agent, wherein the policy neural network is trained using reinforcement learning based on a topology loss that constrains changes in a mapping between an action set and the action embedding space; and perform an action that modifies the state of the decision making agent in the environment based on the action vector, wherein the action is selected based on the mapping.
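Illustrative sketch (not taken from the patent text): the abstract describes a policy that outputs a vector in an action embedding space, an action chosen through the mapping between the action set and that space, and a topology loss that constrains how the mapping changes. The Python below is one minimal way to picture those pieces. It assumes nearest-neighbor lookup for action selection and a penalty on changes in pairwise embedding distances as a stand-in for the topology loss; the function names `select_action` and `topology_penalty`, and both of those design choices, are hypothetical and are not confirmed by the record.

```python
import numpy as np


def select_action(action_vector, embeddings):
    """Return the index of the discrete action whose embedding lies
    closest (Euclidean distance) to the policy's output vector.
    Nearest-neighbor lookup is an assumption, not the claimed method."""
    dists = np.linalg.norm(embeddings - action_vector, axis=1)
    return int(np.argmin(dists))


def pairwise_distances(x):
    """Matrix of Euclidean distances between all rows of x."""
    diff = x[:, None, :] - x[None, :, :]
    return np.linalg.norm(diff, axis=-1)


def topology_penalty(old_embeddings, new_embeddings):
    """Hypothetical stand-in for a topology loss: mean squared change
    in pairwise distances between action embeddings after an update."""
    d_old = pairwise_distances(old_embeddings)
    d_new = pairwise_distances(new_embeddings)
    return float(np.mean((d_old - d_new) ** 2))


# Three discrete actions embedded in a 2-D space (made-up values).
embeddings = np.array([[0.0, 0.0], [1.0, 0.0], [0.0, 1.0]])

# Pretend this vector came from the policy network for the current state.
action_vector = np.array([0.9, 0.1])
chosen = select_action(action_vector, embeddings)

# A rigid shift keeps every pairwise distance, so the penalty is zero;
# rescaling the embeddings changes the distances, so the penalty is positive.
penalty_rigid = topology_penalty(embeddings, embeddings + 5.0)
penalty_scaled = topology_penalty(embeddings, embeddings * 2.0)
```

In this sketch the penalty is invariant to translations of the embedding space and grows only when the relative arrangement of actions changes, which is one plausible reading of "constrains changes in a mapping".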
-
Publication No.: US20230342425A1
Publication date: 2023-10-26
Application No.: US17659983
Filing date: 2022-04-20
Applicant: ADOBE INC.
Inventor: Tanay Anand, Pinkesh Badjatiya, Sriyash Poddar, Jayakumar Subramanian, Georgios Theocharous, Balaji Krishnamurthy
CPC classification number: G06K9/6251, G06N3/088
Abstract: Systems and methods for machine learning are described. Embodiments of the present disclosure receive state information that describes a state of a decision making agent in an environment; compute an action vector from an action embedding space based on the state information using a policy neural network of the decision making agent, wherein the policy neural network is trained using reinforcement learning based on a topology loss that constrains changes in a mapping between an action set and the action embedding space; and perform an action that modifies the state of the decision making agent in the environment based on the action vector, wherein the action is selected based on the mapping.
-