Meta-Q learning
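    Granted invention patent

    Publication (announcement) number: US12217137B1

    Publication (announcement) date: 2025-02-04

    Application number: US17039447

    Application date: 2020-09-30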

    Abstract: Techniques for Meta-Q-Learning (MQL) are described. A method of MQL may include receiving a request from an agent to perform adaptation based at least on task data associated with a new task collected by the agent, identifying a subset of meta-training data in a replay buffer that corresponds to the task data, and adapting a policy using the subset of meta-training data and the task data to generate an adapted policy, wherein the adapted policy is used to identify a next action for the agent to perform.
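
    The three steps named in the abstract (select matching meta-training data from a replay buffer, adapt a policy on that data plus the new task data, pick the next action) can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration and is not taken from the patent: the nearest-state selection rule, the tabular Q-function standing in for the policy, and the names select_similar, adapt_policy, and next_action are all hypothetical.

```python
from collections import defaultdict
from typing import Dict, List, Tuple

# A transition is (state, action, reward, next_state).
# States are small feature tuples so that a distance between them is defined.
State = Tuple[float, ...]
Transition = Tuple[State, int, float, State]


def squared_distance(a: State, b: State) -> float:
    return sum((x - y) ** 2 for x, y in zip(a, b))


def select_similar(replay_buffer: List[Transition],
                   task_data: List[Transition], k: int) -> List[Transition]:
    """Hypothetical selection rule (an assumption, not the patent's):
    keep the k buffer transitions whose state is closest to any state
    observed in the new task."""
    task_states = [t[0] for t in task_data]

    def score(tr: Transition) -> float:
        return min(squared_distance(tr[0], s) for s in task_states)

    return sorted(replay_buffer, key=score)[:k]


def adapt_policy(q_table: Dict, transitions: List[Transition],
                 actions: List[int], lr: float = 0.5, gamma: float = 0.9,
                 sweeps: int = 20) -> Dict:
    """Return a copy of q_table updated by plain tabular Q-learning
    over the given transitions (a stand-in for policy adaptation)."""
    adapted = defaultdict(float, q_table)
    for _ in range(sweeps):
        for s, a, r, s2 in transitions:
            target = r + gamma * max(adapted[(s2, b)] for b in actions)
            adapted[(s, a)] += lr * (target - adapted[(s, a)])
    return adapted


def next_action(q_table: Dict, state: State, actions: List[int]) -> int:
    """Greedy action under the adapted Q-values."""
    return max(actions, key=lambda a: q_table[(state, a)])


# Toy data: two regions of state space; the new task resembles the first.
replay_buffer = [((0.0,), 0, 0.0, (1.0,)), ((0.0,), 1, 1.0, (1.0,)),
                 ((5.0,), 0, 1.0, (6.0,)), ((5.0,), 1, 0.0, (6.0,))]
task_data = [((0.1,), 1, 1.0, (1.0,))]
actions = [0, 1]

subset = select_similar(replay_buffer, task_data, k=2)
adapted = adapt_policy({}, subset + task_data, actions)
action = next_action(adapted, (0.0,), actions)
```

    In this toy setup the selection step keeps only the buffer transitions near the new task's states, so transitions from the unrelated region do not influence the adapted Q-values. A real system would likely replace the distance rule and the table with learned models; the sketch only shows how the three steps fit together.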
