Deep reinforcement learning for robotic manipulation

发明授权

US11897133B2 Deep reinforcement learning for robotic manipulation 有权

请登陆查看更多内容

专利标题： Deep reinforcement learning for robotic manipulation
申请号： US17878186

申请日： 2022-08-01
公开(公告)号： US11897133B2

公开(公告)日： 2024-02-13
发明人: Sergey Levine , Ethan Holly , Shixiang Gu , Timothy Lillicrap
申请人： Google LLC
申请人地址： US CA Mountain View
专利权人： GOOGLE LLC
当前专利权人： GOOGLE LLC
当前专利权人地址： US CA Mountain View
代理机构： Gray Ice Higdon
主分类号： G06F17/00
IPC分类号： G06F17/00 ; B25J9/16 ; G05B13/02 ; G06N3/08 ; G06N3/008 ; G06N3/045 ; G05B19/042

Deep reinforcement learning for robotic manipulation

摘要：

Implementations utilize deep reinforcement learning to train a policy neural network that parameterizes a policy for determining a robotic action based on a current state. Some of those implementations collect experience data from multiple robots that operate simultaneously. Each robot generates instances of experience data during iterative performance of episodes that are each explorations of performing a task, and that are each guided based on the policy network and the current policy parameters for the policy network during the episode. The collected experience data is generated during the episodes and is used to train the policy network by iteratively updating policy parameters of the policy network based on a batch of collected experience data. Further, prior to performance of each of a plurality of episodes performed by the robots, the current updated policy parameters can be provided (or retrieved) for utilization in performance of the episode.

公开/授权文献

US11845183B2 Deep reinforcement learning for robotic manipulation 公开/授权日：2023-12-19

信息查询

Espacenet

IPC分类:

G	物理
G06	计算；推算或计数
G06F	电数字数据处理（基于特定计算模型的计算机系统入G06N）
G06F17/00	特别适用于特定功能的数字计算设备或数据处理设备或数据处理方法（信息检索，数据库结构或文件系统结构，G06F 16/00）