Evaluating varying-sized action spaces using reinforcement learning

发明授权

US11243532B1 Evaluating varying-sized action spaces using reinforcement learning 有权

请登陆查看更多内容

专利标题： Evaluating varying-sized action spaces using reinforcement learning
申请号： US16143124

申请日： 2018-09-26
公开(公告)号： US11243532B1

公开(公告)日： 2022-02-08
发明人: Martin Levihn , Pekka Tapani Raiko
申请人： Apple Inc.
申请人地址： US CA Cupertino
专利权人： Apple Inc.
当前专利权人： Apple Inc.
当前专利权人地址： US CA Cupertino
代理机构： Kowert, Hood, Munyon, Rankin & Goetzel, P.C.
代理商 Robert C. Kowert
主分类号： G05D1/00
IPC分类号： G05D1/00 ; G06N3/04 ; G05D1/02 ; G06N3/08

Evaluating varying-sized action spaces using reinforcement learning

摘要：

A set of actions corresponding to a particular state of the environment of a vehicle is identified. A respective encoding is generated for different actions of the set, using elements such as distinct colors to distinguish attributes such as target lane segments. Using the encodings as inputs to respective instances of a machine learning model, respective value metrics are estimated for each of the actions. One or more motion-control directives to implement a particular action selected using the value metrics are transmitted to motion-control subsystems of the vehicle.

信息查询

Espacenet

IPC分类:

G	物理
G05	控制；调节
G05D	非电变量的控制或调节系统（金属的连续铸造入B22D11/16；阀门本身入F16K；非电变量的检测见G01各有关小类；电或磁变量的调节入G05F）
G05D1/00	陆地、水上、空中或太空中的运载工具的位置、航道、高度或姿态的控制，例如自动驾驶仪（无线电导航系统或使用其他波的类似系统入G01S）