- 专利标题: Evaluating varying-sized action spaces using reinforcement learning
-
申请号: US16143124申请日: 2018-09-26
-
公开(公告)号: US11243532B1公开(公告)日: 2022-02-08
- 发明人: Martin Levihn , Pekka Tapani Raiko
- 申请人: Apple Inc.
- 申请人地址: US CA Cupertino
- 专利权人: Apple Inc.
- 当前专利权人: Apple Inc.
- 当前专利权人地址: US CA Cupertino
- 代理机构: Kowert, Hood, Munyon, Rankin & Goetzel, P.C.
- 代理商 Robert C. Kowert
- 主分类号: G05D1/00
- IPC分类号: G05D1/00 ; G06N3/04 ; G05D1/02 ; G06N3/08
摘要:
A set of actions corresponding to a particular state of the environment of a vehicle is identified. A respective encoding is generated for different actions of the set, using elements such as distinct colors to distinguish attributes such as target lane segments. Using the encodings as inputs to respective instances of a machine learning model, respective value metrics are estimated for each of the actions. One or more motion-control directives to implement a particular action selected using the value metrics are transmitted to motion-control subsystems of the vehicle.
信息查询