-
公开(公告)号:US11242050B2
公开(公告)日:2022-02-08
申请号:US16425482
申请日:2019-05-29
摘要: Systems and methods for providing navigation to a vehicle may include receiving observation data from one or more sensors of the vehicle, generating projection data corresponding to the one or more traffic participants based on the observation data for each time step within a time period, and predicting interactions between the vehicle, the one or more traffic participants, and the one or more obstacles, based on the projection data of the one or more traffic participants. The systems and methods may further include determining a set of actions by the vehicle corresponding to a probability of the vehicle safely arriving at a target location based on the predicted interactions, and selecting one or more actions from the set of actions and provide the one or more actions to a navigation system of the vehicle, wherein the navigation system uses the navigation data to provide navigation instructions to the vehicle.
-
公开(公告)号:US20210271988A1
公开(公告)日:2021-09-02
申请号:US16940680
申请日:2020-07-28
发明人: Maxime Bouton , David Francis Isele , Alireza Nakhaei Sarvedani , Mykel Kochenderfer , Kikuo Fujimura
摘要: According to one aspect, a system for reinforcement learning with iterative reasoning may include a memory for storing computer readable code and a processor operatively coupled to the memory, the processor configured to receive a level-0 policy and a desired reasoning level n. The processor may repeat for k=1 . . . n times, the following: populate a training environment with a level-(k−1) first agent, populate the training environment with a level-(k−1) second agent, and train a level-k agent based on the level-(k−1) first agent and the level-(k−1) second agent to derive a level-k policy.
-
公开(公告)号:US20200247402A1
公开(公告)日:2020-08-06
申请号:US16425482
申请日:2019-05-29
IPC分类号: B60W30/095 , G08G1/01 , G08G1/16 , G06N3/04 , G05D1/00
摘要: Systems and methods for providing navigation to a vehicle may include receiving observation data from one or more sensors of the vehicle, generating projection data corresponding to the one or more traffic participants based on the observation data for each time step within a time period, and predicting interactions between the vehicle, the one or more traffic participants, and the one or more obstacles, based on the projection data of the one or more traffic participants. The systems and methods may further include determining a set of actions by the vehicle corresponding to a probability of the vehicle safely arriving at a target location based on the predicted interactions, and selecting one or more actions from the set of actions and provide the one or more actions to a navigation system of the vehicle, wherein the navigation system uses the navigation data to provide navigation instructions to the vehicle.
-
-