Publication number: US20230368541A1
Publication date: 2023-11-16
Application number: US17745141
Filing date: 2022-05-16
Applicant: Ford Global Technologies, LLC
Inventors: Daniel Goodman, Sandhya Bhaskar, Nikita Jaipuria, Jinesh Jain, Vidya Nariyambut Murali
CPC classification numbers: G06V20/58, G06V20/41, G06V10/82, G06V20/46, B60W60/0027
Abstract: A computer that includes a processor and a memory can predict the future status of one or more moving objects by acquiring a plurality of video frames with a sensor included in a device, inputting the plurality of video frames to a first deep neural network to determine one or more objects included in the plurality of video frames, and inputting the objects to a second deep neural network to determine object features and full frame features. The computer can further input the object features and full frame features to a third deep neural network to determine spatial attention weights for the object features and full frame features, input the object features and full frame features to a fourth deep neural network to determine temporal attention weights for the object features and full frame features, and input the object features, full frame features, spatial attention weights and temporal attention weights to a fifth deep neural network to determine predictions regarding the one or more objects included in the plurality of video frames.
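Illustrative sketch (not part of the patent record): the attention stages described in the abstract can be pictured with a toy example in plain Python. Everything here is an assumption made for illustration. The abstract specifies trained deep neural networks for each stage and does not disclose a scoring function, so the dot-product scoring below is a stand-in, and the names `spatial_attention`, `temporal_attention` and `fuse` are hypothetical. The sketch only shows how spatial weights (over objects within a frame) and temporal weights (over frames) could combine per-object features into one vector that a prediction network would consume.

```python
import math
from typing import List, Tuple

Vector = List[float]


def softmax(scores: List[float]) -> List[float]:
    """Numerically stable softmax; the returned weights sum to 1."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def dot(a: Vector, b: Vector) -> float:
    return sum(x * y for x, y in zip(a, b))


def weighted_sum(vectors: List[Vector], weights: List[float]) -> Vector:
    dim = len(vectors[0])
    return [sum(w * v[i] for v, w in zip(vectors, weights)) for i in range(dim)]


def spatial_attention(object_feats: List[Vector], frame_feat: Vector) -> List[float]:
    # Assumed stand-in for the third network: score each object in a frame
    # against that frame's full-frame feature, then normalize over objects.
    return softmax([dot(f, frame_feat) for f in object_feats])


def temporal_attention(frame_summaries: List[Vector], query: Vector) -> List[float]:
    # Assumed stand-in for the fourth network: score each per-frame summary
    # against a query vector, then normalize over frames.
    return softmax([dot(s, query) for s in frame_summaries])


def fuse(
    object_feats_by_frame: List[List[Vector]],
    frame_feats: List[Vector],
) -> Tuple[Vector, List[List[float]], List[float]]:
    """Combine object features across space and time into one vector."""
    summaries: List[Vector] = []
    spatial: List[List[float]] = []
    for objs, frame in zip(object_feats_by_frame, frame_feats):
        weights = spatial_attention(objs, frame)
        spatial.append(weights)
        summaries.append(weighted_sum(objs, weights))
    # Hypothetical choice: use the most recent full-frame feature as the query.
    temporal = temporal_attention(summaries, frame_feats[-1])
    fused = weighted_sum(summaries, temporal)
    return fused, spatial, temporal


if __name__ == "__main__":
    # Two frames, two objects per frame, 2-dimensional toy features.
    objects = [[[1.0, 0.0], [0.0, 1.0]], [[0.5, 0.5], [1.0, 1.0]]]
    frames = [[1.0, 0.0], [0.0, 1.0]]
    fused, spatial, temporal = fuse(objects, frames)
    print("fused feature:", fused)
    print("spatial weights:", spatial)
    print("temporal weights:", temporal)
```

In a real system the fused vector, together with the attention weights, would be passed to a trained prediction network (the fifth network in the abstract); that stage is omitted here because the abstract gives no detail about its form.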