SYSTEMS AND METHODS FOR JOINTLY TRAINING A MACHINE-LEARNING-BASED MONOCULAR OPTICAL FLOW, DEPTH, AND SCENE FLOW ESTIMATOR

    公开(公告)号:US20220392083A1

    公开(公告)日:2022-12-08

    申请号:US17489237

    申请日:2021-09-29

    摘要: Systems and methods described herein relate to jointly training a machine-learning-based monocular optical flow, depth, and scene flow estimator. One embodiment processes a pair of temporally adjacent monocular image frames using a first neural network structure to produce a first optical flow estimate; processes the pair of temporally adjacent monocular image frames using a second neural network structure to produce an estimated depth map and an estimated scene flow; processes the estimated depth map and the estimated scene flow using the second neural network structure to produce a second optical flow estimate; and imposes a consistency loss between the first optical flow estimate and the second optical flow estimate that minimizes a difference between the first optical flow estimate and the second optical flow estimate to improve performance of the first neural network structure in estimating optical flow and the second neural network structure in estimating depth and scene flow.

    Monocular 3D vehicle modeling and auto-labeling using semantic keypoints

    公开(公告)号:US11475628B2

    公开(公告)日:2022-10-18

    申请号:US17147049

    申请日:2021-01-12

    摘要: A method for 3D object modeling includes linking 2D semantic keypoints of an object within a video stream into a 2D structured object geometry. The method includes inputting, to a neural network, the object to generate a 2D NOCS image and a shape vector, the shape vector being mapped to a continuously traversable coordinate shape. The method includes applying a differentiable shape renderer to the SDF shape and the 2D NOCS image to render a shape of the object corresponding to a 3D object model in the continuously traversable coordinate shape space. The method includes lifting the linked, 2D semantic keypoints of the 2D structured object geometry to a 3D structured object geometry. The method includes geometrically and projectively aligning the 3D object model, the 3D structured object geometry, and the rendered shape to form a rendered object. The method includes generating 3D bounding boxes from the rendered object.

    Attention-based recurrent convolutional network for vehicle taillight recognition

    公开(公告)号:US11361557B2

    公开(公告)日:2022-06-14

    申请号:US16389255

    申请日:2019-04-19

    摘要: A method for performing vehicle taillight recognition is described. The method includes extracting spatial features from a sequence of images of a real-world traffic scene during operation of an ego vehicle. The method includes selectively focusing a convolutional neural network (CNN) of a CNN-long short-term memory (CNN-LSTM) framework on a selected region of the sequence of images according to a spatial attention model for a vehicle taillight recognition task. The method includes selecting, by an LSTM network of the CNN-LSTM framework, frames within the selected region of the sequence of images according to a temporal attention model for the vehicle taillight recognition task. The method includes inferring, according to the selected frames within the selected region of the sequence of images, an intent of an ado vehicle according to a taillight state. The method includes planning a trajectory of the ego vehicle from the intent inferred from the ado vehicle.

    SYSTEM AND METHOD FOR PREDICTING THE MOVEMENT OF PEDESTRIANS

    公开(公告)号:US20210245744A1

    公开(公告)日:2021-08-12

    申请号:US16787523

    申请日:2020-02-11

    摘要: A system and related method for predicting movement of a plurality of pedestrians may include one or more processors and a memory. The memory includes an initial trajectory module, an exit point prediction module, a path planning module, and an adjustment module. The modules include instructions that when executed by the one or more processors cause the one or more processors to obtain trajectories of the plurality of pedestrians, predict future exit points for the plurality of pedestrians from a scene based on the trajectories of the plurality of pedestrians, determine trajectory paths of the plurality of pedestrians based on the future exit points and at least one scene element of a map, and adjust the trajectory paths based on at least one predicted interaction between at least two of the plurality of pedestrians.

    Systems and methods for projecting a location of a nearby object into a map according to a camera image

    公开(公告)号:US10409288B2

    公开(公告)日:2019-09-10

    申请号:US15585878

    申请日:2017-05-03

    IPC分类号: G05D1/02 G06K9/00 G08G1/16

    摘要: System, methods, and other embodiments described herein relate to locating an object within an image and projecting the object into a map. In one embodiment, a method includes, responsive to identifying a nearby vehicle in an image of an external environment surrounding a scanning vehicle, determining a relative location of the nearby vehicle in relation to the scanning vehicle using the image. The method includes projecting the nearby vehicle into a map according to at least the relative location determined from the image. The method includes controlling the scanning vehicle according to the map.