METHOD AND APPARATUS FOR THREE-DIMENSIONAL OBJECT PERCEPTION

    Publication No.: US20250157230A1

    Publication Date: 2025-05-15

    Application No.: US18635249

    Filing Date: 2024-04-15

    Abstract: A method for three-dimensional (3D) object detection includes: receiving an input image with respect to a 3D space, an input point cloud with respect to the 3D space, and an input language with respect to a target object in the 3D space; using an encoding model to generate candidate image features of partial areas of the input image, a point cloud feature of the input point cloud, and a linguistic feature of the input language; selecting, from among the candidate image features, a target image feature corresponding to the linguistic feature based on similarity scores between the candidate image features and the linguistic feature; generating a decoding output by executing a multi-modal decoding model based on the target image feature and the point cloud feature; and detecting a 3D bounding box corresponding to the target object by executing an object detection model based on the decoding output.
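
    The selection step in the abstract can be illustrated with a minimal sketch. The abstract does not say which similarity measure is used or how features are represented, so the cosine similarity, the plain-list feature vectors, and the names `cosine_similarity` and `select_target_feature` below are all assumptions made for illustration, not the claimed implementation.

```python
import math
from typing import List, Sequence, Tuple


def cosine_similarity(a: Sequence[float], b: Sequence[float]) -> float:
    """Cosine similarity between two equal-length vectors (assumed metric)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0  # treat a zero vector as dissimilar to everything
    return dot / (norm_a * norm_b)


def select_target_feature(
    candidates: Sequence[Sequence[float]],
    linguistic: Sequence[float],
) -> Tuple[int, List[float]]:
    """Return the index of the best-matching candidate and all scores.

    Hypothetical helper: scores every candidate image feature against the
    linguistic feature and picks the highest-scoring one.
    """
    if not candidates:
        raise ValueError("at least one candidate image feature is required")
    scores = [cosine_similarity(c, linguistic) for c in candidates]
    best = max(range(len(scores)), key=scores.__getitem__)
    return best, scores


# Toy data: three candidate image features (one per partial image area)
# and one linguistic feature, all made up for this example.
candidate_image_features = [
    [1.0, 0.0, 0.0],
    [0.0, 1.0, 0.0],
    [0.6, 0.8, 0.0],
]
linguistic_feature = [0.5, 0.9, 0.0]

target_index, similarity_scores = select_target_feature(
    candidate_image_features, linguistic_feature
)
target_image_feature = candidate_image_features[target_index]
```

    In this sketch the selected `target_image_feature` would then be passed, together with the point cloud feature, to the multi-modal decoding model; those later stages are not sketched here because the abstract gives no detail on their structure.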
