OBJECT-TO-ROBOT POSE ESTIMATION FROM A SINGLE RGB IMAGE

    公开(公告)号:US20200311855A1

    公开(公告)日:2020-10-01

    申请号:US16902097

    申请日:2020-06-15

    摘要: Pose estimation generally refers to a computer vision technique that determines the pose of some object, usually with respect to a particular camera. Pose estimation has many applications, but is particularly useful in the context of robotic manipulation systems. To date, robotic manipulation systems have required a camera to be installed on the robot itself (i.e. a camera-in-hand) for capturing images of the object and/or a camera external to the robot for capturing images of the object. Unfortunately, the camera-in-hand has a limited field of view for capturing objects, whereas the external camera, which may have a greater field of view, requires costly calibration each time the camera is even slightly moved. Similar issues apply when estimating the pose of any object with respect to another object (i.e. which may be moving or not). The present disclosure avoids these issues and provides object-to-object pose estimation from a single image.

    Equivariant landmark transformation for landmark localization

    公开(公告)号:US10783394B2

    公开(公告)日:2020-09-22

    申请号:US16006728

    申请日:2018-06-12

    摘要: A method, computer readable medium, and system are disclosed to generate coordinates of landmarks within images. The landmark locations may be identified on an image of a human face and used for emotion recognition, face identity verification, eye gaze tracking, pose estimation, etc. A transform is applied to input image data to produce transformed input image data. The transform is also applied to predicted coordinates for landmarks of the input image data to produce transformed predicted coordinates. A neural network model processes the transformed input image data to generate additional landmarks of the transformed input image data and additional predicted coordinates for each one of the additional landmarks. Parameters of the neural network model are updated to reduce differences between the transformed predicted coordinates and the additional predicted coordinates.

    SEMI-SUPERVISED LEARNING FOR LANDMARK LOCALIZATION

    公开(公告)号:US20180365532A1

    公开(公告)日:2018-12-20

    申请号:US16006709

    申请日:2018-06-12

    摘要: A method, computer readable medium, and system are disclosed for sequential multi-tasking to generate coordinates of landmarks within images. The landmark locations may be identified on an image of a human face and used for emotion recognition, face identity verification, eye gaze tracking, pose estimation, etc. A neural network model processes input image data to generate pixel-level likelihood estimates for landmarks in the input image data and a soft-argmax function computes predicted coordinates of each landmark based on the pixel-level likelihood estimates.

    EQUIVARIANT LANDMARK TRANSFORMATION FOR LANDMARK LOCALIZATION

    公开(公告)号:US20180365512A1

    公开(公告)日:2018-12-20

    申请号:US16006728

    申请日:2018-06-12

    摘要: A method, computer readable medium, and system are disclosed to generate coordinates of landmarks within images. The landmark locations may be identified on an image of a human face and used for emotion recognition, face identity verification, eye gaze tracking, pose estimation, etc. A transform is applied to input image data to produce transformed input image data. The transform is also applied to predicted coordinates for landmarks of the input image data to produce transformed predicted coordinates. A neural network model processes the transformed input image data to generate additional landmarks of the transformed input image data and additional predicted coordinates for each one of the additional landmarks. Parameters of the neural network model are updated to reduce differences between the transformed predicted coordinates and the additional predicted coordinates.

    Systems and methods for pruning neural networks for resource efficient inference

    公开(公告)号:US11315018B2

    公开(公告)日:2022-04-26

    申请号:US15786406

    申请日:2017-10-17

    IPC分类号: G06N3/08 G06N3/04

    摘要: A method, computer readable medium, and system are disclosed for neural network pruning. The method includes the steps of receiving first-order gradients of a cost function relative to layer parameters for a trained neural network and computing a pruning criterion for each layer parameter based on the first-order gradient corresponding to the layer parameter, where the pruning criterion indicates an importance of each neuron that is included in the trained neural network and is associated with the layer parameter. The method includes the additional steps of identifying at least one neuron having a lowest importance and removing the at least one neuron from the trained neural network to produce a pruned neural network.