Object reconstruction using media data

    公开(公告)号:US12062130B2

    公开(公告)日:2024-08-13

    申请号:US17403656

    申请日:2021-08-16

    CPC classification number: G06T17/00 G06T13/40 G06T19/20 G06T2219/2004

    Abstract: Systems and techniques are provided for performing video-based activity recognition. For example, a process can include generating a three-dimensional (3D) model of a first portion of an object based on one or more frames depicting the object. The process can also include generating a mask for the one or more frames, the mask including an indication of one or more regions of the object. The process can further include generating a 3D base model based on the 3D model of the first portion of the object and the mask, the 3D base model representing the first portion of the object and a second portion of the object. The process can include generating, based on the mask and the 3D base model, a 3D model of the second portion of the object.

    Image augmentation for analytics
    73.
    发明授权

    公开(公告)号:US11386633B2

    公开(公告)日:2022-07-12

    申请号:US17109045

    申请日:2020-12-01

    Abstract: Systems and techniques are provided for facial image augmentation. An example method can include obtaining a first image capturing a face. Using the first image, the method can determine, using a prediction model, a UV face position map including a two-dimensional (2D) representation of a three-dimensional (3D) structure of the face. The method can generate, based on the UV face position map, a 3D model of the face. The method can generate an extended 3D model of the face by extending the 3D model to include region(s) beyond a boundary of the 3D model. The region(s) can include a forehead region, a region surrounding at least a portion of the face, and/or other region. The method can generate, based on the extended 3D model, a second image depicting the face in a rotated position relative to a position of the face in the first image.

    COORDINATED MULTI-VIEWPOINT IMAGE CAPTURE

    公开(公告)号:US20220006922A1

    公开(公告)日:2022-01-06

    申请号:US16920219

    申请日:2020-07-02

    Abstract: Various embodiments may include methods and systems for configuring synchronous multipoint photography. Various embodiments may include displaying preview images on initiating and responding devices. Various embodiments may include determining an adjustment to the orientation of a responding device based on the preview images. Various embodiments may include transmitting an instruction configured to enable the responding device to display a notification for adjusting the position or the orientation of the responding device based at least on the adjustment. Various embodiments may include transmitting, to the responding device, a second instruction to enable the responding device to capture a second image at approximately the same time as the initiating device captures a first image. Embodiments further include capturing, via a camera, the first image, receiving, from the responding device, s second image, and generating an image file based on the first image and the second image.

    Two-pass omni-directional object detection

    公开(公告)号:US11188740B2

    公开(公告)日:2021-11-30

    申请号:US16719900

    申请日:2019-12-18

    Abstract: Methods, systems, and devices for object detection are described. A device may receive an image, and detect, via a first stage of a cascade neural network, object recognition information over one or more angular orientations during a first pass. The device may determine, via a second stage of the cascade neural network, a confidence score associated with one or more of the candidate object in the image, the candidate bounding box associated with the candidate object in the image, or one or more object features of the candidate object in the image, or an orientation of the candidate object in the image, or a combination thereof. The device may identify, via a third stage of the cascade neural network, whether to detect the object recognition information during a second pass based on the confidence score satisfying a threshold.

    Multi-resolution feature description for object recognition

    公开(公告)号:US11068741B2

    公开(公告)日:2021-07-20

    申请号:US16224644

    申请日:2018-12-18

    Abstract: Techniques and systems are provided for determining features for one or more objects in one or more video frames. For example, an image of an object, such as a face, can be received, and features of the object in the image can be identified. A size of the object can be determined based on the image, for example based on inter-eye distance of a face. Based on the size, either a high-resolution set of features or a low-resolution set of features is selected to compare to the features of the object. The object can be identified by matching the features of the object to matching features from the selected set of features.

    High-level signaling for fisheye video data

    公开(公告)号:US10992961B2

    公开(公告)日:2021-04-27

    申请号:US15987231

    申请日:2018-05-23

    Abstract: An example method includes processing a file including fisheye video data, the file including a syntax structure including a plurality of syntax elements that specify attributes of the fisheye video data, wherein the plurality of syntax elements includes: a first syntax element that explicitly indicates whether the fisheye video data is monoscopic or stereoscopic, and one or more syntax elements that implicitly indicate whether the fisheye video data is monoscopic or stereoscopic; determining, based on the first syntax element, whether the fisheye video data is monoscopic or stereoscopic; and rendering, based on the determination, the fisheye video data as monoscopic or stereoscopic.

    PERSONALIZED EYE OPENNESS ESTIMATION
    78.
    发明申请

    公开(公告)号:US20200218878A1

    公开(公告)日:2020-07-09

    申请号:US16239352

    申请日:2019-01-03

    Abstract: Methods, systems, and devices for personalized (e.g., user specific) eye openness estimation are described. A network model (e.g., a convolutional neural network) may be trained using a set of synthetic eye openness image data (e.g., synthetic face images with known degrees or percentages of eye openness) and a set of real eye openness image data (e.g., facial images of real persons that are annotated as either open eyed or closed eyed). A device may estimate, using the network model, a multi-stage eye openness level (e.g., a percentage or degree to which an eye is open) of a user based on captured real time eye openness image data. The degree of eye openness estimated by the network model may then be compared to an eye size of the user (e.g., a user specific maximum eye size), and a user specific eye openness level may be estimated based on the comparison.

    FEATURE MATCHING WITH A SUBSPACE SPANNED BY MULTIPLE REPRESENTATIVE FEATURE VECTORS

    公开(公告)号:US20190311183A1

    公开(公告)日:2019-10-10

    申请号:US15948676

    申请日:2018-04-09

    Abstract: Methods, systems, and devices for object recognition are described. A device may generate a subspace based at least in part on a set of representative feature vectors for an object. The device may obtain an array of pixels representing an image. The device may determine a probe feature vector for the image by applying a convolutional operation to the array of pixels. The device may create a reconstructed feature vector in the subspace based at least in part on the set of representative feature vectors and the probe feature vector. The device may compare the reconstructed feature vector and the probe feature vector and recognize the object in the image based at least in part on the comparison. For example, the described techniques may support pose invariant facial recognition or other such object recognition applications.

Patent Agency Ranking