-
Publication number: US12062130B2
Publication date: 2024-08-13
Application number: US17403656
Filing date: 2021-08-16
Applicant: QUALCOMM Incorporated
Inventor: Yan Deng, Michel Adib Sarkis, Ning Bi, Chieh-Ming Kuo
CPC classification number: G06T17/00, G06T13/40, G06T19/20, G06T2219/2004
Abstract: Systems and techniques are provided for performing video-based activity recognition. For example, a process can include generating a three-dimensional (3D) model of a first portion of an object based on one or more frames depicting the object. The process can also include generating a mask for the one or more frames, the mask including an indication of one or more regions of the object. The process can further include generating a 3D base model based on the 3D model of the first portion of the object and the mask, the 3D base model representing the first portion of the object and a second portion of the object. The process can include generating, based on the mask and the 3D base model, a 3D model of the second portion of the object.
-
Publication number: US11748949B2
Publication date: 2023-09-05
Application number: US17744484
Filing date: 2022-05-13
Applicant: QUALCOMM Incorporated
Inventor: Ke-Li Cheng, Kuang-Man Huang, Michel Adib Sarkis, Gerhard Reitmayr, Ning Bi
CPC classification number: G06T17/205, G06T7/12, G06T19/006, G06T2210/56
Abstract: Techniques are provided for generating three-dimensional models of objects from one or more images or frames. For example, at least one frame of an object in a scene can be obtained. A portion of the object is positioned on a plane in the at least one frame. The plane can be detected in the at least one frame and, based on the detected plane, the object can be segmented from the plane in the at least one frame. A three-dimensional (3D) model of the object can be generated based on segmenting the object from the plane. A refined mesh can be generated for a portion of the 3D model corresponding to the portion of the object positioned on the plane.
-
Publication number: US11386633B2
Publication date: 2022-07-12
Application number: US17109045
Filing date: 2020-12-01
Applicant: QUALCOMM Incorporated
Inventor: Peng Liu, Lei Wang, Zhen Wang, Ke-Li Cheng, Ning Bi
Abstract: Systems and techniques are provided for facial image augmentation. An example method can include obtaining a first image capturing a face. Using the first image, the method can determine, using a prediction model, a UV face position map including a two-dimensional (2D) representation of a three-dimensional (3D) structure of the face. The method can generate, based on the UV face position map, a 3D model of the face. The method can generate an extended 3D model of the face by extending the 3D model to include region(s) beyond a boundary of the 3D model. The region(s) can include a forehead region, a region surrounding at least a portion of the face, and/or other region. The method can generate, based on the extended 3D model, a second image depicting the face in a rotated position relative to a position of the face in the first image.
-
Publication number: US20220006922A1
Publication date: 2022-01-06
Application number: US16920219
Filing date: 2020-07-02
Applicant: QUALCOMM Incorporated
Inventor: Jayesh BATHIJA, Taoufik Tani, Ning Bi
Abstract: Various embodiments may include methods and systems for configuring synchronous multipoint photography. Various embodiments may include displaying preview images on initiating and responding devices. Various embodiments may include determining an adjustment to the orientation of a responding device based on the preview images. Various embodiments may include transmitting an instruction configured to enable the responding device to display a notification for adjusting the position or the orientation of the responding device based at least on the adjustment. Various embodiments may include transmitting, to the responding device, a second instruction to enable the responding device to capture a second image at approximately the same time as the initiating device captures a first image. Embodiments further include capturing, via a camera, the first image, receiving, from the responding device, a second image, and generating an image file based on the first image and the second image.
-
Publication number: US11188740B2
Publication date: 2021-11-30
Application number: US16719900
Filing date: 2019-12-18
Applicant: QUALCOMM Incorporated
Inventor: Chun-Ting Huang, Lei Wang, Zhen Wang, Xiaoliang Bai, Ning Bi
Abstract: Methods, systems, and devices for object detection are described. A device may receive an image, and detect, via a first stage of a cascade neural network, object recognition information over one or more angular orientations during a first pass. The device may determine, via a second stage of the cascade neural network, a confidence score associated with one or more of the candidate object in the image, the candidate bounding box associated with the candidate object in the image, or one or more object features of the candidate object in the image, or an orientation of the candidate object in the image, or a combination thereof. The device may identify, via a third stage of the cascade neural network, whether to detect the object recognition information during a second pass based on the confidence score satisfying a threshold.
-
Publication number: US11068741B2
Publication date: 2021-07-20
Application number: US16224644
Filing date: 2018-12-18
Applicant: QUALCOMM Incorporated
Abstract: Techniques and systems are provided for determining features for one or more objects in one or more video frames. For example, an image of an object, such as a face, can be received, and features of the object in the image can be identified. A size of the object can be determined based on the image, for example based on inter-eye distance of a face. Based on the size, either a high-resolution set of features or a low-resolution set of features is selected to compare to the features of the object. The object can be identified by matching the features of the object to matching features from the selected set of features.
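The size-based selection described in this abstract can be illustrated with a minimal Python sketch. It is not the patented method: the threshold value, the dictionary layout of the enrolled feature sets, and the function names (`select_feature_set`, `identify`) are all assumptions made for illustration, and nearest-neighbor matching stands in for whatever matching the patent actually uses.

```python
from math import dist

# Hypothetical threshold in pixels; the abstract does not specify a value.
INTER_EYE_THRESHOLD_PX = 60.0


def inter_eye_distance(left_eye, right_eye):
    """Euclidean distance between the two eye centers, in pixels."""
    return dist(left_eye, right_eye)


def select_feature_set(left_eye, right_eye, high_res_set, low_res_set,
                       threshold=INTER_EYE_THRESHOLD_PX):
    """Choose the enrolled feature set that suits the apparent face size."""
    size = inter_eye_distance(left_eye, right_eye)
    return high_res_set if size >= threshold else low_res_set


def identify(probe, feature_set):
    """Return the label whose enrolled feature vector is closest to the probe."""
    return min(feature_set, key=lambda label: dist(probe, feature_set[label]))
```

A large face (wide inter-eye distance) is compared against the high-resolution set and a small one against the low-resolution set, so the probe features are matched against references extracted at a comparable scale.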
-
Publication number: US10992961B2
Publication date: 2021-04-27
Application number: US15987231
Filing date: 2018-05-23
Applicant: QUALCOMM Incorporated
Inventor: Ye-Kui Wang, Ning Bi, Bijan Forutanpour
IPC: H04N19/17, H04N19/176, H04N19/70, H04N21/854, H04N21/235, H04N21/81, H04N19/46, H04N21/845, H04N21/236
Abstract: An example method includes processing a file including fisheye video data, the file including a syntax structure including a plurality of syntax elements that specify attributes of the fisheye video data, wherein the plurality of syntax elements includes: a first syntax element that explicitly indicates whether the fisheye video data is monoscopic or stereoscopic, and one or more syntax elements that implicitly indicate whether the fisheye video data is monoscopic or stereoscopic; determining, based on the first syntax element, whether the fisheye video data is monoscopic or stereoscopic; and rendering, based on the determination, the fisheye video data as monoscopic or stereoscopic.
-
Publication number: US20200218878A1
Publication date: 2020-07-09
Application number: US16239352
Filing date: 2019-01-03
Applicant: QUALCOMM Incorporated
Inventor: Eyasu Zemene Mequanint, Shuai Zhang, Yingyong Qi, Ning Bi
IPC: G06K9/00, A61B5/11, A61B5/1171, G08B21/18, G06F21/32
Abstract: Methods, systems, and devices for personalized (e.g., user specific) eye openness estimation are described. A network model (e.g., a convolutional neural network) may be trained using a set of synthetic eye openness image data (e.g., synthetic face images with known degrees or percentages of eye openness) and a set of real eye openness image data (e.g., facial images of real persons that are annotated as either open eyed or closed eyed). A device may estimate, using the network model, a multi-stage eye openness level (e.g., a percentage or degree to which an eye is open) of a user based on captured real time eye openness image data. The degree of eye openness estimated by the network model may then be compared to an eye size of the user (e.g., a user specific maximum eye size), and a user specific eye openness level may be estimated based on the comparison.
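The final comparison step in this abstract can be sketched in Python as follows. The abstract does not say how the comparison is computed, so the sketch assumes a simple ratio of the estimated opening to the user's maximum eye size, clamped to the range 0 to 1. The function name `user_specific_openness` and its parameters are hypothetical.

```python
def user_specific_openness(estimated_opening, user_max_eye_size):
    """Normalize an estimated eye opening by the user's own maximum eye size.

    Assumption: both arguments use the same unit (for example, pixels of
    eyelid separation). The result is clamped to the range [0.0, 1.0].
    """
    if user_max_eye_size <= 0:
        raise ValueError("user_max_eye_size must be positive")
    ratio = estimated_opening / user_max_eye_size
    return max(0.0, min(1.0, ratio))
```

Under this assumption, the same absolute opening yields a higher openness level for a user with naturally narrow eyes than for a user with larger eyes, which is the kind of personalization the abstract describes.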
-
Publication number: US20190311183A1
Publication date: 2019-10-10
Application number: US15948676
Filing date: 2018-04-09
Applicant: QUALCOMM Incorporated
Inventor: Lei Wang, Yingyong Qi, Ning Bi
Abstract: Methods, systems, and devices for object recognition are described. A device may generate a subspace based at least in part on a set of representative feature vectors for an object. The device may obtain an array of pixels representing an image. The device may determine a probe feature vector for the image by applying a convolutional operation to the array of pixels. The device may create a reconstructed feature vector in the subspace based at least in part on the set of representative feature vectors and the probe feature vector. The device may compare the reconstructed feature vector and the probe feature vector and recognize the object in the image based at least in part on the comparison. For example, the described techniques may support pose invariant facial recognition or other such object recognition applications.
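The subspace reconstruction and comparison steps can be sketched in pure Python. This is an illustrative sketch under stated assumptions, not the patented implementation: it builds the subspace with Gram-Schmidt orthonormalization, reconstructs the probe by orthogonal projection, and compares by Euclidean distance against a hypothetical threshold. The convolutional feature extraction is omitted, so the probe vector is taken as given.

```python
from math import sqrt


def dot(u, v):
    """Inner product of two equal-length vectors."""
    return sum(a * b for a, b in zip(u, v))


def orthonormal_basis(vectors, eps=1e-12):
    """Gram-Schmidt: orthonormal basis for the span of the given vectors."""
    basis = []
    for v in vectors:
        w = [float(x) for x in v]
        for b in basis:
            coeff = dot(w, b)
            w = [wi - coeff * bi for wi, bi in zip(w, b)]
        norm = sqrt(dot(w, w))
        if norm > eps:  # skip vectors already inside the current span
            basis.append([wi / norm for wi in w])
    return basis


def reconstruct(probe, basis):
    """Orthogonal projection of the probe onto the subspace."""
    recon = [0.0] * len(probe)
    for b in basis:
        coeff = dot(probe, b)
        recon = [r + coeff * bi for r, bi in zip(recon, b)]
    return recon


def reconstruction_error(probe, basis):
    """Euclidean distance between the probe and its reconstruction."""
    recon = reconstruct(probe, basis)
    return sqrt(sum((p - r) ** 2 for p, r in zip(probe, recon)))


def recognize(probe, basis, threshold=0.5):
    """Accept the probe if it lies close to the subspace (threshold is hypothetical)."""
    return reconstruction_error(probe, basis) <= threshold
```

A probe that is well explained by the object's representative feature vectors reconstructs with a small error, while a probe from a different object leaves a large residual.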
-
Publication number: US10379734B2
Publication date: 2019-08-13
Application number: US14764149
Filing date: 2013-02-23
Applicant: QUALCOMM Incorporated
IPC: G06K9/00, G06F3/0488, G06F3/0484, G06T11/00, G06T11/60, G06T17/20
Abstract: A method for interactive image caricaturing by an electronic device is described. The method includes detecting at least one feature location of an image. The method further includes generating, based on the at least one feature location, an image mesh that comprises a grid of at least one horizontal line and at least one vertical line. The method additionally includes obtaining a gesture input. The method also includes determining at least one caricature action based on the at least one gesture input. The method further includes generating a caricature image based on the image mesh, the at least one caricature action and the image.