EQUIVARIANT LANDMARK TRANSFORMATION FOR LANDMARK LOCALIZATION

    公开(公告)号:US20180365512A1

    公开(公告)日:2018-12-20

    申请号:US16006728

    申请日:2018-06-12

    Abstract: A method, computer readable medium, and system are disclosed to generate coordinates of landmarks within images. The landmark locations may be identified on an image of a human face and used for emotion recognition, face identity verification, eye gaze tracking, pose estimation, etc. A transform is applied to input image data to produce transformed input image data. The transform is also applied to predicted coordinates for landmarks of the input image data to produce transformed predicted coordinates. A neural network model processes the transformed input image data to generate additional landmarks of the transformed input image data and additional predicted coordinates for each one of the additional landmarks. Parameters of the neural network model are updated to reduce differences between the transformed predicted coordinates and the additional predicted coordinates.

    Model-based three-dimensional head pose estimation

    公开(公告)号:US09830703B2

    公开(公告)日:2017-11-28

    申请号:US14825129

    申请日:2015-08-12

    Abstract: One embodiment of the present invention sets forth a technique for estimating a head pose of a user. The technique includes acquiring depth data associated with a head of the user and initializing each particle included in a set of particles with a different candidate head pose. The technique further includes performing one or more optimization passes that include performing at least one iterative closest point (ICP) iteration for each particle and performing at least one particle swarm optimization (PSO) iteration. Each ICP iteration includes rendering the three-dimensional reference model based on the candidate head pose associated with the particle and comparing the three-dimensional reference model to the depth data. Each PSO iteration comprises updating a global best head pose associated with the set of particles and modifying at least one candidate head pose. The technique further includes modifying a shape of the three-dimensional reference model based on depth data.

    MIXED PRIMARY DISPLAY WITH SPATIALLY MODULATED BACKLIGHT
    34.
    发明申请
    MIXED PRIMARY DISPLAY WITH SPATIALLY MODULATED BACKLIGHT 审中-公开
    具有空调调制背光的混合主显示

    公开(公告)号:US20160307482A1

    公开(公告)日:2016-10-20

    申请号:US15130886

    申请日:2016-04-15

    Abstract: A method, computer readable medium, and system are disclosed for generating mixed-primary data for display. The method includes the steps of receiving a source image that includes a plurality of pixels, dividing the source image into a plurality of blocks, analyzing the source image based on an image decomposition algorithm, encoding chroma information and modulation information to generate a video signal, and transmitting the video signal to a mixed-primary display. The chroma information and modulation information correspond with two or more mixed-primary color components and are generated by the image decomposition algorithm to minimize error between a reproduced image and the source image. The two or more mixed-primary colors selected for each block of the source image are not limited to any particular set of colors and each mixed-primary color component may be selected from any color capable of being reproduced by the mixed-primary display.

    Abstract translation: 公开了一种用于生成用于显示的混合主数据的方法,计算机可读介质和系统。 该方法包括以下步骤:接收包括多个像素的源图像,将源图像划分为多个块,基于图像分解算法分析源图像,对色度信息和调制信息进行编码以产生视频信号, 并将视频信号发送到混合主显示器。 色度信息和调制信息与两个或更多个混合原色分量相对应,并且由图像分解算法产生,以最小化再现图像与源图像之间的误差。 为源图像的每个块选择的两个或多个混合原色不限于任何特定的颜色集合,并且可以从能够由混合主显示器再现的任何颜色中选择每个混合原色分量。

    UNIFIED OPTIMIZATION METHOD FOR END-TO-END CAMERA IMAGE PROCESSING FOR TRANSLATING A SENSOR CAPTURED IMAGE TO A DISPLAY IMAGE
    35.
    发明申请
    UNIFIED OPTIMIZATION METHOD FOR END-TO-END CAMERA IMAGE PROCESSING FOR TRANSLATING A SENSOR CAPTURED IMAGE TO A DISPLAY IMAGE 有权
    用于将传感器捕获的图像转换为显示图像的端到端相机图像处理的统一优化方法

    公开(公告)号:US20150206504A1

    公开(公告)日:2015-07-23

    申请号:US14600507

    申请日:2015-01-20

    Abstract: A computer implemented method of determining a latent image from an observed image is disclosed. The method comprises implementing a plurality of image processing operations within a single optimization framework, wherein the single optimization framework comprises solving a linear minimization expression. The method further comprises mapping the linear minimization expression onto at least one non-linear solver. Further, the method comprises using the non-linear solver, iteratively solving the linear minimization expression in order to extract the latent image from the observed image, wherein the linear minimization expression comprises: a data term, and a regularization term, and wherein the regularization term comprises a plurality of non-linear image priors.

    Abstract translation: 公开了一种从观察图像确定潜像的计算机实现方法。 该方法包括在单个优化框架内实现多个图像处理操作,其中单个优化框架包括求解线性最小化表达式。 该方法还包括将线性最小化表达映射到至少一个非线性求解器上。 此外,该方法包括使用非线性求解器,迭代地求解线性最小化表达以从观察图像中提取潜像,其中线性最小化表达式包括:数据项和正则化项,其中正则化 术语包括多个非线性图像先验。

    PHYSICS-GUIDED MOTION DIFFUSION MODEL
    37.
    发明公开

    公开(公告)号:US20240169636A1

    公开(公告)日:2024-05-23

    申请号:US18317378

    申请日:2023-05-15

    Abstract: Systems and methods are disclosed that improve performance of synthesized motion generated by a diffusion neural network model. A physics-guided motion diffusion model incorporates physical constraints into the diffusion process to model the complex dynamics induced by forces and contact. Specifically, a physics-based motion projection module uses motion imitation in a physics simulator to project the denoised motion of a diffusion step to a physically plausible motion. The projected motion is further used in the next diffusion iteration to guide the denoising diffusion process. The use of physical constraints in the physics-guided motion diffusion model iteratively pulls the motion toward a physically-plausible space, reducing artifacts such as floating, foot sliding, and ground penetration.

    Future object trajectory predictions for autonomous machine applications

    公开(公告)号:US11989642B2

    公开(公告)日:2024-05-21

    申请号:US17952866

    申请日:2022-09-26

    CPC classification number: G06N3/044 B60W40/02 G06N3/08 G06N3/045

    Abstract: In various examples, historical trajectory information of objects in an environment may be tracked by an ego-vehicle and encoded into a state feature. The encoded state features for each of the objects observed by the ego-vehicle may be used—e.g., by a bi-directional long short-term memory (LSTM) network—to encode a spatial feature. The encoded spatial feature and the encoded state feature for an object may be used to predict lateral and/or longitudinal maneuvers for the object, and the combination of this information may be used to determine future locations of the object. The future locations may be used by the ego-vehicle to determine a path through the environment, or may be used by a simulation system to control virtual objects—according to trajectories determined from the future locations—through a simulation environment.

    POSE TRANSFER FOR THREE-DIMENSIONAL CHARACTERS USING A LEARNED SHAPE CODE

    公开(公告)号:US20240070987A1

    公开(公告)日:2024-02-29

    申请号:US18110287

    申请日:2023-02-15

    CPC classification number: G06T19/00 G06T7/10 G06T17/20

    Abstract: Transferring pose to three-dimensional characters is a common computer graphics task that typically involves transferring the pose of a reference avatar to a (stylized) three-dimensional character. Since three-dimensional characters are created by professional artists through imagination and exaggeration, and therefore, unlike human or animal avatars, have distinct shape and features, matching the pose of a three-dimensional character to that of a reference avatar generally requires manually creating shape information for the three-dimensional character that is required for pose transfer. The present disclosure provides for the automated transfer of a reference pose to a three-dimensional character, based specifically on a learned shape code for the three-dimensional character.

Patent Agency Ranking