EXTRACTING TRIANGULAR 3-D MODELS, MATERIALS, AND LIGHTING FROM IMAGES

    公开(公告)号:US20230140460A1

    公开(公告)日:2023-05-04

    申请号:US17827918

    申请日:2022-05-30

    摘要: A technique is described for extracting or constructing a three-dimensional (3D) model from multiple two-dimensional (2D) images. In an embodiment, a foreground segmentation mask or depth field may be provided as an additional supervision input with each 2D image. In an embodiment, the foreground segmentation mask or depth field is automatically generated for each 2D image. The constructed 3D model comprises a triangular mesh topology, materials, and environment lighting. The constructed 3D model is represented in a format that can be directly edited and/or rendered by conventional application programs, such as digital content creation (DCC) tools. For example, the constructed 3D model may be represented as a triangular surface mesh (with arbitrary topology), a set of 2D textures representing spatially-varying material parameters, and an environment map. Furthermore, the constructed 3D model may be included in 3D scenes and interacts realistically with other objects.

    Appearance-driven automatic three-dimensional modeling

    公开(公告)号:US11615602B2

    公开(公告)日:2023-03-28

    申请号:US17888207

    申请日:2022-08-15

    摘要: Appearance driven automatic three-dimensional (3D) modeling enables optimization of a 3D model comprising the shape and appearance of a particular 3D scene or object. Triangle meshes and shading models may be jointly optimized to match the appearance of a reference 3D model based on reference images of the reference 3D model. Compared with the reference 3D model, the optimized 3D model is a lower resolution 3D model that can be rendered in less time. More specifically, the optimized 3D model may include fewer geometric primitives compared with the reference 3D model. In contrast with the conventional inverse rendering or analysis-by-synthesis modeling tools, the shape and appearance representations of the 3D model are automatically generated that, when rendered, match the reference images. Appearance driven automatic 3D modeling has a number of uses, including appearance-preserving simplification of extremely complex assets, conversion between rendering systems, and even conversion between geometric scene representations.

    SYSTEMS AND METHODS FOR TRAINING NEURAL NETWORKS WITH SPARSE DATA

    公开(公告)号:US20220405582A1

    公开(公告)日:2022-12-22

    申请号:US17665370

    申请日:2022-02-04

    IPC分类号: G06N3/08 G06N5/04

    摘要: A method, computer readable medium, and system are disclosed for training a neural network model. The method includes the step of selecting an input vector from a set of training data that includes input vectors and sparse target vectors, where each sparse target vector includes target data corresponding to a subset of samples within an output vector of the neural network model. The method also includes the steps of processing the input vector by the neural network model to produce output data for the samples within the output vector and adjusting parameter values of the neural network model to reduce differences between the output vector and the sparse target vector for the subset of the samples.

    MOTION BLUR AND DEPTH OF FIELD RECONSTRUCTION THROUGH TEMPORALLY STABLE NEURAL NETWORKS

    公开(公告)号:US20200051206A1

    公开(公告)日:2020-02-13

    申请号:US16422601

    申请日:2019-05-24

    IPC分类号: G06T3/00 G06T5/00 G06T1/20

    摘要: A neural network structure, namely a warped external recurrent neural network, is disclosed for reconstructing images with synthesized effects. The effects can include motion blur, depth of field reconstruction (e.g., simulating lens effects), and/or anti-aliasing (e.g., removing artifacts caused by sampling frequency). The warped external recurrent neural network is not recurrent at each layer inside the neural network. Instead, the external state output by the final layer of the neural network is warped and provided as a portion of the input to the neural network for the next image in a sequence of images. In contrast, in a conventional recurrent neural network, hidden state generated at each layer is provided as a feedback input to the generating layer. The neural network can be implemented, at least in part, on a processor. In an embodiment, the neural network is implemented on at least one parallel processing unit.

    SYSTEMS AND METHODS FOR TRAINING NEURAL NETWORKS WITH SPARSE DATA

    公开(公告)号:US20180357537A1

    公开(公告)日:2018-12-13

    申请号:US15881632

    申请日:2018-01-26

    IPC分类号: G06N3/08 G06N5/04

    CPC分类号: G06N3/08 G06N5/04

    摘要: A method, computer readable medium, and system are disclosed for training a neural network model. The method includes the step of selecting an input vector from a set of training data that includes input vectors and sparse target vectors, where each sparse target vector includes target data corresponding to a subset of samples within an output vector of the neural network model. The method also includes the steps of processing the input vector by the neural network model to produce output data for the samples within the output vector and adjusting parameter values of the neural network model to reduce differences between the output vector and the sparse target vector for the subset of the samples.

    Joint shape and appearance optimization through topology sampling

    公开(公告)号:US11657571B2

    公开(公告)日:2023-05-23

    申请号:US18065555

    申请日:2022-12-13

    IPC分类号: G06T17/20

    CPC分类号: G06T17/20

    摘要: Systems and methods enable optimization of a 3D model representation comprising the shape and appearance of a particular 3D scene or object. The opaque 3D mesh (e.g., vertex positions and corresponding topology) and spatially varying material attributes are jointly optimized based on image space losses to match multiple image observations (e.g., reference images of the reference 3D scene or object). A geometric topology defines faces and/or cells in the opaque 3D mesh that are visible and may be randomly initialized and optimized through training based on the image space losses. Applying the geometry topology to an opaque 3D mesh for learning the shape improves accuracy of silhouette edges and performance compared with using transparent mesh representations. In contrast with approaches that require an initial guess for the topology and/or an exhaustive testing of possible geometric topologies, the 3D model representation is learned based on image space differences without requiring an initial guess.