-
11.
Publication No.: US20210374384A1
Publication Date: 2021-12-02
Application No.: US16890941
Filing Date: 2020-06-02
Applicant: NVIDIA Corporation
Abstract: Apparatuses, systems, and techniques to identify one or more layers of a three-dimensional graphical image to generate a two-dimensional representation. In at least one embodiment, one or more layers of a three-dimensional graphical image are identified to generate one or more two-dimensional representations.
-
12.
Publication No.: US20180357753A1
Publication Date: 2018-12-13
Application No.: US15807401
Filing Date: 2017-11-08
Applicant: NVIDIA Corporation
CPC Classification: G06T5/002, G06N3/0454, G06N3/08, G06N3/084, G06T2200/28, G06T2207/20081, G06T2207/20084
Abstract: A method, computer readable medium, and system are disclosed for training a neural network. The method includes the steps of selecting an input sample from a set of training data that includes input samples and noisy target samples, where the input samples and the noisy target samples each correspond to a latent, clean target sample. The input sample is processed by a neural network model to produce an output and a noisy target sample is selected from the set of training data, where the noisy target samples have a distribution relative to the latent, clean target sample. The method also includes adjusting parameter values of the neural network model to reduce differences between the output and the noisy target sample.
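Illustrative note: the abstract above describes adjusting model parameters to reduce the difference between the model output and a noisy target, without access to the clean target. The following is a minimal toy sketch of that idea, not the patented method. It assumes a one-parameter linear model, zero-mean Gaussian noise on the targets, and an L2 loss; the names `train_on_noisy_targets`, `clean_scale`, and `learned_w` are hypothetical.

```python
import numpy as np

def train_on_noisy_targets(inputs, noisy_targets, lr=0.5, epochs=100):
    """Fit a scalar weight w so that w * input matches the noisy targets (L2 loss)."""
    w = 0.0
    for _ in range(epochs):
        output = w * inputs
        # Gradient of mean((output - noisy_target)^2) with respect to w.
        grad = 2.0 * np.mean((output - noisy_targets) * inputs)
        w -= lr * grad
    return w

rng = np.random.default_rng(0)
clean_scale = 3.0                                  # assumed latent relationship
inputs = rng.uniform(-1.0, 1.0, size=5000)
clean_targets = clean_scale * inputs               # latent, never seen by training
noisy_targets = clean_targets + rng.normal(0.0, 0.5, size=inputs.shape)

learned_w = train_on_noisy_targets(inputs, noisy_targets)
print(learned_w)
```

Because the assumed noise is zero-mean, the weight that minimizes the loss against the noisy targets lies close to the one that would be obtained from the clean targets, which this toy only demonstrates for the linear case.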
-
13.
Publication No.: US11861811B2
Publication Date: 2024-01-02
Application No.: US17930668
Filing Date: 2022-09-08
Applicant: NVIDIA Corporation
Inventors: Carl Jacob Munkberg, Jon Niklas Theodor Hasselgren, Anjul Patney, Marco Salvi, Aaron Eliot Lefohn, Donald Lee Brittain
CPC Classification: G06T5/002, G06T7/248, G06T7/50, G06T2207/10016, G06T2207/20084
Abstract: A neural network-based rendering technique increases temporal stability and image fidelity of low sample count path tracing by optimizing a distribution of samples for rendering each image in a sequence. A sample predictor neural network learns spatio-temporal sampling strategies such as placing more samples in dis-occluded regions and tracking specular highlights. Temporal feedback enables a denoiser neural network to boost the effective input sample count and increases temporal stability. The initial uniform sampling step typically present in adaptive sampling algorithms is not needed. The sample predictor and denoiser operate at interactive rates to achieve significantly improved image quality and temporal stability compared with conventional adaptive sampling techniques.
-
14.
Publication No.: US20230140460A1
Publication Date: 2023-05-04
Application No.: US17827918
Filing Date: 2022-05-30
Applicant: NVIDIA Corporation
Inventors: Carl Jacob Munkberg, Jon Niklas Theodor Hasselgren, Tianchang Shen, Jun Gao, Wenzheng Chen, Alex John Bauld Evans, Thomas Müller-Höhne, Sanja Fidler
Abstract: A technique is described for extracting or constructing a three-dimensional (3D) model from multiple two-dimensional (2D) images. In an embodiment, a foreground segmentation mask or depth field may be provided as an additional supervision input with each 2D image. In an embodiment, the foreground segmentation mask or depth field is automatically generated for each 2D image. The constructed 3D model comprises a triangular mesh topology, materials, and environment lighting. The constructed 3D model is represented in a format that can be directly edited and/or rendered by conventional application programs, such as digital content creation (DCC) tools. For example, the constructed 3D model may be represented as a triangular surface mesh (with arbitrary topology), a set of 2D textures representing spatially-varying material parameters, and an environment map. Furthermore, the constructed 3D model may be included in 3D scenes and interacts realistically with other objects.
-
15.
Publication No.: US11615602B2
Publication Date: 2023-03-28
Application No.: US17888207
Filing Date: 2022-08-15
Applicant: NVIDIA Corporation
Abstract: Appearance driven automatic three-dimensional (3D) modeling enables optimization of a 3D model comprising the shape and appearance of a particular 3D scene or object. Triangle meshes and shading models may be jointly optimized to match the appearance of a reference 3D model based on reference images of the reference 3D model. Compared with the reference 3D model, the optimized 3D model is a lower resolution 3D model that can be rendered in less time. More specifically, the optimized 3D model may include fewer geometric primitives compared with the reference 3D model. In contrast with the conventional inverse rendering or analysis-by-synthesis modeling tools, the shape and appearance representations of the 3D model are automatically generated that, when rendered, match the reference images. Appearance driven automatic 3D modeling has a number of uses, including appearance-preserving simplification of extremely complex assets, conversion between rendering systems, and even conversion between geometric scene representations.
-
16.
Publication No.: US20220405582A1
Publication Date: 2022-12-22
Application No.: US17665370
Filing Date: 2022-02-04
Applicant: NVIDIA Corporation
Abstract: A method, computer readable medium, and system are disclosed for training a neural network model. The method includes the step of selecting an input vector from a set of training data that includes input vectors and sparse target vectors, where each sparse target vector includes target data corresponding to a subset of samples within an output vector of the neural network model. The method also includes the steps of processing the input vector by the neural network model to produce output data for the samples within the output vector and adjusting parameter values of the neural network model to reduce differences between the output vector and the sparse target vector for the subset of the samples.
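Illustrative note: the abstract above describes reducing differences between the output vector and a sparse target vector only for the subset of samples that have target data. One common way to express this is a masked loss. The sketch below assumes a mean squared error and a boolean mask marking which samples carry a target; the function name `masked_mse_and_grad` and the example values are hypothetical and are not taken from the patent.

```python
import numpy as np

def masked_mse_and_grad(output, sparse_target, mask):
    """Mean squared error over the masked subset only, plus its gradient w.r.t. output."""
    mask = mask.astype(output.dtype)
    count = mask.sum()
    if count == 0:
        # No target data at all: nothing to learn from this vector.
        return 0.0, np.zeros_like(output)
    diff = (output - sparse_target) * mask   # zero wherever no target exists
    loss = float((diff ** 2).sum() / count)
    grad = 2.0 * diff / count                # unmasked samples get zero gradient
    return loss, grad

output = np.array([1.0, 2.0, 3.0, 4.0])
sparse_target = np.array([0.0, 0.0, 1.0, 0.0])     # only indices 0 and 2 are valid
mask = np.array([True, False, True, False])

loss, grad = masked_mse_and_grad(output, sparse_target, mask)
print(loss, grad)
```

Samples outside the mask contribute nothing to the loss or the gradient, so whatever values sit in the untargeted positions of the sparse vector never influence the parameter update.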
-
17.
Publication No.: US20200051206A1
Publication Date: 2020-02-13
Application No.: US16422601
Filing Date: 2019-05-24
Applicant: NVIDIA Corporation
Abstract: A neural network structure, namely a warped external recurrent neural network, is disclosed for reconstructing images with synthesized effects. The effects can include motion blur, depth of field reconstruction (e.g., simulating lens effects), and/or anti-aliasing (e.g., removing artifacts caused by sampling frequency). The warped external recurrent neural network is not recurrent at each layer inside the neural network. Instead, the external state output by the final layer of the neural network is warped and provided as a portion of the input to the neural network for the next image in a sequence of images. In contrast, in a conventional recurrent neural network, hidden state generated at each layer is provided as a feedback input to the generating layer. The neural network can be implemented, at least in part, on a processor. In an embodiment, the neural network is implemented on at least one parallel processing unit.
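Illustrative note: the abstract above describes feeding the warped output of the final layer back as part of the input for the next image, rather than keeping hidden state inside each layer. The sketch below shows only that data flow. It assumes a single integer (dy, dx) offset per frame instead of per-pixel motion, and it replaces the neural network with a fixed blend; `warp`, `network`, and `run_sequence` are hypothetical names.

```python
import numpy as np

def warp(state, motion):
    """Shift the previous output by an integer (dy, dx) offset; stands in for per-pixel warping."""
    return np.roll(state, shift=motion, axis=(0, 1))

def network(frame, warped_state, blend=0.5):
    """Placeholder for the neural network: blends the current frame with the warped state."""
    return blend * frame + (1.0 - blend) * warped_state

def run_sequence(frames, motions):
    """Process frames in order; only the final output is carried to the next frame."""
    state = np.zeros_like(frames[0])
    outputs = []
    for frame, motion in zip(frames, motions):
        state = network(frame, warp(state, motion))
        outputs.append(state)
    return outputs

# A single bright pixel moving one column to the right per frame.
frames = []
for x in range(3):
    frame = np.zeros((4, 4))
    frame[0, x] = 1.0
    frames.append(frame)
motions = [(0, 1)] * 3

outputs = run_sequence(frames, motions)
print(outputs[2][0, 2])
```

Because the fed-back state is warped to follow the motion, the accumulated value stays aligned with the moving pixel from frame to frame instead of smearing behind it.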
-
18.
Publication No.: US20180357537A1
Publication Date: 2018-12-13
Application No.: US15881632
Filing Date: 2018-01-26
Applicant: NVIDIA Corporation
Abstract: A method, computer readable medium, and system are disclosed for training a neural network model. The method includes the step of selecting an input vector from a set of training data that includes input vectors and sparse target vectors, where each sparse target vector includes target data corresponding to a subset of samples within an output vector of the neural network model. The method also includes the steps of processing the input vector by the neural network model to produce output data for the samples within the output vector and adjusting parameter values of the neural network model to reduce differences between the output vector and the sparse target vector for the subset of the samples.
-
19.
Publication No.: US11967024B2
Publication Date: 2024-04-23
Application No.: US17827918
Filing Date: 2022-05-30
Applicant: NVIDIA Corporation
Inventors: Carl Jacob Munkberg, Jon Niklas Theodor Hasselgren, Tianchang Shen, Jun Gao, Wenzheng Chen, Alex John Bauld Evans, Thomas Müller-Höhne, Sanja Fidler
CPC Classification: G06T17/205, G06N3/084, G06T9/002, G06T15/04, G06T15/506, G06T19/00, G06T2210/36
Abstract: A technique is described for extracting or constructing a three-dimensional (3D) model from multiple two-dimensional (2D) images. In an embodiment, a foreground segmentation mask or depth field may be provided as an additional supervision input with each 2D image. In an embodiment, the foreground segmentation mask or depth field is automatically generated for each 2D image. The constructed 3D model comprises a triangular mesh topology, materials, and environment lighting. The constructed 3D model is represented in a format that can be directly edited and/or rendered by conventional application programs, such as digital content creation (DCC) tools. For example, the constructed 3D model may be represented as a triangular surface mesh (with arbitrary topology), a set of 2D textures representing spatially-varying material parameters, and an environment map. Furthermore, the constructed 3D model may be included in 3D scenes and interacts realistically with other objects.
-
20.
Publication No.: US11657571B2
Publication Date: 2023-05-23
Application No.: US18065555
Filing Date: 2022-12-13
Applicant: NVIDIA Corporation
IPC Classification: G06T17/20
CPC Classification: G06T17/20
Abstract: Systems and methods enable optimization of a 3D model representation comprising the shape and appearance of a particular 3D scene or object. The opaque 3D mesh (e.g., vertex positions and corresponding topology) and spatially varying material attributes are jointly optimized based on image space losses to match multiple image observations (e.g., reference images of the reference 3D scene or object). A geometric topology defines faces and/or cells in the opaque 3D mesh that are visible and may be randomly initialized and optimized through training based on the image space losses. Applying the geometry topology to an opaque 3D mesh for learning the shape improves accuracy of silhouette edges and performance compared with using transparent mesh representations. In contrast with approaches that require an initial guess for the topology and/or an exhaustive testing of possible geometric topologies, the 3D model representation is learned based on image space differences without requiring an initial guess.