-
公开(公告)号:US20230110206A1
公开(公告)日:2023-04-13
申请号:US18079772
申请日:2022-12-12
Applicant: NVIDIA Corporation
Inventor: Tero Tapani Karras , Samuli Matias Laine , David Patrick Luebke , Jaakko T. Lehtinen , Miika Samuli Aittala , Timo Oskari Aila , Ming-Yu Liu , Arun Mohanray Mallya , Ting-Chun Wang
Abstract: A latent code defined in an input space is processed by the mapping neural network to produce an intermediate latent code defined in an intermediate latent space. The intermediate latent code may be used as appearance vector that is processed by the synthesis neural network to generate an image. The appearance vector is a compressed encoding of data, such as video frames including a person's face, audio, and other data. Captured images may be converted into appearance vectors at a local device and transmitted to a remote device using much less bandwidth compared with transmitting the captured images. A synthesis neural network at the remote device reconstructs the images for display.
-
公开(公告)号:US20220392162A1
公开(公告)日:2022-12-08
申请号:US17718172
申请日:2022-04-11
Applicant: NVIDIA Corporation
Inventor: Tianchang Shen , Jun Gao , Kangxue Yin , Ming-Yu Liu , Sanja Fidler
Abstract: In various examples, a deep three-dimensional (3D) conditional generative model is implemented that can synthesize high resolution 3D shapes using simple guides—such as coarse voxels, point clouds, etc.—by marrying implicit and explicit 3D representations into a hybrid 3D representation. The present approach may directly optimize for the reconstructed surface, allowing for the synthesis of finer geometric details with fewer artifacts. The systems and methods described herein may use a deformable tetrahedral grid that encodes a discretized signed distance function (SDF) and a differentiable marching tetrahedral layer that converts the implicit SDF representation to an explicit surface mesh representation. This combination allows joint optimization of the surface geometry and topology as well as generation of the hierarchy of subdivisions using reconstruction and adversarial losses defined explicitly on the surface mesh.
-
公开(公告)号:US20220207770A1
公开(公告)日:2022-06-30
申请号:US17165701
申请日:2021-02-02
Applicant: NVIDIA Corporation
Inventor: Ming-Yu Liu , Ting-Chun Wang , Xihui Liu
Abstract: Apparatuses, systems, and techniques to produce an image of a first subject positioned in a pose demonstrated by an image of a second subject. In at least one embodiment, an image of a first subject can be generated from a variety of points of view.
-
公开(公告)号:US20220114698A1
公开(公告)日:2022-04-14
申请号:US17065780
申请日:2020-10-08
Applicant: Nvidia Corporation
Inventor: Ming-Yu Liu
Abstract: Apparatuses, systems, and techniques are presented to generate images. In at least one embodiment, one or more neural networks are used to adjust one or more aspect ratios of one or more objects of one or more images based, at least in part, on input from one or more users.
-
公开(公告)号:US20210374552A1
公开(公告)日:2021-12-02
申请号:US16889376
申请日:2020-06-01
Applicant: Nvidia Corporation
Inventor: Arun Mallya , Ting-Chun Wang , Ming-Yu Liu , Karan Spara
Abstract: Apparatuses, systems, and techniques are presented to synthesize consistent images or video. In at least one embodiment, one or more neural networks are used to generate one or more second images based, at least in part, on one or more point cloud representations of one or more first images.
-
公开(公告)号:US11017556B2
公开(公告)日:2021-05-25
申请号:US16152303
申请日:2018-10-04
Applicant: NVIDIA Corporation
Inventor: Xiaodong Yang , Xitong Yang , Fanyi Xiao , Ming-Yu Liu , Jan Kautz
Abstract: Iterative prediction systems and methods for the task of action detection process an inputted sequence of video frames to generate an output of both action tubes and respective action labels, wherein the action tubes comprise a sequence of bounding boxes on each video frame. An iterative predictor processes large offsets between the bounding boxes and the ground-truth.
-
27.
公开(公告)号:US20200242774A1
公开(公告)日:2020-07-30
申请号:US16721852
申请日:2019-12-19
Applicant: Nvidia Corporation
Inventor: Taesung Park , Ming-Yu Liu , Ting-Chun Wang , Junyan Zhu
Abstract: A user can create a basic semantic layout that includes two or more regions identified by the user, each region being associated with a semantic label indicating a type of object(s) to be rendered in that region. The semantic layout can be provided as input to an image synthesis network. The network can be a trained machine learning network, such as a generative adversarial network (GAN), that includes a conditional, spatially-adaptive normalization layer for propagating semantic information from the semantic layout to other layers of the network. The synthesis can involve both normalization and de-normalization, where each region of the layout can utilize different normalization parameter values. An image is inferred from the network, and rendered for display to the user. The user can change labels or regions in order to cause a new or updated image to be generated.
-
公开(公告)号:US20200160178A1
公开(公告)日:2020-05-21
申请号:US16685795
申请日:2019-11-15
Applicant: NVIDIA Corporation
Inventor: Amlan Kar , Aayush Prakash , Ming-Yu Liu , David Jesus Acuna Marrero , Antonio Torralba Barriuso , Sanja Fidler
IPC: G06N3/08 , G06F16/901 , G06N3/04 , G06T11/60
Abstract: In various examples, a generative model is used to synthesize datasets for use in training a downstream machine learning model to perform an associated task. The synthesized datasets may be generated by sampling a scene graph from a scene grammar—such as a probabilistic grammar—and applying the scene graph to the generative model to compute updated scene graphs more representative of object attribute distributions of real-world datasets. The downstream machine learning model may be validated against a real-world validation dataset, and the performance of the model on the real-world validation dataset may be used as an additional factor in further training or fine-tuning the generative model for generating the synthesized datasets specific to the task of the downstream machine learning model.
-
公开(公告)号:US10595039B2
公开(公告)日:2020-03-17
申请号:US15939098
申请日:2018-03-28
Applicant: NVIDIA Corporation
Inventor: Ming-Yu Liu , Xiaodong Yang , Jan Kautz , Sergey Tulyakov
IPC: H04N19/513 , G06K9/00 , G06N3/08 , G06T13/40 , G06N3/04
Abstract: A method, computer readable medium, and system are disclosed for action video generation. The method includes the steps of generating, by a recurrent neural network, a sequence of motion vectors from a first set of random variables and receiving, by a generator neural network, the sequence of motion vectors and a content vector sample. The sequence of motion vectors and the content vector sample are sampled by the generator neural network to produce a video clip.
-
公开(公告)号:US20250061153A1
公开(公告)日:2025-02-20
申请号:US18935222
申请日:2024-11-01
Applicant: Nvidia Corporation
Inventor: Hang Chu , Daiqing Li , David Jesus Acuna Marrero , Amlan Kar , Maria Shugrina , Ming-Yu Liu , Antonio Torralba Barriuso , Sanja Fidler
IPC: G06F16/901 , G06F30/13 , G06F30/27 , G06N3/044 , G06N3/045 , G06N3/047 , G06N3/08 , G06N3/084 , G06N5/04 , G06N20/10 , G06N20/20 , G06V10/764 , G06V10/82 , G06V10/84 , G06V20/10
Abstract: A generative model can be used for generation of spatial layouts and graphs. Such a model can progressively grow these layouts and graphs based on local statistics, where nodes can represent spatial control points of the layout, and edges can represent segments or paths between nodes, such as may correspond to road segments. A generative model can utilize an encoder-decoder architecture where the encoder is a recurrent neural network (RNN) that encodes local incoming paths into a node and the decoder is another RNN that generates outgoing nodes and edges connecting an existing node to the newly generated nodes. Generation is done iteratively, and can finish once all nodes are visited or another end condition is satisfied. Such a model can generate layouts by additionally conditioning on a set of attributes, giving control to a user in generating the layout.
-
-
-
-
-
-
-
-
-