-
Publication No.: US12039456B2
Publication date: 2024-07-16
Application No.: US18102161
Filing date: 2023-01-27
Inventors: Victor Sergeevich Lempitsky, Aliaksandra Petrovna Shysheya, Egor Olegovich Zakharov, Egor Andreevich Burkov
IPC classification: G06V40/00, G06F16/70, G06N3/08, G06N3/088, G06V10/764, G06V10/774, G06V10/82, G06V20/40, G06V40/16, G06V40/20
CPC classification: G06N3/088, G06F16/70, G06N3/08, G06V10/764, G06V10/7753, G06V10/82, G06V20/41, G06V20/46, G06V40/00, G06V40/168, G06V40/169, G06V40/172, G06V40/179, G06V40/20, G06T2207/10016, G06T2207/20081, G06T2207/20084, G06T2207/30201
Abstract: An electronic device and a controlling method thereof are provided. A controlling method of an electronic device according to the disclosure includes: performing first learning for a neural network model for acquiring a video sequence including a talking head of a random user based on a plurality of learning video sequences including talking heads of a plurality of users, performing second learning for fine-tuning the neural network model based on at least one image including a talking head of a first user different from the plurality of users and first landmark information included in the at least one image, and acquiring a first video sequence including the talking head of the first user based on the at least one image and pre-stored second landmark information using the neural network model for which the first learning and the second learning were performed.
-
Publication No.: US11961205B2
Publication date: 2024-04-16
Application No.: US17282214
Filing date: 2019-11-07
Inventors: Artur Andreevich Grigoriev, Victor Sergeevich Lempitsky, Artem Mikhailovich Sevastopolsky, Alexander Timurovich Vakhitov
CPC classification: G06T3/18, G06N3/045, G06N3/08, G06T5/77, G06T2207/20081, G06T2207/20084
Abstract: An image resynthesis system, a system for training a gap filling module to be used in the image resynthesis system, an image resynthesis method, a computer program product, and a computer-readable medium are provided. The image resynthesis system comprises a source image input module, a forward warping module predicting, for each source image pixel of a source image, a corresponding position in a target image, and predicting a forward warping field which is aligned with the source image, and a gap filling module filling in gaps resulting from application of the forward warping module.
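As a rough illustration of the forward warping and gap filling described in this abstract, the sketch below scatters each source pixel to a target position and records which target pixels received no value. The function names `forward_warp` and `fill_gaps`, the integer offset field, and the nearest-left-neighbour fill are assumptions made for illustration only; the patent describes learned modules, which this sketch does not reproduce.

```python
def forward_warp(source, field):
    """Scatter each source pixel to its predicted target position.

    source: 2D list of pixel values (H x W).
    field:  2D list of (dy, dx) integer offsets, aligned with the source
            (a stand-in for a predicted forward warping field).
    Returns (target, filled); filled[y][x] is False where a gap remains.
    """
    h, w = len(source), len(source[0])
    target = [[0] * w for _ in range(h)]
    filled = [[False] * w for _ in range(h)]
    for y in range(h):
        for x in range(w):
            dy, dx = field[y][x]
            ty, tx = y + dy, x + dx
            if 0 <= ty < h and 0 <= tx < w:  # drop pixels that leave the frame
                target[ty][tx] = source[y][x]
                filled[ty][tx] = True
    return target, filled


def fill_gaps(target, filled):
    """Placeholder gap filler: copy the nearest filled pixel to the left."""
    out = [row[:] for row in target]
    for y, row in enumerate(filled):
        last = None
        for x, ok in enumerate(row):
            if ok:
                last = out[y][x]
            elif last is not None:
                out[y][x] = last
    return out


source = [[1, 2, 3, 4]]
# The first pixel stays in place; the others move one step to the right.
field = [[(0, 0), (0, 1), (0, 1), (0, 1)]]
warped, filled = forward_warp(source, field)
completed = fill_gaps(warped, filled)
```

In this toy case the warp leaves one unfilled position in the row, which is the kind of gap the gap filling module is meant to handle.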
-
Publication No.: US11823349B2
Publication date: 2023-11-21
Application No.: US17697436
Filing date: 2022-03-17
Inventors: Ivan Aleksandrovich Anokhin, Kirill Vladislavovich Demochkin, Taras Andreevich Khakhulin, Gleb Mikhailovich Sterkin, Victor Sergeevich Lempitsky, Denis Mikhailovich Korzhenkov
Abstract: The disclosure relates to a multi-layer perceptron architecture that may be used for image generation. A new architecture for image generators is provided, in which the color value at each pixel is computed independently given the value of a random latent vector and the coordinate of that pixel. No spatial convolutions or similar operations that propagate information across pixels are involved during the synthesis.
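The per-pixel synthesis described in this abstract can be sketched with a tiny multi-layer perceptron that maps a pixel coordinate and a latent vector to a color. Everything concrete here is an assumption for illustration: the names `make_generator` and `render`, the layer sizes, the random untrained weights, and the activation functions are not taken from the patent.

```python
import math
import random


def make_generator(latent_dim, hidden=8, seed=0):
    """Build a tiny random-weight MLP mapping (x, y, latent) -> (r, g, b)."""
    rng = random.Random(seed)
    in_dim = 2 + latent_dim
    w1 = [[rng.uniform(-1, 1) for _ in range(in_dim)] for _ in range(hidden)]
    w2 = [[rng.uniform(-1, 1) for _ in range(hidden)] for _ in range(3)]

    def pixel_color(x, y, latent):
        inputs = [x, y] + list(latent)
        h = [math.tanh(sum(w * v for w, v in zip(row, inputs))) for row in w1]
        # A sigmoid keeps each color channel in (0, 1).
        return tuple(
            1 / (1 + math.exp(-sum(w * v for w, v in zip(row, h))))
            for row in w2
        )

    return pixel_color


def render(pixel_color, latent, height, width):
    """Evaluate every pixel on its own; nothing is shared across pixels."""
    return [
        [pixel_color(x / (width - 1), y / (height - 1), latent)
         for x in range(width)]
        for y in range(height)
    ]


latent = [0.3, -0.7, 0.1]
gen = make_generator(latent_dim=len(latent))
image = render(gen, latent, height=4, width=4)
```

Because each color depends only on the coordinate and the latent vector, any single pixel can be recomputed in isolation and will match the value in the full image.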
-
Publication No.: US11823327B2
Publication date: 2023-11-21
Application No.: US17457078
Filing date: 2021-12-01
CPC classification: G06T15/60, G06F18/2148, G06N3/084, G06T7/194, G06T15/04, G06T15/10, G06T17/20, G06T2207/10016, G06T2207/10152, G06T2207/20081, G06T2207/20084, G06T2207/30201, G06T2207/30244, G06T2215/12
Abstract: The disclosure provides a method for generating a relightable 3D portrait using a deep neural network and a computing device implementing the method. The method makes it possible to obtain, in real time and on computing devices having limited processing resources, realistically relighted 3D portraits of quality higher than or at least comparable to that achieved by prior art solutions, without complex and costly equipment. A method for rendering a relighted 3D portrait of a person includes: receiving an input defining a camera viewpoint and lighting conditions; rasterizing latent descriptors of a 3D point cloud at different resolutions based on the camera viewpoint to obtain rasterized images, wherein the 3D point cloud is generated based on a sequence of images captured by a camera with a blinking flash while moving the camera at least partly around an upper body, the sequence of images comprising a set of flash images and a set of no-flash images; processing the rasterized images with a deep neural network to predict albedo, normals, environmental shadow maps, and a segmentation mask for the received camera viewpoint; and fusing the predicted albedo, normals, environmental shadow maps, and segmentation mask into the relighted 3D portrait based on the lighting conditions.
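The abstract does not state how the predicted maps are fused, so the following is only a hedged sketch of one plausible fusion step: Lambertian shading under a single directional light, scaled by a per-pixel shadow value and the segmentation mask. The function names `relight_pixel` and `relight`, the single-light model, and the scalar shadow map are all assumptions, not the patented method.

```python
def relight_pixel(albedo, normal, shadow, mask, light_dir, light_color):
    """Shade one pixel with an assumed Lambertian model.

    albedo, light_color: RGB tuples; normal, light_dir: unit 3-vectors;
    shadow, mask: scalars in [0, 1].
    """
    # Clamp so that surfaces facing away from the light receive no light.
    n_dot_l = max(0.0, sum(n * l for n, l in zip(normal, light_dir)))
    return tuple(mask * a * c * n_dot_l * shadow
                 for a, c in zip(albedo, light_color))


def relight(albedo, normals, shadows, mask, light_dir, light_color):
    """Fuse per-pixel maps (2D lists of equal size) into a relit image."""
    height, width = len(albedo), len(albedo[0])
    return [
        [relight_pixel(albedo[y][x], normals[y][x], shadows[y][x],
                       mask[y][x], light_dir, light_color)
         for x in range(width)]
        for y in range(height)
    ]


# A 1 x 2 toy input: the second pixel lies outside the segmentation mask.
albedo = [[(0.5, 0.5, 0.5), (1.0, 1.0, 1.0)]]
normals = [[(0.0, 0.0, 1.0), (0.0, 0.0, 1.0)]]
shadows = [[1.0, 0.5]]
mask = [[1.0, 0.0]]
relit = relight(albedo, normals, shadows, mask,
                light_dir=(0.0, 0.0, 1.0), light_color=(1.0, 1.0, 1.0))
```

Changing `light_dir` or `light_color` re-renders the same predicted maps under new lighting, which is the sense in which the maps make the portrait relightable in this simplified model.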
-
Publication No.: US11568645B2
Publication date: 2023-01-31
Application No.: US16823752
Filing date: 2020-03-19
Inventors: Victor Sergeevich Lempitsky, Aliaksandra Petrovna Shysheya, Egor Olegovich Zakharov, Egor Andreevich Burkov
Abstract: An electronic device and a controlling method thereof are provided. A controlling method of an electronic device according to the disclosure includes: performing first learning for a neural network model for acquiring a video sequence including a talking head of a random user based on a plurality of learning video sequences including talking heads of a plurality of users, performing second learning for fine-tuning the neural network model based on at least one image including a talking head of a first user different from the plurality of users and first landmark information included in the at least one image, and acquiring a first video sequence including the talking head of the first user based on the at least one image and pre-stored second landmark information using the neural network model for which the first learning and the second learning were performed.
-