-
Publication No.: US20240121398A1
Publication date: 2024-04-11
Application No.: US18458006
Application date: 2023-08-29
Inventors: Noor Fathima Khanum MOHAMED GHOUSE, Jens PETERSEN, Tianlin XU, Guillaume Konrad SAUTIERE, Auke Joris WIGGERS
IPC classification: H04N19/137, H04N19/147, H04N19/162
CPC classification: H04N19/137, H04N19/147, H04N19/162
Abstract: Systems and techniques are described for processing image data using a residual model that can be configured with an adjustable number of sampling steps. For example, a process can include obtaining a latent representation of an image and processing, using a decoder of a machine learning model, the latent representation of the image to generate an initial reconstructed image. The process can further include processing, using the residual model, the initial reconstructed image and noise data to predict a plurality of predictions of a residual over a number of sampling steps. The residual represents a difference between the image and the initial reconstructed image. The process can include obtaining, from the plurality of predictions of the residual, a final residual representing the difference between the image and the initial reconstructed image. The process can further include combining the initial reconstructed image and the residual to generate a final reconstructed image.
-
Publication No.: US20230280702A1
Publication date: 2023-09-07
Application No.: US18049263
Application date: 2022-10-24
IPC classification: G05B13/02, G05D1/02, G05D1/00, G06N3/045, G06V10/764, G06V10/80, G06V10/82, G06V20/56
CPC classification: G05B13/027, G05D1/0221, G05D1/0231, G05D1/0088, G05D1/0257, G06N3/045, G06V10/764, G06V10/811, G06V10/82, G06V20/56
Abstract: A method includes determining a current state of an environment of an autonomous agent, such as a vehicle. The method also includes determining, via a first neural network, a set of actions based on the current state. The method further includes determining whether further analysis of the set of actions is desired. The method selects an action from the set of actions using a model-based solution based on a reward and a risk of the action when further analysis is desired. The method also includes selecting the action from the set of actions according to a metric when further analysis is not desired. The method controls the autonomous agent to perform the selected action.
-
Publication No.: US20240364925A1
Publication date: 2024-10-31
Application No.: US18636126
Application date: 2024-04-15
Inventors: Hoang Cong Minh LE, Qiqi HOU, Farzad FARHADZADEH, Amir SAID, Auke Joris WIGGERS, Guillaume Konrad SAUTIERE, Reza POURREZA
IPC classification: H04N19/597, H04N19/137, H04N19/436
CPC classification: H04N19/597, H04N19/137, H04N19/436
Abstract: Systems and techniques are described herein for processing video data. For example, a machine-learning based stereo video coding system can obtain video data including at least a right-view image of a right view of a scene and a left-view image of a left view of the scene. The machine-learning based stereo video coding system can compress the right-view image and the left-view image in parallel to generate a latent representation of the right-view image and the left-view image. The right-view image and the left-view image can be compressed in parallel based on inter-view information between the right-view image and the left-view image, determined using one or more parallel autoencoders.
-
Publication No.: US20240323415A1
Publication date: 2024-09-26
Application No.: US18188070
Application date: 2023-03-22
Inventors: David Wilson ROMERO GUZMAN, Gabriele CESA, Guillaume Konrad SAUTIERE, Yunfan ZHANG, Taco Sebastiaan COHEN, Auke Joris WIGGERS
IPC classification: H04N19/42, G06T3/40, H04N19/182
CPC classification: H04N19/42, G06T3/4046, H04N19/182
Abstract: Certain aspects of the present disclosure provide techniques and apparatus for encoding content using a neural network. An example method generally includes encoding video content into a latent space representation through an encoder implemented by a first machine learning model. A code is generated by upsampling the latent space representation of the video content. A prior is calculated based on a conditional probability of obtaining the upsampled latent space representation conditioned by the latent space representation of the video content. A compressed version of the video content is generated based on a probabilistic model implemented by a second machine learning model, the generated code, and the calculated prior, and the compressed version of the video content is output for transmission.
-
Publication No.: US20240015318A1
Publication date: 2024-01-11
Application No.: US17862217
Application date: 2022-07-11
IPC classification: H04N19/52, H04N19/137, H04N19/105, H04N19/172
CPC classification: H04N19/52, H04N19/137, H04N19/105, H04N19/172
Abstract: Systems and techniques are provided for coding video data based on an optical flow correction and a residual correction. For example, a decoding device can obtain a frame of encoded video data associated with an input frame, the frame of encoded video data including an optical flow correction and a residual correction. A predicted optical flow can be generated based on one or more reference frames and a reference optical flow. A corrected prediction frame can be generated based on the predicted optical flow and the optical flow correction. A predicted residual can be generated based on at least the corrected prediction frame and a first reference frame included in the one or more reference frames. The decoding device can generate a reconstructed input frame based on the corrected prediction frame, the predicted residual, and the residual correction.
-
Publication No.: US20200150672A1
Publication date: 2020-05-14
Application No.: US16683129
Application date: 2019-11-13
Abstract: A method includes determining a current state of an environment of an autonomous agent, such as a vehicle. The method also includes determining, via a first neural network, a set of actions based on the current state. The method further includes determining whether further analysis of the set of actions is desired. The method selects an action from the set of actions using a model-based solution based on a reward and a risk of the action when further analysis is desired. The method also includes selecting the action from the set of actions according to a metric when further analysis is not desired. The method controls the autonomous agent to perform the selected action.