-
公开(公告)号:US20230280702A1
公开(公告)日:2023-09-07
申请号:US18049263
申请日:2022-10-24
Applicant: Qualcomm Incorporated
Inventor: Mohammad NAGHSHVAR , Ahmed Kamel SADEK , Auke Joris WIGGERS
CPC classification number: G05B13/027 , G05D1/0221 , G05D1/0231 , G05D1/0088 , G05D1/0257 , G06N3/045 , G06V10/764 , G06V10/811 , G06V10/82 , G06V20/56
Abstract: A method includes determining a current state of an environment of an autonomous agent, such as a vehicle. The method also includes determining, via a first neural network, a set of actions based on the current state. The method further includes determining whether further analysis of the set of actions is desired. The method selects an action from the set of actions using a model-based solution based on a reward and a risk of the action when further analysis is desired. The method also includes selecting the action from the set of actions according to a metric when further analysis is not desired. The method controls the autonomous agent to perform the selected action.
-
公开(公告)号:US20240378698A1
公开(公告)日:2024-11-14
申请号:US18362589
申请日:2023-07-31
Applicant: QUALCOMM Incorporated
Inventor: Jens PETERSEN , Michal Jakub STYPULKOWSKI , Noor Fathima Khanum MOHAMED GHOUSE , Auke Joris WIGGERS , Guillaume Konrad SAUTIERE
Abstract: Systems and techniques are provided for processing image data. According to some aspects, a computing device can determine an optical flow between a current frame having a first resolution and a first previous frame having the first resolution. The computing device can warp a second previous frame having a second resolution based on the determined optical flow to generate a warped previous frame having the second resolution, the second resolution being higher than the first resolution. The computing device can process, using a diffusion machine learning model, a noise frame, the current frame, and the warped previous frame to generate an output frame having the second resolution.
-
公开(公告)号:US20240121398A1
公开(公告)日:2024-04-11
申请号:US18458006
申请日:2023-08-29
Applicant: QUALCOMM Incorporated
Inventor: Noor Fathima Khanum MOHAMED GHOUSE , Jens PETERSEN , Tianlin XU , Guillaume Konrad SAUTIERE , Auke Joris WIGGERS
IPC: H04N19/137 , H04N19/147 , H04N19/162
CPC classification number: H04N19/137 , H04N19/147 , H04N19/162
Abstract: Systems and techniques are described for processing image data using a residual model that can be configured with an adjustable number of sampling steps. For example, a process can include obtaining a latent representation of an image and processing, using a decoder of a machine learning model, the latent representation of the image to generate an initial reconstructed image. The process can further include processing, using the residual model, the initial reconstructed image and noise data to predict a plurality of predictions of a residual over a number of sampling steps. The residual represents a difference between the image and the initial reconstructed image. The process can include obtaining, from the plurality of predictions of the residual, a final residual representing the difference between the image and the initial reconstructed image. The process can further include combining the initial reconstructed image and the residual to generate a final reconstructed image.
-
公开(公告)号:US20240364925A1
公开(公告)日:2024-10-31
申请号:US18636126
申请日:2024-04-15
Applicant: QUALCOMM Incorporated
Inventor: Hoang Cong Minh LE , Qiqi HOU , Farzad FARHADZADEH , Amir SAID , Auke Joris WIGGERS , Guillaume Konrad SAUTIERE , Reza POURREZA
IPC: H04N19/597 , H04N19/137 , H04N19/436
CPC classification number: H04N19/597 , H04N19/137 , H04N19/436
Abstract: Systems and techniques are described herein for processing video data. For example, a machine-learning based stereo video coding system can obtain video data including at least a right-view image of a right view of a scene and a left-view image of a left view of the scene. The machine-learning based stereo video coding system can compress the right-view image and the left-view image in parallel to generate a latent representation of the right-view image and the left-view image. The right-view image and the left-view image can be compressed in parallel based on inter-view information between the right-view image and the left-view image, determined using one or more parallel autoencoders.
-
公开(公告)号:US20240323415A1
公开(公告)日:2024-09-26
申请号:US18188070
申请日:2023-03-22
Applicant: QUALCOMM Incorporated
Inventor: David Wilson ROMERO GUZMAN , Gabriele CESA , Guillaume Konrad SAUTIERE , Yunfan ZHANG , Taco Sebastiaan COHEN , Auke Joris WIGGERS
IPC: H04N19/42 , G06T3/40 , H04N19/182
CPC classification number: H04N19/42 , G06T3/4046 , H04N19/182
Abstract: Certain aspects of the present disclosure provide techniques and apparatus for encoding content using a neural network. An example method generally includes encoding video content into a latent space representation through an encoder implemented by a first machine learning model. A code is generated by upsampling the latent space representation of the video content. A prior is calculated based on a conditional probability of obtaining the upsampled latent space representation conditioned by the latent space representation of the video content. A compressed version of the video content is generated based on a probabilistic model implemented by a second machine learning model, the generated code, and the calculated prior, and the compressed version of the video content is output for transmission.
-
公开(公告)号:US20240015318A1
公开(公告)日:2024-01-11
申请号:US17862217
申请日:2022-07-11
Applicant: QUALCOMM Incorporated
Inventor: Reza POURREZA , Hoang Cong Minh LE , Auke Joris WIGGERS
IPC: H04N19/52 , H04N19/137 , H04N19/105 , H04N19/172
CPC classification number: H04N19/52 , H04N19/137 , H04N19/105 , H04N19/172
Abstract: Systems and techniques are provided for coding video data based on an optical flow correction and a residual correction. For example, a decoding device can obtain a frame of encoded video data associated with an input frame, the frame of encoded video data including an optical flow correction and a residual correction. A predicted optical flow can be generated based on one or more reference frames and a reference optical flow. A corrected prediction frame can be generated based on the predicted optical flow and the optical flow correction. A predicted residual can be generated based on at least the corrected prediction frame and a first reference frame included in the one or more reference frames. The decoding device can generate a reconstructed input frame based on the corrected prediction frame, the predicted residual, and the residual correction.
-
公开(公告)号:US20200150672A1
公开(公告)日:2020-05-14
申请号:US16683129
申请日:2019-11-13
Applicant: QUALCOMM Incorporated
Inventor: Mohammad NAGHSHVAR , Ahmed Kamel SADEK , Auke Joris WIGGERS
Abstract: A method includes determining a current state of an environment of an autonomous agent, such as a vehicle. The method also includes determining, via a first neural network, a set of actions based on the current state. The method further includes determining whether further analysis of the set of actions is desired. The method selects an action from the set of actions using a model-based solution based on a reward and a risk of the action when further analysis is desired. The method also includes selecting the action from the set of actions according to a metric when further analysis is not desired. The method controls the autonomous agent to perform the selected action.
-
-
-
-
-
-