-
公开(公告)号:US20250166133A1
公开(公告)日:2025-05-22
申请号:US18596543
申请日:2024-03-05
Applicant: QUALCOMM Incorporated
Inventor: Kumara KAHATAPITIYA , Davide ABATI , Amirhossein HABIBIAN , Yuki ASANO
Abstract: Systems and techniques are described herein for modifying video data. For instance, a method for modifying video data is provided. The method may include obtaining first tokens based on a first frame of video data, wherein each of the first tokens comprises a feature vector corresponding to a respective location within the first frame of video data; obtaining second tokens based on a second frame of video data, wherein each of the second tokens comprises a feature vector corresponding to a respective location within the second frame of video data; determining a destination token from among the first tokens; determining candidate tokens from among the second tokens based on respective relationships between the candidate tokens and the destination token; merging the candidate tokens with the destination token resulting in modified second tokens; and processing the modified second tokens using a diffusion model.