Publication number: US20240104312A1
Publication date: 2024-03-28
Application number: US17953710
Filing date: 2022-09-27
Applicant: Google LLC
Inventors: Thomas Jonathan Stone, Darkhan Zholmukhanov, Dawid Michal Wegner
CPC classification numbers: G06F40/58, G06N3/0454, G06T19/006
Abstract: Provided are systems and methods that use generative models (e.g., generative adversarial networks) to enable photorealistic text inpainting in augmented reality. One example application of the proposed systems is augmented reality translation. For example, a user can operate an image capture device (e.g., camera, smartphone, etc.) to capture imagery of a real-world scene that includes real-world text (e.g., signage, restaurant menus, etc.). The real-world text can be translated into a different language. Further, the captured imagery can be processed with a machine-learned generative model to produce an augmented image. The augmented image can depict the real-world scene with the real-world text removed. Because a machine-learned generative model is used, the augmented image can appear significantly more realistic than, for example, an image in which the real-world text has simply been covered by a single-color box.
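
The contrast the abstract draws between a single-color box and a filled-in background can be illustrated with a toy sketch. This is not the method of the application: `inpaint_by_diffusion` is a simple neighbor-averaging stand-in used here in place of the machine-learned generative model, and every name below (`fill_box_single_color`, `inpaint_by_diffusion`, `mean_abs_error`, the grayscale nested-list image, the `(top, left, bottom, right)` box) is a hypothetical choice made for illustration. The text region is assumed to be already detected and to lie strictly inside the image.

```python
from typing import List, Optional, Tuple

Image = List[List[float]]          # grayscale image, row-major (assumption)
Box = Tuple[int, int, int, int]    # top, left, bottom, right (exclusive)


def fill_box_single_color(image: Image, box: Box,
                          color: Optional[float] = None) -> Image:
    """Baseline from the abstract: cover the text with a flat-color box.

    If no color is given, use the mean of all pixels outside the box.
    """
    top, left, bottom, right = box
    out = [row[:] for row in image]
    if color is None:
        outside = [v for r, row in enumerate(image) for c, v in enumerate(row)
                   if not (top <= r < bottom and left <= c < right)]
        color = sum(outside) / len(outside)
    for r in range(top, bottom):
        for c in range(left, right):
            out[r][c] = color
    return out


def inpaint_by_diffusion(image: Image, box: Box, iterations: int = 200) -> Image:
    """Stand-in for the generative model (NOT a GAN).

    Repeatedly replaces each pixel in the box with the average of its
    four neighbors, so surrounding background flows into the text region.
    """
    top, left, bottom, right = box
    height, width = len(image), len(image[0])
    out = [row[:] for row in image]
    for _ in range(iterations):
        nxt = [row[:] for row in out]
        for r in range(top, bottom):
            for c in range(left, right):
                neighbors = [out[rr][cc]
                             for rr, cc in ((r - 1, c), (r + 1, c),
                                            (r, c - 1), (r, c + 1))
                             if 0 <= rr < height and 0 <= cc < width]
                nxt[r][c] = sum(neighbors) / len(neighbors)
        out = nxt
    return out


def mean_abs_error(a: Image, b: Image, box: Box) -> float:
    """Average absolute pixel difference between two images inside the box."""
    top, left, bottom, right = box
    diffs = [abs(a[r][c] - b[r][c])
             for r in range(top, bottom) for c in range(left, right)]
    return sum(diffs) / len(diffs)
```

On a background with a smooth gradient, the flat box leaves a visible patch while the averaging fill blends with its surroundings. A learned generative model, as described in the abstract, would be expected to go further and reproduce texture and structure that simple averaging cannot.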