Photorealistic Text Inpainting for Augmented Reality Using Generative Models

    公开(公告)号:US20240104312A1

    公开(公告)日:2024-03-28

    申请号:US17953710

    申请日:2022-09-27

    Applicant: Google LLC

    CPC classification number: G06F40/58 G06N3/0454 G06T19/006

    Abstract: Provided are systems and methods that use generative models (e.g., generative adversarial networks) to enable photorealistic text inpainting in augmented reality. One example application of the proposed systems is to perform augmented reality translation. For example, a user can operate an image capture device (e.g., camera, smartphone, etc.) to capture imagery of a real-world scene that includes real-world text (e.g., signage, restaurant menus, etc.). The real-world text can be translated into a different language. Further, the captured imagery can be processed with a machine-learned generative model to produce an augmented image. The augmented image can depict the real-world scene with the real-world text removed. Specifically, because a machine-learned generative model is used, the augmented image can appear significantly more realistic, for example versus an image in which the real-world text has simply been blocked using a box with a single color.

Patent Agency Ranking