Image Generation with Minimal Denoising Diffusion Steps

    公开(公告)号:US20250157008A1

    公开(公告)日:2025-05-15

    申请号:US18949522

    申请日:2024-11-15

    Applicant: Google LLC

    Abstract: Provided is a one-step text-to-image generative model, which represents a fusion of GAN and diffusion model elements. In particular, despite the promising outcomes of prior diffusion GAN hybrid models, achieving one-step sampling and extending their utility to text-to-image generation remains a complex challenge. The present disclosure provides a number of innovative techniques to enhance diffusion GAN models, resulting in an ultra-fast text-to-image model capable of producing high-quality images in a single sampling step.

    RESOURCE-EFFICIENT DIFFUSION MODELS

    公开(公告)号:US20250165756A1

    公开(公告)日:2025-05-22

    申请号:US18949875

    申请日:2024-11-15

    Applicant: Google LLC

    Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for generating a data item by performing a single-step denoising process using a diffusion model neural network. For example, the data items can be images, videos, audio waveforms, sensor outputs, and so on.

Patent Agency Ranking