US20250117968
2025-04-10
Physics
G06T11/00
The patent application outlines methods, systems, and apparatuses for generating high-resolution images using diffusion models. It involves obtaining a prompt and using a first diffusion model to generate a predicted denoised image at a low resolution. This image is then upsampled to a higher resolution, and a second diffusion model generates a final output image based on this upsampled version. This process aims to improve image quality and reduce processing time compared to conventional methods.
Diffusion models are machine learning models particularly suited for image generation, where they progressively remove noise from an image through a series of steps. These models typically face challenges in generating high-resolution images due to increased processing time and memory demands. Traditional approaches involve generating a low-resolution image and then upsampling it, which often results in less accurate outputs.
The proposed method first creates a noiseless prediction at a low resolution, which is then upsampled before being processed by a second diffusion model to produce the high-resolution output. This approach contrasts with conventional techniques that might add noise correction steps, potentially introducing errors. By avoiding these corrections, the new method achieves higher quality images more efficiently.
By eliminating the need for noise correction in the upsampling process, the method enhances image quality and accuracy. It also optimizes processing efficiency, enabling the use of larger diffusion models within existing hardware constraints. This leads to better high-resolution images without the drawbacks of traditional approaches that rely on noisy intermediate images.