Stable Diffusion v1-4 Model

Developed by: Robin Rombach, Patrick Esser Model type: Diffusion-based text-to-image generation model Language(s): English License: The CreativeML OpenRAIL M license Model Description: This is a model that can be used to generate and modify images based on text prompts. It is a Latent Diffusion Model that uses a fixed, pretrained text encoder (CLIP ViT-L/14) as suggested in the Imagen paper. Resources for more information: GitHub Repository, Paper.

Realistic Photo Scenes Featured