Stable-Diffusion-v2.1
State-of-the-art generative AI model used to generate detailed images conditioned on text descriptions.
Generates high resolution images from text prompts using a latent diffusion model. This model uses CLIP ViT-L/14 as text encoder, U-Net based latent denoising, and VAE based decoder to generate the final image.
Technical Details
Input:Text prompt to generate image
QNN-SDK:2.20
Text Encoder Number of parameters:340M
UNet Number of parameters:865M
VAE Decoder Number of parameters:83M
Model size:1GB
Applicable Scenarios
- Image Generation
- Image Editing
- Content Creation
Licenses
Source Model:CREATIVEML-OPENRAIL-M
Deployable Model:CREATIVEML-OPENRAIL-M
Terms of Use:Qualcomm® Generative AI usage and limitations
Tags
- generative-aiModels capable of generating text, images, or other data using generative models, often in response to prompts.
- quantizedA “quantized” model can run in low or mixed precision, which can substantially reduce inference latency.
Supported Compute Chipsets
- Snapdragon® X Elite