Qualcomm® AI Hub

ControlNet

Generates visual art from a text prompt and a guiding input image.

On-device, high-resolution image synthesis from text and image prompts. ControlNet guides Stable Diffusion with a provided input image to generate accurate images from the given text prompt.

Technical Details

Input: Text prompt and input image as a reference
Conditioning Input: Canny-Edge
Text Encoder Number of parameters: 340M
UNet Number of parameters: 865M
VAE Decoder Number of parameters: 83M
ControlNet Number of parameters: 361M
Model size: 1.4GB

Applicable Scenarios

  • Image Generation
  • Image Editing
  • Content Creation

Tags

  • generative-ai
  • quantized
