The above tool is a Stable Diffusion Image Variations model fine-tuned to accept multiple CLIP image embeddings as input. Users can combine the embeddings of several images to mix their concepts, and can add text concepts for greater variation. The output is a 640x640 image, and the model can be run locally or on Lambda GPU Cloud.
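The description does not say how the embeddings are combined, so the following is only a minimal sketch under stated assumptions: each embedding is normalized, scaled by a per-input strength, and stacked into a sequence of conditioning vectors. Random vectors stand in for real CLIP embeddings, the width of 768 is an assumption, and every name here (`build_conditioning`, `fake_embedding`, `EMBED_DIM`) is hypothetical rather than part of the tool's API.

```python
import math
import random

EMBED_DIM = 768  # assumed embedding width; the description does not state it


def normalize(vec):
    """Scale a vector to unit L2 norm."""
    norm = math.sqrt(sum(x * x for x in vec))
    if norm == 0.0:
        raise ValueError("cannot normalize a zero vector")
    return [x / norm for x in vec]


def build_conditioning(embeddings, strengths):
    """Stack unit-norm embeddings, each scaled by its strength.

    Assumption: the model consumes one conditioning vector per input,
    so the result is a list of rows rather than a single averaged vector.
    """
    if len(embeddings) != len(strengths):
        raise ValueError("need exactly one strength per embedding")
    return [[s * x for x in normalize(e)] for e, s in zip(embeddings, strengths)]


def fake_embedding(seed, dim=EMBED_DIM):
    """Hypothetical stand-in for a CLIP image or text embedding."""
    rng = random.Random(seed)
    return [rng.gauss(0.0, 1.0) for _ in range(dim)]


# Two image concepts and one text concept, with decreasing influence.
image_a = fake_embedding(0)
image_b = fake_embedding(1)
text_c = fake_embedding(2)
strengths = [1.0, 0.5, 0.25]

conditioning = build_conditioning([image_a, image_b, text_c], strengths)
print(len(conditioning), len(conditioning[0]))  # prints "3 768"
```

In this sketch, lowering a strength shrinks that input's row, which is one plausible way to let some concepts dominate the mix; the actual model may weight its inputs differently.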
Visits: 71.98K | Pricing model: Freemium