MosaicArt / README.md
Guizmus's picture
Update README.md
b77a4f6
|
raw
history blame
2.72 kB
metadata
language:
  - en
license: creativeml-openrail-m
thumbnail: https://huggingface.co/Guizmus/MosaicArt/resolve/main/showcase.png
tags:
  - stable-diffusion
  - text-to-image
  - image-to-image

Mosaic Art

Showcase This is a Dreamboothed Stable Diffusion model trained on pictures of mosaic art.

The total dataset is made of 46 pictures. V2 was trained on Stable diffusion 2.1 768. I used StableTuner to do the training, using full caption on the pictures with almost no recurring word outside the main concept, so that no additionnal regularisation was needed. 6 epochs of 40 repeats on LR 1e-6 were used, with prior preservation.

V1 was trained on runawayml 1.5 and the new VAE. I used EveryDream to do the training, using full caption on the pictures with almost no recurring word outside the main concept, so that no additionnal regularisation was needed. Out of e0 to e11 epochs, e8 was selected as the best application of style while not overtraining. Prior preservation was constated as good. A total of 9 epochs of 40 repeats with a learning rate of 1e-6.

The token "Mosaic Art" will bring in the new concept, trained as a style.

The recommended sampling is k_Euler_a or DPM++ 2M Karras on 20 steps, CFGS 7.5 .

CKPT v2

YAML v2

CKPT v1

CKPT v1 with ema weights

Dataset

🧨 Diffusers

This model can be used just like any other Stable Diffusion model. For more information, please have a look at the Stable Diffusion.

You can also export the model to ONNX, MPS and/or FLAX/JAX.

from diffusers import StableDiffusionPipeline
import torch

model_id = "Guizmus/MosaicArt"
pipe = StableDiffusionPipeline.from_pretrained(model_id, torch_dtype=torch.float16)
pipe = pipe.to("cuda")

prompt = "Mosaic Art dog on the moon"
image = pipe(prompt).images[0]

image.save("./MosaicArt.png")