
	---
	base_model: stabilityai/stable-diffusion-xl-base-1.0
	library_name: diffusers
	license: openrail++
	widget:
	- text: anime rat in a jungle
	  output:
	    url: images/example_xj9kri2rp.png
	- text: photorealistic cat in a jungle
	  output:
	    url: images/example_0aal2cjg1.png
	tags:
	- text-to-image
	- diffusers-training
	- diffusers
	- lora
	- template:sd-lora
	- stable-diffusion-xl
	- stable-diffusion-xl-diffusers

	---



	# Low-Rank Adapted, Supervised Fine-Tuned Stable Diffusion XL

	<Gallery />

	## Comparison

	| Prompt | SDXL | Fine-tuned |
	| :--: | :--: | :--: |
	| a boat in the canals of Venice, painted in gouache with soft, flowing brushstrokes and vibrant, translucent colors, capturing the serene reflection on the water under a misty ambiance, with rich textures and a dynamic perspective | ![image/png](https://cdn-uploads.huggingface.co/production/uploads/608aabf24955d2bfc3cd99c6/CtoZWxDmANYm7d95I3Fcp.png) | ![image/png](https://cdn-uploads.huggingface.co/production/uploads/608aabf24955d2bfc3cd99c6/hAxmaL-robradk1x_KqwQ.png) |
	| Grainy shot of a robot cooking in the kitchen, with soft shadows and nostalgic film texture. | ![image/png](https://cdn-uploads.huggingface.co/production/uploads/608aabf24955d2bfc3cd99c6/LTQI1NdaEjJUgeDpqzv7k.png) | ![image/png](https://cdn-uploads.huggingface.co/production/uploads/608aabf24955d2bfc3cd99c6/vAjnMCW0nmbV0zHKT8oCJ.png) |

	## Model description

	These are the `ariG23498/open-image-preferences-v1-sdxl-lora` LoRA adaptation weights for `stabilityai/stable-diffusion-xl-base-1.0`.

	The weights were trained with the [DreamBooth LoRA SDXL script](https://github.com/ariG23498/diffusers/blob/aritra/sdxl-lora/examples/dreambooth/train_dreambooth_lora_sdxl.py) on the chosen images and prompts of the
	[open-image-preferences-v1-binarized](https://huggingface.co/datasets/data-is-better-together/open-image-preferences-v1-binarized) dataset.
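
The training command further down passes `--image_column "chosen"` and `--caption_column "prompt"`, so each preference row contributes only its preferred image, captioned with its prompt. Below is a minimal sketch of that selection, assuming each row behaves like a dict with those two keys. The `rejected` key and the helper name `to_training_pair` are illustrative assumptions, not taken from the training script.

```python
def to_training_pair(row):
    """Return (caption, image) for supervised fine-tuning from one preference row."""
    # "prompt" and "chosen" match --caption_column and --image_column
    # in the training command; any other columns are ignored.
    return row["prompt"], row["chosen"]


if __name__ == "__main__":
    # Stand-in row: a real dataset row would hold images, not file names.
    row = {
        "prompt": "anime rat in a jungle",
        "chosen": "chosen.png",
        "rejected": "rejected.png",  # assumed column name, unused here
    }
    caption, image = to_training_pair(row)
    print(caption, image)
```

Because the rejected image is dropped, the run is plain supervised fine-tuning on the preferred samples rather than a preference-optimization objective.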


	## Use with `diffusers`

	```py
	from diffusers import AutoPipelineForText2Image
	import torch

	# Load the SDXL base model in bfloat16 and move it to the GPU.
	pipeline = AutoPipelineForText2Image.from_pretrained(
	    "stabilityai/stable-diffusion-xl-base-1.0",
	    torch_dtype=torch.bfloat16,
	).to("cuda")

	# Attach the LoRA weights from this repository.
	pipeline.load_lora_weights(
	    "ariG23498/open-image-preferences-v1-sdxl-lora",
	    weight_name="pytorch_lora_weights.safetensors",
	)

	prompt = "ENTER PROMPT"
	image = pipeline(prompt).images[0]
	```
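
To produce a side-by-side like the one in the Comparison table, one option is to render the same prompt with the same seed twice: once with the LoRA scaled to 0 and once at full strength. This is a sketch, not the procedure used for the images above. The helper names `call_kwargs` and `compare`, the seed, and the output file names are assumptions, and it relies on `cross_attention_kwargs={"scale": ...}` to scale the LoRA at inference time.

```python
def call_kwargs(prompt, lora_scale):
    """Build pipeline call arguments for a given LoRA strength."""
    # A scale of 0.0 should remove the LoRA contribution; 1.0 applies it fully.
    return {"prompt": prompt, "cross_attention_kwargs": {"scale": lora_scale}}


def compare(pipeline, prompt, seed=0):
    """Generate the same prompt and seed with the LoRA off (0.0) and on (1.0)."""
    import torch  # imported lazily so call_kwargs stays usable without torch

    images = {}
    for name, scale in (("sdxl", 0.0), ("fine_tuned", 1.0)):
        # Re-seed before each run so both images start from identical noise.
        generator = torch.Generator(device="cuda").manual_seed(seed)
        images[name] = pipeline(
            generator=generator, **call_kwargs(prompt, scale)
        ).images[0]
    return images


# Usage, with `pipeline` loaded as in the snippet above:
# images = compare(pipeline, "photorealistic cat in a jungle")
# images["sdxl"].save("sdxl.png")
# images["fine_tuned"].save("fine_tuned.png")
```

Fixing the seed isolates the effect of the adapter, since any difference between the two images then comes from the LoRA weights rather than from the initial noise.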

	## Command to train the model

	```shell
	# Run from the root of the diffusers repository (prefix with "!" in a notebook cell).
	accelerate launch examples/dreambooth/train_dreambooth_lora_sdxl.py \
	  --pretrained_model_name_or_path "stabilityai/stable-diffusion-xl-base-1.0" \
	  --dataset_name "data-is-better-together/open-image-preferences-v1-binarized" \
	  --hub_model_id "ariG23498/open-image-preferences-v1-sdxl-lora" \
	  --push_to_hub \
	  --output_dir "open-image-preferences-v1-sdxl-lora" \
	  --image_column "chosen" \
	  --caption_column "prompt" \
	  --mixed_precision="bf16" \
	  --resolution=1024 \
	  --train_batch_size=1 \
	  --repeats=1 \
	  --report_to="wandb" \
	  --gradient_accumulation_steps=1 \
	  --gradient_checkpointing \
	  --learning_rate=1.0 \
	  --text_encoder_lr=1.0 \
	  --optimizer="prodigy" \
	  --lr_scheduler="constant" \
	  --lr_warmup_steps=0 \
	  --rank=8 \
	  --checkpointing_steps=2000 \
	  --seed="0"
	```
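
The `--rank=8` flag sets the inner dimension of each LoRA update. For a linear layer with a weight of shape `d_out x d_in`, LoRA trains two small matrices of shapes `d_out x r` and `r x d_in` instead of the full weight. The sketch below counts the resulting parameters. The 1280 x 1280 layer size is an illustrative assumption, not a statement about which SDXL layers the script adapts.

```python
def lora_param_count(d_in, d_out, rank):
    """Trainable parameters LoRA adds for one linear layer: B (d_out x r) plus A (r x d_in)."""
    return rank * (d_in + d_out)


d_in = d_out = 1280  # hypothetical projection size, for illustration only
full = d_in * d_out  # parameters in the frozen full-rank weight
lora = lora_param_count(d_in, d_out, rank=8)
print(f"full: {full}, LoRA (rank 8): {lora}, ratio: {lora / full:.4f}")
```

For this hypothetical layer the adapter trains 20,480 parameters against 1,638,400 in the frozen weight, which is 1.25% of the original.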