PseudoTerminal X commited on
Commit
4fbe919
·
verified ·
1 Parent(s): 507c224

Model card auto-generated by SimpleTuner

Browse files
Files changed (1) hide show
  1. README.md +108 -0
README.md ADDED
@@ -0,0 +1,108 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ license: creativeml-openrail-m
3
+ base_model: "stabilityai/stable-diffusion-3-medium-diffusers"
4
+ tags:
5
+ - stable-diffusion
6
+ - stable-diffusion-diffusers
7
+ - text-to-image
8
+ - diffusers
9
+ - lora
10
+ - template:sd-lora
11
+ inference: true
12
+ widget:
13
+ - text: 'a studio portrait photograph of emma watson. she looks relaxed and happy.'
14
+ parameters:
15
+ negative_prompt: 'blurry, cropped, ugly'
16
+ output:
17
+ url: ./assets/image_0_0.png
18
+ ---
19
+
20
+ # sd3-lora-celebrities
21
+
22
+ This is a LoRA derived from [stabilityai/stable-diffusion-3-medium-diffusers](https://huggingface.co/stabilityai/stable-diffusion-3-medium-diffusers).
23
+
24
+ The main validation prompt used during training was:
25
+
26
+ ```
27
+ a studio portrait photograph of emma watson. she looks relaxed and happy.
28
+ ```
29
+
30
+ ## Validation settings
31
+ - CFG: `5.0`
32
+ - CFG Rescale: `0.2`
33
+ - Steps: `50`
34
+ - Sampler: `euler`
35
+ - Seed: `2`
36
+ - Resolution: `1280x768`
37
+
38
+ Note: The validation settings are not necessarily the same as the [training settings](#training-settings).
39
+
40
+ You can find some example images in the following gallery:
41
+
42
+
43
+ <Gallery />
44
+
45
+ The text encoder **was not** trained.
46
+ You may reuse the base model text encoder for inference.
47
+
48
+
49
+ ## Training settings
50
+
51
+ - Training epochs: 0
52
+ - Training steps: 200
53
+ - Learning rate: 1e-06
54
+ - Effective batch size: 1
55
+ - Micro-batch size: 1
56
+ - Gradient accumulation steps: 1
57
+ - Number of GPUs: 1
58
+ - Prediction type: v_prediction
59
+ - Rescaled betas zero SNR: True
60
+ - Optimizer: AdamW, stochastic bf16
61
+ - Precision: Pure BF16
62
+ - Xformers: Not used
63
+ - LoRA Rank: 16
64
+ - LoRA Alpha: 16
65
+ - LoRA Dropout: 0.1
66
+ - LoRA initialisation style: default
67
+
68
+
69
+ ## Datasets
70
+
71
+ ### celebrities-sd3
72
+ - Repeats: 0
73
+ - Total number of images: 1830
74
+ - Total number of aspect buckets: 3
75
+ - Resolution: 0.5 megapixels
76
+ - Cropped: False
77
+ - Crop style: None
78
+ - Crop aspect: None
79
+
80
+
81
+ ## Inference
82
+
83
+
84
+ ```python
85
+ import torch
86
+ from diffusers import StableDiffusion3Pipeline
87
+
88
+
89
+
90
+ model_id = "sd3-lora-celebrities"
91
+ prompt = "a studio portrait photograph of emma watson. she looks relaxed and happy."
92
+ negative_prompt = "malformed, disgusting, overexposed, washed-out"
93
+
94
+ pipeline = DiffusionPipeline.from_pretrained(model_id)
95
+ pipeline.to('cuda' if torch.cuda.is_available() else 'mps' if torch.backends.mps.is_available() else 'cpu')
96
+ image = pipeline(
97
+ prompt=prompt,
98
+ negative_prompt='blurry, cropped, ugly',
99
+ num_inference_steps=50,
100
+ generator=torch.Generator(device='cuda' if torch.cuda.is_available() else 'mps' if torch.backends.mps.is_available() else 'cpu').manual_seed(1641421826),
101
+ width=1152,
102
+ height=768,
103
+ guidance_scale=5.0,
104
+ guidance_rescale=0.2,
105
+ ).images[0]
106
+ image.save("output.png", format="PNG")
107
+ ```
108
+