PseudoTerminal X commited on
Commit
06b0947
·
verified ·
1 Parent(s): ca4965b

Model card auto-generated by SimpleTuner

Browse files
Files changed (1) hide show
  1. README.md +110 -0
README.md ADDED
@@ -0,0 +1,110 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ license: creativeml-openrail-m
3
+ base_model: "stabilityai/stable-diffusion-3-medium-diffusers"
4
+ tags:
5
+ - stable-diffusion
6
+ - stable-diffusion-diffusers
7
+ - text-to-image
8
+ - diffusers
9
+ - lora
10
+ - template:sd-lora
11
+ inference: true
12
+ widget:
13
+ - text: 'prompt not found (0)'
14
+ parameters:
15
+ negative_prompt: 'blurry, cropped, ugly'
16
+ output:
17
+ url: ./assets/image_0_0.png
18
+ - text: 'prompt not found (1)'
19
+ parameters:
20
+ negative_prompt: 'blurry, cropped, ugly'
21
+ output:
22
+ url: ./assets/image_1_0.png
23
+ ---
24
+
25
+ # sd3-lora-test
26
+
27
+ This is a LoRA derived from [stabilityai/stable-diffusion-3-medium-diffusers](https://huggingface.co/stabilityai/stable-diffusion-3-medium-diffusers).
28
+
29
+ The main validation prompt used during training was:
30
+
31
+ ```
32
+ a naked man is surfing on top of a horse
33
+ ```
34
+
35
+ ## Validation settings
36
+ - CFG: `5.0`
37
+ - CFG Rescale: `0.2`
38
+ - Steps: `50`
39
+ - Sampler: `euler`
40
+ - Seed: `2`
41
+ - Resolution: `1280x768`
42
+
43
+ Note: The validation settings are not necessarily the same as the [training settings](#training-settings).
44
+
45
+ You can find some example images in the following gallery:
46
+
47
+
48
+ <Gallery />
49
+
50
+ The text encoder **was not** trained.
51
+ You may reuse the base model text encoder for inference.
52
+
53
+
54
+ ## Training settings
55
+
56
+ - Training epochs: 0
57
+ - Training steps: 86
58
+ - Learning rate: 1e-06
59
+ - Effective batch size: 1
60
+ - Micro-batch size: 1
61
+ - Gradient accumulation steps: 1
62
+ - Number of GPUs: 1
63
+ - Prediction type: v_prediction
64
+ - Rescaled betas zero SNR: True
65
+ - Optimizer: AdamW, stochastic bf16
66
+ - Precision: Pure BF16
67
+ - Xformers: Not used
68
+ - LoRA Rank: 16
69
+ - LoRA Alpha: 16
70
+ - LoRA Dropout: 0.1
71
+ - LoRA initialisation style: default
72
+
73
+
74
+ ## Datasets
75
+
76
+ ### celebrities
77
+ - Repeats: 0
78
+ - Total number of images: 1253
79
+ - Total number of aspect buckets: 3
80
+ - Resolution: 1.0 megapixels
81
+ - Cropped: True
82
+ - Crop style: random
83
+ - Crop aspect: random
84
+
85
+
86
+ ## Inference
87
+
88
+
89
+ ```python
90
+ None
91
+
92
+ model_id = "sd3-lora-test"
93
+ prompt = "a naked man is surfing on top of a horse"
94
+ negative_prompt = "malformed, disgusting, overexposed, washed-out"
95
+
96
+ pipeline = DiffusionPipeline.from_pretrained(model_id)
97
+ pipeline.to('cuda' if torch.cuda.is_available() else 'mps' if torch.backends.mps.is_available() else 'cpu')
98
+ image = pipeline(
99
+ prompt=prompt,
100
+ negative_prompt='blurry, cropped, ugly',
101
+ num_inference_steps=50,
102
+ generator=torch.Generator(device='cuda' if torch.cuda.is_available() else 'mps' if torch.backends.mps.is_available() else 'cpu').manual_seed(1641421826),
103
+ width=1152,
104
+ height=768,
105
+ guidance_scale=5.0,
106
+ guidance_rescale=0.2,
107
+ ).images[0]
108
+ image.save(f"output.png", format="PNG")
109
+ ```
110
+