ai-forever
commited on
Update README.md
Browse files
README.md
CHANGED
@@ -1,12 +1,11 @@
|
|
1 |
---
|
2 |
license: apache-2.0
|
3 |
---
|
4 |
-
|
5 |
-
# Kandinsky-3.1: Text-to-image diffusion model
|
6 |
|
7 |
![](assets/title.jpg)
|
8 |
|
9 |
-
[Kandinsky 3.0 Post](https://habr.com/ru/companies/sberbank/articles/775590/) | [Project Page](https://ai-forever.github.io/Kandinsky-3) | [Generate](https://fusionbrain.ai) | [Telegram-bot](https://t.me/kandinsky21_bot) | [Technical Report](https://arxiv.org/pdf/2312.03511.pdf)
|
10 |
|
11 |
# Kandinsky 3.1:
|
12 |
|
@@ -14,14 +13,14 @@ license: apache-2.0
|
|
14 |
|
15 |
We present Kandinsky 3.1, the follow-up to the Kandinsky 3.0 model, a large-scale text-to-image generation model based on latent diffusion, continuing the series of text-to-image Kandinsky models and reflecting our progress to achieve higher quality and realism of image generation, which we have enhanced and enriched with a variety of useful features and modes to give users more opportunities to fully utilise the power of our new model.
|
16 |
|
17 |
-
## Kandinsky Flash
|
18 |
|
19 |
<figure>
|
20 |
<img src="assets/butterly_effect.jpg">
|
21 |
</figure>
|
22 |
|
23 |
|
24 |
-
Diffusion models have problems with fast image generation. To address this problem, we trained a Kandinksy Flash model based on the [Adversarial Diffusion Distillation](https://arxiv.org/abs/2311.17042) approach with some modifications: we trained the model on latents, which reduced the memory overhead and removed distillation loss as it did not affect the training. Also we
|
25 |
|
26 |
### Architecture
|
27 |
|
@@ -48,6 +47,10 @@ t2i_pipe = get_T2I_Flash_pipeline(
|
|
48 |
|
49 |
res = t2i_pipe("A cute corgi lives in a house made out of sushi.")
|
50 |
```
|
|
|
|
|
|
|
|
|
51 |
|
52 |
## Prompt beautification
|
53 |
|
@@ -228,3 +231,4 @@ image = inp_pipe( "A cute corgi lives in a house made out of sushi.", image, mas
|
|
228 |
}
|
229 |
```
|
230 |
|
|
|
|
1 |
---
|
2 |
license: apache-2.0
|
3 |
---
|
4 |
+
# Kandinsky-3: Text-to-image diffusion model
|
|
|
5 |
|
6 |
![](assets/title.jpg)
|
7 |
|
8 |
+
[Kandinsky 3.0 Post](https://habr.com/ru/companies/sberbank/articles/775590/) | [Kandinsky 3.1 Post](https://habr.com/ru/companies/sberbank/articles/805337/) | [Project Page](https://ai-forever.github.io/Kandinsky-3) | [Generate](https://fusionbrain.ai) | [Telegram-bot](https://t.me/kandinsky21_bot) | [Technical Report](https://arxiv.org/pdf/2312.03511.pdf) | [HuggingFace](https://huggingface.co/kandinsky-community/kandinsky-3)
|
9 |
|
10 |
# Kandinsky 3.1:
|
11 |
|
|
|
13 |
|
14 |
We present Kandinsky 3.1, the follow-up to the Kandinsky 3.0 model, a large-scale text-to-image generation model based on latent diffusion, continuing the series of text-to-image Kandinsky models and reflecting our progress to achieve higher quality and realism of image generation, which we have enhanced and enriched with a variety of useful features and modes to give users more opportunities to fully utilise the power of our new model.
|
15 |
|
16 |
+
## Kandinsky Flash (Kandinsky 3.0 Refiner)
|
17 |
|
18 |
<figure>
|
19 |
<img src="assets/butterly_effect.jpg">
|
20 |
</figure>
|
21 |
|
22 |
|
23 |
+
Diffusion models have problems with fast image generation. To address this problem, we trained a Kandinksy Flash model based on the [Adversarial Diffusion Distillation](https://arxiv.org/abs/2311.17042) approach with some modifications: we trained the model on latents, which reduced the memory overhead and removed distillation loss as it did not affect the training. Also, we applied Kandinsky Flash model to images generated from Kandinsky 3.0 to improve visual quality of generated images.
|
24 |
|
25 |
### Architecture
|
26 |
|
|
|
47 |
|
48 |
res = t2i_pipe("A cute corgi lives in a house made out of sushi.")
|
49 |
```
|
50 |
+
### Kandinsky Inpainting
|
51 |
+
|
52 |
+
Also, we released a newer version of inpainting model, which we additionally trained the model on the object detection dataset. This allowed to get more stable generation of objects. The new weights are available at [ai-forever/Kandinsky3.1](https://huggingface.co/ai-forever/Kandinsky3.1). Check the usage [example](https://github.com/ai-forever/Kandinsky-3?tab=readme-ov-file#2-inpainting).
|
53 |
+
|
54 |
|
55 |
## Prompt beautification
|
56 |
|
|
|
231 |
}
|
232 |
```
|
233 |
|
234 |
+
|