ai-forever committed on
Commit 279d199 · verified
1 Parent(s): e32fc97

Update README.md

Files changed (1)
  1. README.md +9 -5
README.md CHANGED
@@ -1,12 +1,11 @@
  ---
  license: apache-2.0
  ---
-
- # Kandinsky-3.1: Text-to-image diffusion model
 
  ![](assets/title.jpg)
 
- [Kandinsky 3.0 Post](https://habr.com/ru/companies/sberbank/articles/775590/) | [Project Page](https://ai-forever.github.io/Kandinsky-3) | [Generate](https://fusionbrain.ai) | [Telegram-bot](https://t.me/kandinsky21_bot) | [Technical Report](https://arxiv.org/pdf/2312.03511.pdf)
 
  # Kandinsky 3.1:
 
@@ -14,14 +13,14 @@ license: apache-2.0
 
  We present Kandinsky 3.1, the follow-up to the Kandinsky 3.0 model, a large-scale text-to-image generation model based on latent diffusion, continuing the series of text-to-image Kandinsky models and reflecting our progress to achieve higher quality and realism of image generation, which we have enhanced and enriched with a variety of useful features and modes to give users more opportunities to fully utilise the power of our new model.
 
- ## Kandinsky Flash
 
  <figure>
  <img src="assets/butterly_effect.jpg">
  </figure>
 
 
- Diffusion models have problems with fast image generation. To address this problem, we trained a Kandinsky Flash model based on the [Adversarial Diffusion Distillation](https://arxiv.org/abs/2311.17042) approach with some modifications: we trained the model on latents, which reduced the memory overhead and removed distillation loss as it did not affect the training. Also we used Kandinsky Flash model to improve visual quality of generation from Kandinsky 3.0.
 
  ### Architecture
 
@@ -48,6 +47,10 @@ t2i_pipe = get_T2I_Flash_pipeline(
 
  res = t2i_pipe("A cute corgi lives in a house made out of sushi.")
  ```
 
  ## Prompt beautification
 
@@ -228,3 +231,4 @@ image = inp_pipe( "A cute corgi lives in a house made out of sushi.", image, mas
  }
  ```
 
@@ -1,12 +1,11 @@
  ---
  license: apache-2.0
  ---
+ # Kandinsky-3: Text-to-image diffusion model
 
  ![](assets/title.jpg)
 
+ [Kandinsky 3.0 Post](https://habr.com/ru/companies/sberbank/articles/775590/) | [Kandinsky 3.1 Post](https://habr.com/ru/companies/sberbank/articles/805337/) | [Project Page](https://ai-forever.github.io/Kandinsky-3) | [Generate](https://fusionbrain.ai) | [Telegram-bot](https://t.me/kandinsky21_bot) | [Technical Report](https://arxiv.org/pdf/2312.03511.pdf) | [HuggingFace](https://huggingface.co/kandinsky-community/kandinsky-3)
 
  # Kandinsky 3.1:
 
 
@@ -14,14 +13,14 @@ license: apache-2.0
 
  We present Kandinsky 3.1, the follow-up to the Kandinsky 3.0 model, a large-scale text-to-image generation model based on latent diffusion, continuing the series of text-to-image Kandinsky models and reflecting our progress to achieve higher quality and realism of image generation, which we have enhanced and enriched with a variety of useful features and modes to give users more opportunities to fully utilise the power of our new model.
 
+ ## Kandinsky Flash (Kandinsky 3.0 Refiner)
 
  <figure>
  <img src="assets/butterly_effect.jpg">
  </figure>
 
 
+ Diffusion models struggle to generate images quickly. To address this problem, we trained a Kandinsky Flash model based on the [Adversarial Diffusion Distillation](https://arxiv.org/abs/2311.17042) approach with some modifications: we trained the model on latents, which reduced the memory overhead, and we removed the distillation loss, as it did not affect training. We also applied the Kandinsky Flash model to images generated by Kandinsky 3.0 to improve their visual quality.
 
  ### Architecture
 
 
@@ -48,6 +47,10 @@ t2i_pipe = get_T2I_Flash_pipeline(
 
  res = t2i_pipe("A cute corgi lives in a house made out of sushi.")
  ```
+ ### Kandinsky Inpainting
+
+ We also released a newer version of the inpainting model, which we additionally trained on an object detection dataset. This makes the generation of objects more stable. The new weights are available at [ai-forever/Kandinsky3.1](https://huggingface.co/ai-forever/Kandinsky3.1). See the usage [example](https://github.com/ai-forever/Kandinsky-3?tab=readme-ov-file#2-inpainting).
+
 
  ## Prompt beautification
 
 
@@ -228,3 +231,4 @@ image = inp_pipe( "A cute corgi lives in a house made out of sushi.", image, mas
  }
  ```
 
+
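Reviewer note on the Kandinsky Flash paragraph: the README says the model follows Adversarial Diffusion Distillation, trained on latents and with the distillation loss removed. The sketch below is only an illustration of what an adversarial-only student objective could look like, assuming the hinge formulation used in the ADD paper. The function names, the `distillation_weight` parameter, and the use of plain Python lists in place of discriminator outputs on latents are all hypothetical; the README does not specify the actual training code.

```python
from typing import Sequence


def hinge_discriminator_loss(real_logits: Sequence[float],
                             fake_logits: Sequence[float]) -> float:
    """Hinge loss for the discriminator (assumed objective, as in ADD).

    Penalises real logits below +1 and fake logits above -1.
    """
    real_term = sum(max(0.0, 1.0 - r) for r in real_logits) / len(real_logits)
    fake_term = sum(max(0.0, 1.0 + f) for f in fake_logits) / len(fake_logits)
    return real_term + fake_term


def adversarial_student_loss(fake_logits: Sequence[float]) -> float:
    """Adversarial term for the student: raise the discriminator's score
    on generated samples (here, hypothetically, generated latents)."""
    return -sum(fake_logits) / len(fake_logits)


def student_loss(fake_logits: Sequence[float],
                 distillation_term: float = 0.0,
                 distillation_weight: float = 0.0) -> float:
    """Total student loss. With distillation_weight = 0.0 (hypothetical
    default mirroring "removed distillation loss"), only the adversarial
    term remains."""
    return (adversarial_student_loss(fake_logits)
            + distillation_weight * distillation_term)


if __name__ == "__main__":
    # Toy discriminator logits standing in for scores on real and generated latents.
    real = [2.0, 0.5]
    fake = [-2.0, 0.0]
    print("discriminator loss:", hinge_discriminator_loss(real, fake))
    print("student loss:", student_loss(fake))
```

Setting `distillation_weight` to a non-zero value would restore the two-term objective of the original ADD formulation, which makes it easy to compare both variants in a toy setting.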