ai-forever/Kandinsky3.1

ai-forever committed on Apr 21, 2024

Commit 279d199 (verified)

1 parent: e32fc97

Update README.md

Files changed (1)

README.md +9 -5

@@ -1,12 +1,11 @@
 ---
 license: apache-2.0
 ---
-# Kandinsky-3.1: Text-to-image diffusion model
+# Kandinsky-3: Text-to-image diffusion model
 ![](assets/title.jpg)
-[Kandinsky 3.0 Post](https://habr.com/ru/companies/sberbank/articles/775590/) | [Project Page](https://ai-forever.github.io/Kandinsky-3) | [Generate](https://fusionbrain.ai) | [Telegram-bot](https://t.me/kandinsky21_bot) | [Technical Report](https://arxiv.org/pdf/2312.03511.pdf)
+[Kandinsky 3.0 Post](https://habr.com/ru/companies/sberbank/articles/775590/) | [Kandinsky 3.1 Post](https://habr.com/ru/companies/sberbank/articles/805337/) | [Project Page](https://ai-forever.github.io/Kandinsky-3) | [Generate](https://fusionbrain.ai) | [Telegram-bot](https://t.me/kandinsky21_bot) | [Technical Report](https://arxiv.org/pdf/2312.03511.pdf) |  [HuggingFace](https://huggingface.co/kandinsky-community/kandinsky-3)
 # Kandinsky 3.1:
@@ -14,14 +13,14 @@ license: apache-2.0
 We present Kandinsky 3.1, the follow-up to the Kandinsky 3.0 model, a large-scale text-to-image generation model based on latent diffusion, continuing the series of text-to-image Kandinsky models and reflecting our progress to achieve higher quality and realism of image generation, which we have enhanced and enriched with a variety of useful features and modes to give users more opportunities to fully utilise the power of our new model.
-## Kandinsky Flash
+## Kandinsky Flash (Kandinsky 3.0 Refiner)
 <figure>
   <img src="assets/butterly_effect.jpg">
 </figure>
-Diffusion models have problems with fast image generation. To address this problem, we trained a Kandinksy Flash model based on the [Adversarial Diffusion Distillation](https://arxiv.org/abs/2311.17042) approach with some modifications: we trained the model on latents, which reduced the memory overhead and removed distillation loss as it did not affect the training. Also we used Kandinsky Flash model to improve visual quality of generation from Kandinsky 3.0.
+Diffusion models have problems with fast image generation. To address this problem, we trained a Kandinksy Flash model based on the [Adversarial Diffusion Distillation](https://arxiv.org/abs/2311.17042) approach with some modifications: we trained the model on latents, which reduced the memory overhead and removed distillation loss as it did not affect the training. Also, we applied Kandinsky Flash model to images generated from Kandinsky 3.0 to improve visual quality of generated images.
 ### Architecture
@@ -48,6 +47,10 @@ t2i_pipe = get_T2I_Flash_pipeline(
 res = t2i_pipe("A cute corgi lives in a house made out of sushi.")
 ```
+### Kandinsky Inpainting
+Also, we released a newer version of inpainting model, which we additionally trained the model on the object detection dataset. This allowed to get more stable generation of objects. The new weights are available at [ai-forever/Kandinsky3.1](https://huggingface.co/ai-forever/Kandinsky3.1). Check the usage [example](https://github.com/ai-forever/Kandinsky-3?tab=readme-ov-file#2-inpainting).
 ## Prompt beautification
@@ -228,3 +231,4 @@ image = inp_pipe( "A cute corgi lives in a house made out of sushi.", image, mas
 }
 ```
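
The `+9 -5` summary can be cross-checked against the four hunk headers in the diff above. The sketch below is only an illustration: `parse_hunk_header` and `net_line_change` are hypothetical helper names, not part of any Kandinsky or Hugging Face tooling, and the trailing section text of each header is shortened because only the line ranges matter here.

```python
import re

# Hunk headers from the diff above; the text after the second "@@" is
# only a section hint and is ignored by the parser.
HUNK_HEADERS = [
    "@@ -1,12 +1,11 @@",
    "@@ -14,14 +13,14 @@ license: apache-2.0",
    "@@ -48,6 +47,10 @@ t2i_pipe = get_T2I_Flash_pipeline(",
    "@@ -228,3 +231,4 @@ image = inp_pipe(",
]

# Unified-diff hunk header: @@ -old_start[,old_len] +new_start[,new_len] @@
HUNK_RE = re.compile(r"^@@ -(\d+)(?:,(\d+))? \+(\d+)(?:,(\d+))? @@")


def parse_hunk_header(header):
    """Return (old_start, old_len, new_start, new_len) for one hunk header."""
    match = HUNK_RE.match(header)
    if match is None:
        raise ValueError(f"not a hunk header: {header!r}")
    old_start, old_len, new_start, new_len = match.groups()
    # An omitted length means the hunk covers exactly one line.
    return (int(old_start), int(old_len or 1), int(new_start), int(new_len or 1))


def net_line_change(headers):
    """Sum (new_len - old_len) over all hunks: lines added minus lines removed."""
    total = 0
    for header in headers:
        _, old_len, _, new_len = parse_hunk_header(header)
        total += new_len - old_len
    return total


net = net_line_change(HUNK_HEADERS)
print(net)  # 4, matching 9 additions minus 5 deletions
```

The per-hunk differences are -1, 0, +4 and +1, so the net change of 4 lines agrees with the `+9 -5` shown for README.md.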