Spaces:
Running
Running
name,upload_date,description,parameter_count,creator,result_path,license,link | |
ChatDiT,2024.12.23.15.49.00,A Training-Free Baseline for Task-Agnostic Free-Form Chatting with Diffusion Transformers.,12B,Tongyi Lab,chatdit-1225.csv,MIT License,https://github.com/ali-vilab/ChatDiT | |
GPT-4o + FLUX.1 [dev],2024.12.23.15.50.00,A new open-source image generation model developed by Black Forest Labs. Use GPT-4o for prompt rephrasing. ,12B,Black Forest Labs,gpt4o-flux-1225.csv,FLUX.1 [dev] Non-Commercial License,https://huggingface.co/black-forest-labs/FLUX.1-dev | |
GPT-4o + Stable Diffusion 3 Medium,2024.12.24.15.39.00,"A Multimodal Diffusion Transformer (MMDiT) text-to-image model that features greatly improved performance in image quality, typography, complex prompt understanding, and resource-efficiency. Use GPT-4o for prompt rephrasing. ",2B,Stability AI,gpt4o-sd3-1225.csv,Stability AI Community License,https://huggingface.co/stabilityai/stable-diffusion-3-medium | |
GPT-4o + PixArt-Sigma,2024.12.24.15.39.00,"PixArt-Sigma consists of pure transformer blocks for latent diffusion: It can directly generate 1024px, 2K and 4K images from text prompts within a single sampling process. Use GPT-4o for prompt rephrasing. ",0.6B,Huawei Noah's Ark Lab,gpt4o-pixart-1225.csv,CreativeML Open RAIL++-M License,https://huggingface.co/PixArt-alpha/PixArt-Sigma-XL-2-1024-MS | |
GPT-4o + DALLE-3,2024.12.24.15.39.00,DALL-E 3 is the newest text-to-image generation model from OpenAI. Use GPT-4o for prompt rephrasing. ,12B,OpenAI,gpt4o-dalle3-1225.csv,OpenAI Terms of Use,https://openai.com/index/dall-e-3/ | |
GPT-4o + Emu2,2024.12.24.15.39.00,"A generative multimodal model with 37 billion parameters, trained on large-scale multimodal sequences with a unified autoregressive objective. Use GPT-4o for prompt rephrasing. ",37B,BAAI,gpt4o-emu2-1225.csv,Apache License 2.0,https://huggingface.co/BAAI/Emu2 | |
GPT-4o + OmniGen,2024.12.24.15.39.00,"OmniGen is a unified image generation model that you can use to perform various tasks, including but not limited to text-to-image generation, subject-driven generation, Identity-Preserving Generation, and image-conditioned generation. Use GPT-4o for prompt rephrasing. ",3.8B,BAAI,gpt4o-omnigen-1225.csv,MIT License,https://huggingface.co/spaces/Shitao/OmniGen |