Tolga Cangöz

tolgacangoz

AI & ML interests

AIGC

tolgacangoz's activity

liked a Space 1 day ago

webml-community/attention-visualization

Attention Visualization

Vision Transformer Attention Visualization

upvoted a collection 3 days ago

Flash Diffusion

Collection of models distilled using the method proposed in Flash Diffusion paper • 7 items • Updated Jun 18, 2024 • 15

upvoted a collection 4 days ago

Sana

⚡️Sana: Efficient High-Resolution Image Synthesis with Linear Diffusion Transformer • 17 items • Updated 15 days ago • 66

liked a Space 16 days ago

Scaling test-time compute

upvoted an article 21 days ago

Article

They Said It Couldn’t Be Done

30 days ago • 76

upvoted a collection 24 days ago

DeTikZify

Synthesizing Graphics Programs for Scientific Figures and Sketches with TikZ • 11 items • Updated about 1 month ago • 7

upvoted a collection 29 days ago

SigLIP

Contrastive (sigmoid) image-text models from https://arxiv.org/abs/2303.15343 • 10 items • Updated 21 days ago • 48

liked a model about 1 month ago

Derendering/InkSight-Small-p

Updated 23 days ago • 64 • 28

liked 3 Spaces about 1 month ago

Running on CPU Upgrade

Anychat

DeTikZify

Running on Zero

Florence2 + SAM2

upvoted a paper about 1 month ago

LLaVA-o1: Let Vision Language Models Reason Step-by-Step

Paper • 2411.10440 • Published Nov 15, 2024 • 111

upvoted a paper 2 months ago

InstructPix2Pix: Learning to Follow Image Editing Instructions

Paper • 2211.09800 • Published Nov 17, 2022 • 3

upvoted an article 2 months ago

Article

🧨 Diffusers welcomes Stable Diffusion 3.5 Large

Oct 22, 2024 • 49

upvoted a collection 2 months ago

DC-AE

Deep Compression Autoencoder • 14 items • Updated 29 days ago • 14

updated a model 3 months ago

tolgacangoz/matryoshka-diffusion-models

Text-to-Image • Updated Oct 20, 2024 • 163 • 3

liked a Space 3 months ago

Running on Zero

Matryoshka

upvoted an article 3 months ago

Article

Fine-tuning Florence-2 - Microsoft's Cutting-edge Vision Language Models

Jun 24, 2024 • 181

New activity in pcuenq/mdm 3 months ago

🚩 Report: Not working

#1 opened 3 months ago

reacted to reach-vb's post with 🔥 3 months ago

Post

3113

NEW: Open-source text/image-to-video model is out - MIT licensed - rivals Gen-3, Pika & Kling 🔥

> Pyramid Flow: Training-efficient Autoregressive Video Generation method
> Utilizes Flow Matching
> Trains on open-source datasets
> Generates high-quality 10-second videos
> Video resolution: 768p
> Frame rate: 24 FPS
> Supports image-to-video generation

> Model checkpoints available on the hub 🤗: rain1011/pyramid-flow-sd3