hongyu's picture

277

hongyu

learn12138

·

AI & ML interests

None yet

Recent Activity

upvoted a paper 7 days ago

Tensor Product Attention Is All You Need

upvoted a paper 7 days ago

ConceptMaster: Multi-Concept Video Customization on Diffusion Transformer Models Without Test-Time Tuning

upvoted a paper 7 days ago

The GAN is dead; long live the GAN! A Modern GAN Baseline

View all activity

Organizations

None yet

learn12138's activity

upvoted 5 papers 7 days ago

Tensor Product Attention Is All You Need

Paper • 2501.06425 • Published 10 days ago • 72

ConceptMaster: Multi-Concept Video Customization on Diffusion Transformer Models Without Test-Time Tuning

Paper • 2501.04698 • Published 13 days ago • 15

The GAN is dead; long live the GAN! A Modern GAN Baseline

Paper • 2501.05441 • Published 12 days ago • 80

Magic Mirror: ID-Preserved Video Generation in Video Diffusion Transformers

Paper • 2501.03931 • Published 14 days ago • 14

Diffusion as Shader: 3D-aware Video Diffusion for Versatile Video Generation Control

Paper • 2501.03847 • Published 14 days ago • 23

upvoted a paper 12 days ago

STAR: Spatial-Temporal Augmentation with Text-to-Video Models for Real-World Video Super-Resolution

Paper • 2501.02976 • Published 15 days ago • 51

upvoted 3 papers 16 days ago

SeedVR: Seeding Infinity in Diffusion Transformer Towards Generic Video Restoration

Paper • 2501.01320 • Published 19 days ago • 11

LTX-Video: Realtime Video Latent Diffusion

Paper • 2501.00103 • Published 22 days ago • 41

VideoAnydoor: High-fidelity Video Object Insertion with Precise Motion Control

Paper • 2501.01427 • Published 19 days ago • 49

upvoted a paper 19 days ago

VideoMaker: Zero-shot Customized Video Generation with the Inherent Force of Video Diffusion Models

Paper • 2412.19645 • Published 25 days ago • 13

upvoted a paper 23 days ago

VidTwin: Video VAE with Decoupled Structure and Dynamics

Paper • 2412.17726 • Published 29 days ago • 8

upvoted 7 papers 27 days ago

DiTCtrl: Exploring Attention Control in Multi-Modal Diffusion Transformer for Tuning-Free Multi-Prompt Longer Video Generation

Paper • 2412.18597 • Published 28 days ago • 19

Fourier Position Embedding: Enhancing Attention's Periodic Extension for Length Generalization

Paper • 2412.17739 • Published 29 days ago • 39

Large Motion Video Autoencoding with Cross-modal Video VAE

Paper • 2412.17805 • Published 29 days ago • 24

TRecViT: A Recurrent Video Transformer

Paper • 2412.14294 • Published Dec 18, 2024 • 12

CLEAR: Conv-Like Linearization Revs Pre-Trained Diffusion Transformers Up

Paper • 2412.16112 • Published Dec 20, 2024 • 21

Taming Multimodal Joint Training for High-Quality Video-to-Audio Synthesis

Paper • 2412.15322 • Published Dec 19, 2024 • 18

AV-Link: Temporally-Aligned Diffusion Features for Cross-Modal Audio-Video Generation

Paper • 2412.15191 • Published Dec 19, 2024 • 5

upvoted 2 papers about 1 month ago

Autoregressive Video Generation without Vector Quantization

Paper • 2412.14169 • Published Dec 18, 2024 • 14

VividFace: A Diffusion-Based Hybrid Framework for High-Fidelity Video Face Swapping

Paper • 2412.11279 • Published Dec 15, 2024 • 12