Zilong Huang

SereinH

AI & ML interests

None yet

Organizations

None yet

SereinH's activity

upvoted a paper about 1 month ago

Imagine360: Immersive 360 Video Generation from Perspective Anchor

Paper • 2412.03552 • Published Dec 4, 2024 • 26

liked a model about 1 month ago

Shakker-Labs/AWPortraitCN

Text-to-Image • Updated Dec 4, 2024 • 1.34k • 185

upvoted a paper about 1 month ago

SOLAMI: Social Vision-Language-Action Modeling for Immersive Interaction with 3D Autonomous Characters

Paper • 2412.00174 • Published Nov 29, 2024 • 22

liked a model 2 months ago

alimama-creative/FLUX.1-dev-Controlnet-Inpainting-Beta

Image-to-Image • Updated Oct 12, 2024 • 13.6k • 246

liked a model 3 months ago

tencent/DepthCrafter

Depth Estimation • Updated Sep 24, 2024 • 76k • 79

liked a dataset 3 months ago

gvecchio/MatSynth

Updated Apr 16, 2024 • 2.76k • 42

authored a paper 3 months ago

LOKI: A Comprehensive Synthetic Data Detection Benchmark using Large Multimodal Models

Paper • 2410.09732 • Published Oct 13, 2024 • 54

upvoted 2 papers 3 months ago

LOKI: A Comprehensive Synthetic Data Detection Benchmark using Large Multimodal Models

Paper • 2410.09732 • Published Oct 13, 2024 • 54

MinerU: An Open-Source Solution for Precise Document Content Extraction

Paper • 2409.18839 • Published Sep 27, 2024 • 27

upvoted 2 papers 4 months ago

CrossViewDiff: A Cross-View Diffusion Model for Satellite-to-Street View Synthesis

Paper • 2408.14765 • Published Aug 27, 2024 • 15

UrBench: A Comprehensive Benchmark for Evaluating Large Multimodal Models in Multi-View Urban Scenarios

Paper • 2408.17267 • Published Aug 30, 2024 • 23

liked a dataset 5 months ago

WenhaoWang/VidProM

Viewer • Updated Sep 26, 2024 • 1.67M • 1.86k • 59

liked a dataset 6 months ago

bdsqlsz/qinglong_vtuber

Viewer • Updated Jul 13, 2024 • 26 • 39 • 3

liked 2 datasets 7 months ago

EPFL-CVLab/OpenMaterial

Updated Sep 20, 2024 • 44 • 14

deepghs/gelbooru_full

Preview • Updated 5 days ago • 9.85k • 37

liked 2 models 8 months ago

facebook/mask2former-swin-tiny-coco-panoptic

Image Segmentation • Updated Sep 11, 2023 • 6.5k • 8

nvidia/segformer-b1-finetuned-cityscapes-1024-1024

Image Segmentation • Updated Aug 9, 2022 • 7.52k • 12

liked a model 9 months ago

stabilityai/stable-diffusion-xl-refiner-1.0

Image-to-Image • Updated Sep 25, 2023 • 605k • 1.77k

liked a Space 9 months ago

CLIP Interrogator

Running on A10G • 2.76k

liked a dataset 12 months ago

ShapeNet/ShapeNetCore

Updated Sep 20, 2023 • 455 • 103