JingfengYao

MapleF

AI & ML interests

None yet

Organizations

None yet

MapleF's activity

upvoted a paper 17 days ago

Flowing from Words to Pixels: A Framework for Cross-Modality Evolution

Paper • 2412.15213 • Published 17 days ago • 25

upvoted a paper 25 days ago

LAION-SG: An Enhanced Large-Scale Dataset for Training Complex Image-Text Models with Structural Annotations

Paper • 2412.08580 • Published 26 days ago • 45

upvoted a paper about 1 month ago

DiffusionDrive: Truncated Diffusion Model for End-to-End Autonomous Driving

Paper • 2411.15139 • Published Nov 22, 2024 • 15

upvoted a paper 3 months ago

Emu3: Next-Token Prediction is All You Need

Paper • 2409.18869 • Published Sep 27, 2024 • 94

upvoted 2 papers 4 months ago

The Mamba in the Llama: Distilling and Accelerating Hybrid Models

Paper • 2408.15237 • Published Aug 27, 2024 • 38

Foundation Models for Music: A Survey

Paper • 2408.14340 • Published Aug 26, 2024 • 44

upvoted 6 papers 5 months ago

MegaFusion: Extend Diffusion Models towards Higher-resolution Image Generation without Further Tuning

Paper • 2408.11001 • Published Aug 20, 2024 • 11

Transfusion: Predict the Next Token and Diffuse Images with One Multi-Modal Model

Paper • 2408.11039 • Published Aug 20, 2024 • 58

xGen-MM (BLIP-3): A Family of Open Large Multimodal Models

Paper • 2408.08872 • Published Aug 16, 2024 • 98

Generative Photomontage

Paper • 2408.07116 • Published Aug 13, 2024 • 20

MedTrinity-25M: A Large-scale Multimodal Dataset with Multigranular Annotations for Medicine

Paper • 2408.02900 • Published Aug 6, 2024 • 25

LKCell: Efficient Cell Nuclei Instance Segmentation with Large Convolution Kernels

Paper • 2407.18054 • Published Jul 25, 2024 • 12

liked 2 models 7 months ago

stabilityai/stable-diffusion-3-medium

Text-to-Image • Updated Aug 12, 2024 • 21.8k • 4.66k

hustvl/vitmatte-base-composition-1k

Updated Sep 21, 2023 • 46k • 10

upvoted 2 papers 7 months ago

ConvLLaVA: Hierarchical Backbones as Visual Encoder for Large Multimodal Models

Paper • 2405.15738 • Published May 24, 2024 • 43

Meteor: Mamba-based Traversal of Rationale for Large Language and Vision Models

Paper • 2405.15574 • Published May 24, 2024 • 53

upvoted 3 papers 9 months ago

COCONut: Modernizing COCO Segmentation

Paper • 2404.08639 • Published Apr 12, 2024 • 27

Leave No Context Behind: Efficient Infinite Context Transformers with Infini-attention

Paper • 2404.07143 • Published Apr 10, 2024 • 104

MagicTime: Time-lapse Video Generation Models as Metamorphic Simulators

Paper • 2404.05014 • Published Apr 7, 2024 • 32

liked a model 9 months ago

hustvl/vitmatte-small-composition-1k

Updated Mar 29, 2024 • 1.54M • 30