Ivan Tang's picture

4 3

Ivan Tang

IvanTang

·

Ivan_Tang_3D

AI & ML interests

Multimodal,3D,PEFT,LLM&MLLM

Recent Activity

upvoted a paper 20 days ago

PixWizard: Versatile Image-to-Image Visual Assistant with Open-Language Instructions

upvoted a paper 5 months ago

SAM2Point: Segment Any 3D as Videos in Zero-shot and Promptable Manners

liked a model 7 months ago

stabilityai/stable-diffusion-3-medium

View all activity

Organizations

None yet

IvanTang's activity

upvoted a paper 20 days ago

PixWizard: Versatile Image-to-Image Visual Assistant with Open-Language Instructions

Paper • 2409.15278 • Published Sep 23, 2024 • 24

upvoted a paper 5 months ago

SAM2Point: Segment Any 3D as Videos in Zero-shot and Promptable Manners

Paper • 2408.16768 • Published Aug 29, 2024 • 27

liked 2 models 7 months ago

stabilityai/stable-diffusion-3-medium

Text-to-Image • Updated Aug 12, 2024 • 27.2k • 4.68k

microsoft/Phi-3-vision-128k-instruct

Text Generation • Updated Aug 20, 2024 • 98.1k • 945

liked a dataset 9 months ago

HuggingFaceFW/fineweb

Viewer • Updated 19 days ago • 48.6B • 323k • 1.83k

upvoted a paper 9 months ago

MathVerse: Does Your Multi-modal LLM Truly See the Diagrams in Visual Math Problems?

Paper • 2403.14624 • Published Mar 21, 2024 • 52

upvoted a paper about 1 year ago

SPHINX: The Joint Mixing of Weights, Tasks, and Visual Embeddings for Multi-modal Large Language Models

Paper • 2311.07575 • Published Nov 13, 2023 • 14