Yash Thube

thubZ9

AI & ML interests

Multimodal learning, VLMs, CV, NLP, RL

thubZ9's activity

upvoted a paper about 15 hours ago

SFT Memorizes, RL Generalizes: A Comparative Study of Foundation Model Post-training

Paper • 2501.17161 • Published 1 day ago • 27

updated a collection 7 days ago

My reading list!

10 items • Updated 7 days ago • 1

upvoted a paper 7 days ago

DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning

Paper • 2501.12948 • Published 7 days ago • 261

upvoted a paper 8 days ago

Agent-R: Training Language Model Agents to Reflect via Iterative Self-Training

Paper • 2501.11425 • Published 9 days ago • 84

updated a collection 8 days ago

My reading list!

10 items • Updated 7 days ago • 1

upvoted a paper 8 days ago

Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models

Paper • 2403.18814 • Published Mar 27, 2024 • 47

updated a collection 8 days ago

My reading list!

10 items • Updated 7 days ago • 1

upvoted a paper 8 days ago

OmniThink: Expanding Knowledge Boundaries in Machine Writing through Thinking

Paper • 2501.09751 • Published 13 days ago • 47

upvoted a collection 9 days ago

My reading list!

10 items • Updated 7 days ago • 1

updated a collection 9 days ago

My reading list!

10 items • Updated 7 days ago • 1

upvoted a paper 9 days ago

MoE-LLaVA: Mixture of Experts for Large Vision-Language Models

Paper • 2401.15947 • Published Jan 29, 2024 • 51

updated a collection 9 days ago

My reading list!

10 items • Updated 7 days ago • 1

upvoted a paper 9 days ago

DeepSeek-VL: Towards Real-World Vision-Language Understanding

Paper • 2403.05525 • Published Mar 8, 2024 • 42

upvoted 2 papers 10 days ago

Towards Large Reasoning Models: A Survey of Reinforced Reasoning with Large Language Models

Paper • 2501.09686 • Published 13 days ago • 35

Evolving Deeper LLM Thinking

Paper • 2501.09891 • Published 13 days ago • 100

updated a collection 12 days ago

My reading list!

10 items • Updated 7 days ago • 1

upvoted 2 papers 12 days ago

SAM 2: Segment Anything in Images and Videos

Paper • 2408.00714 • Published Aug 1, 2024 • 112

MiniMax-01: Scaling Foundation Models with Lightning Attention

Paper • 2501.08313 • Published 15 days ago • 270

updated a collection 12 days ago

My reading list!

10 items • Updated 7 days ago • 1

liked a dataset 12 days ago

thubZ9/MRI_Classification-Tumor

Viewer • Updated 18 days ago • 800 • 34 • 1