wongyukim's picture

266 66

wongyukim

wongyukim

·

kimwongyuda

AI & ML interests

None yet

Recent Activity

liked a Space about 12 hours ago

TIGER-Lab/MMEB

upvoted a paper about 14 hours ago

Critique Fine-Tuning: Learning to Critique is More Effective than Learning to Imitate

upvoted a paper about 14 hours ago

Atla Selene Mini: A General Purpose Evaluation Model

View all activity

Organizations

None yet

wongyukim's activity

liked a Space about 12 hours ago

MMEB Leaderboard

The massive multimodal embedding benchmark

upvoted 2 papers about 14 hours ago

Critique Fine-Tuning: Learning to Critique is More Effective than Learning to Imitate

Paper • 2501.17703 • Published 2 days ago • 36

Atla Selene Mini: A General Purpose Evaluation Model

Paper • 2501.17195 • Published 4 days ago • 28

upvoted 2 papers 1 day ago

Optimizing Large Language Model Training Using FP4 Quantization

Paper • 2501.17116 • Published 3 days ago • 25

SFT Memorizes, RL Generalizes: A Comparative Study of Foundation Model Post-training

Paper • 2501.17161 • Published 3 days ago • 53

upvoted 7 papers 3 days ago

Towards General-Purpose Model-Free Reinforcement Learning

Paper • 2501.16142 • Published 4 days ago • 21

Qwen2.5-1M Technical Report

Paper • 2501.15383 • Published 6 days ago • 42

Baichuan-Omni-1.5 Technical Report

Paper • 2501.15368 • Published 6 days ago • 45

RL + Transformer = A General-Purpose Problem Solver

Paper • 2501.14176 • Published 8 days ago • 17

Redundancy Principles for MLLMs Benchmarks

Paper • 2501.13953 • Published 12 days ago • 24

Chain-of-Retrieval Augmented Generation

Paper • 2501.14342 • Published 8 days ago • 37

Humanity's Last Exam

Paper • 2501.14249 • Published 8 days ago • 48

upvoted a paper 5 days ago

Can We Generate Images with CoT? Let's Verify and Reinforce Image Generation Step by Step

Paper • 2501.13926 • Published 8 days ago • 31

upvoted a paper 6 days ago

Video-MMMU: Evaluating Knowledge Acquisition from Multi-Discipline Professional Videos

Paper • 2501.13826 • Published 8 days ago • 22

upvoted 6 papers 8 days ago

O1-Pruner: Length-Harmonizing Fine-Tuning for O1-Like Reasoning Pruning

Paper • 2501.12570 • Published 10 days ago • 22

VideoLLaMA 3: Frontier Multimodal Foundation Models for Image and Video Understanding

Paper • 2501.13106 • Published 9 days ago • 76

InternLM-XComposer2.5-Reward: A Simple Yet Effective Multi-Modal Reward Model

Paper • 2501.12368 • Published 10 days ago • 39

Agent-R: Training Language Model Agents to Reflect via Iterative Self-Training

Paper • 2501.11425 • Published 11 days ago • 88

MMVU: Measuring Expert-Level Multi-Discipline Video Understanding

Paper • 2501.12380 • Published 10 days ago • 80

DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning

Paper • 2501.12948 • Published 9 days ago • 279