wang's picture

16

wang

wangxbx

AI & ML interests

None yet

Recent Activity

upvoted a paper about 23 hours ago

MMDocIR: Benchmarking Multi-Modal Retrieval for Long Documents

upvoted a paper about 23 hours ago

Towards Best Practices for Open Datasets for LLM Training

upvoted a paper 1 day ago

O1 Replication Journey -- Part 3: Inference-time Scaling for Medical Reasoning

View all activity

Organizations

None yet

wangxbx's activity

upvoted 2 papers about 23 hours ago

MMDocIR: Benchmarking Multi-Modal Retrieval for Long Documents

Paper • 2501.08828 • Published 2 days ago • 24

Towards Best Practices for Open Datasets for LLM Training

Paper • 2501.08365 • Published 3 days ago • 37

upvoted 3 papers 1 day ago

O1 Replication Journey -- Part 3: Inference-time Scaling for Medical Reasoning

Paper • 2501.06458 • Published 7 days ago • 29

LlamaV-o1: Rethinking Step-by-step Visual Reasoning in LLMs

Paper • 2501.06186 • Published 7 days ago • 54

MiniMax-01: Scaling Foundation Models with Lightning Attention

Paper • 2501.08313 • Published 3 days ago • 254

upvoted 5 papers 8 days ago

Multi-task retriever fine-tuning for domain-specific and efficient RAG

Paper • 2501.04652 • Published 9 days ago • 10

Search-o1: Agentic Search-Enhanced Large Reasoning Models

Paper • 2501.05366 • Published 8 days ago • 75

Agent Laboratory: Using LLM Agents as Research Assistants

Paper • 2501.04227 • Published 10 days ago • 77

LLM4SR: A Survey on Large Language Models for Scientific Research

Paper • 2501.04306 • Published 10 days ago • 33

Towards System 2 Reasoning in LLMs: Learning How to Think With Meta Chain-of-Though

Paper • 2501.04682 • Published 9 days ago • 83

upvoted a paper 9 days ago

Cosmos World Foundation Model Platform for Physical AI

Paper • 2501.03575 • Published 11 days ago • 63

upvoted a paper 10 days ago

Test-time Computing: from System-1 Thinking to System-2 Thinking

Paper • 2501.02497 • Published 13 days ago • 40

upvoted a paper 28 days ago

Qwen2.5 Technical Report

Paper • 2412.15115 • Published 29 days ago • 340

upvoted 3 papers about 2 months ago

Draft Model Knows When to Stop: A Self-Verification Length Policy for Speculative Decoding

Paper • 2411.18462 • Published Nov 27, 2024 • 6

Low-Bit Quantization Favors Undertrained LLMs: Scaling Laws for Quantized LLMs with 100T Training Tokens

Paper • 2411.17691 • Published Nov 26, 2024 • 11

Star Attention: Efficient LLM Inference over Long Sequences

Paper • 2411.17116 • Published Nov 26, 2024 • 49