siyeng feng's picture

303 171

siyeng feng

siyengfeng

·

AI & ML interests

None yet

Recent Activity

upvoted a paper 3 days ago

Transformer^2: Self-adaptive LLMs

upvoted a paper 3 days ago

HuatuoGPT-o1, Towards Medical Complex Reasoning with LLMs

upvoted a paper 4 days ago

Tensor Product Attention Is All You Need

View all activity

Organizations

None yet

siyengfeng's activity

upvoted 2 papers 3 days ago

Transformer^2: Self-adaptive LLMs

Paper • 2501.06252 • Published 11 days ago • 46

HuatuoGPT-o1, Towards Medical Complex Reasoning with LLMs

Paper • 2412.18925 • Published 25 days ago • 94

upvoted a paper 4 days ago

Tensor Product Attention Is All You Need

Paper • 2501.06425 • Published 9 days ago • 66

liked a model 4 days ago

Qwen/Qwen2.5-Coder-32B-Instruct

Text Generation • Updated 8 days ago • 194k • • 1.49k

upvoted a paper 4 days ago

MiniMax-01: Scaling Foundation Models with Lightning Attention

Paper • 2501.08313 • Published 5 days ago • 259

upvoted 4 papers 5 days ago

o1-Coder: an o1 Replication for Coding

Paper • 2412.00154 • Published Nov 29, 2024 • 43

O1 Replication Journey -- Part 3: Inference-time Scaling for Medical Reasoning

Paper • 2501.06458 • Published 8 days ago • 29

Towards System 2 Reasoning in LLMs: Learning How to Think With Meta Chain-of-Though

Paper • 2501.04682 • Published 11 days ago • 83

Agent Laboratory: Using LLM Agents as Research Assistants

Paper • 2501.04227 • Published 12 days ago • 77

liked a model 5 days ago

Qwen/QwQ-32B-Preview

Text Generation • Updated 8 days ago • 158k • • 1.57k

upvoted 4 papers 5 days ago

Virgo: A Preliminary Exploration on Reproducing o1-like MLLM

Paper • 2501.01904 • Published 16 days ago • 31

Test-time Computing: from System-1 Thinking to System-2 Thinking

Paper • 2501.02497 • Published 14 days ago • 40

The Lessons of Developing Process Reward Models in Mathematical Reasoning

Paper • 2501.07301 • Published 6 days ago • 80

DeepSeek-V3 Technical Report

Paper • 2412.19437 • Published 24 days ago • 23

liked a dataset 5 days ago

fka/awesome-chatgpt-prompts

Viewer • Updated 14 days ago • 203 • 5.96k • 6.95k

liked 4 models 5 days ago

NovaSky-AI/Sky-T1-32B-Preview

Text Generation • Updated 6 days ago • 7.51k • 476

deepseek-ai/DeepSeek-V3

Updated 21 days ago • 155k • 2.04k

microsoft/phi-4

Text Generation • Updated 11 days ago • 124k • 1.44k

Qwen/Qwen2.5-32B-Instruct

Text Generation • Updated Sep 25, 2024 • 204k • 178

upvoted a paper 5 days ago

GUI Agents: A Survey

Paper • 2412.13501 • Published Dec 18, 2024 • 24