Zeyu Qin

qqqzzzyyy

https://alan-qin.github.io/

Alan-Qin

AI & ML interests

Trustworthy ML, AI safety

Recent Activity

upvoted a paper 15 days ago

ProcessBench: Identifying Process Errors in Mathematical Reasoning

liked a model about 1 month ago

sentence-transformers/all-mpnet-base-v2

liked a model about 1 month ago

OFA-Sys/InsTagger

View all activity

Organizations

None yet

qqqzzzyyy's activity

upvoted a paper 15 days ago

ProcessBench: Identifying Process Errors in Mathematical Reasoning

Paper • 2412.06559 • Published 29 days ago • 73

upvoted a paper about 2 months ago

AgentInstruct: Toward Generative Teaching with Agentic Flows

Paper • 2407.03502 • Published Jul 3, 2024 • 51

upvoted a paper 2 months ago

Adapting Large Language Models via Reading Comprehension

Paper • 2309.09530 • Published Sep 18, 2023 • 77

upvoted a collection 4 months ago

Qwen2.5-Math

Collection

Math-specific model series based on Qwen2.5 • 9 items • Updated Nov 28, 2024 • 59

upvoted a paper 4 months ago

Iterative Reasoning Preference Optimization

Paper • 2404.19733 • Published Apr 30, 2024 • 47

upvoted 2 articles 5 months ago

Article

Let's talk about LLM evaluation

•

May 23, 2024

• 143

Article

SmolLM - blazingly fast and remarkably powerful

Jul 16, 2024

• 297

upvoted 4 collections 6 months ago

upvoted a paper 6 months ago

Adam-mini: Use Fewer Learning Rates To Gain More

Paper • 2406.16793 • Published Jun 24, 2024 • 68

upvoted a paper 8 months ago

AdvPrompter: Fast Adaptive Adversarial Prompting for LLMs

Paper • 2404.16873 • Published Apr 21, 2024 • 28

upvoted a paper 9 months ago

Localizing Paragraph Memorization in Language Models

Paper • 2403.19851 • Published Mar 28, 2024 • 13

upvoted a paper 10 months ago

Decoding Compressed Trust: Scrutinizing the Trustworthiness of Efficient LLMs Under Compression

Paper • 2403.15447 • Published Mar 18, 2024 • 16

upvoted a collection 10 months ago

Handbook v0.1 models and datasets

Collection

Models and datasets for v0.1 of the alignment handbook • 6 items • Updated Nov 10, 2023 • 24

upvoted a paper 10 months ago

Towards Optimal Learning of Language Models

Paper • 2402.17759 • Published Feb 27, 2024 • 16

upvoted 3 papers 11 months ago

Beyond Training Objectives: Interpreting Reward Model Divergence in Large Language Models

Paper • 2310.08164 • Published Oct 12, 2023 • 4

Step-On-Feet Tuning: Scaling Self-Alignment of LLMs via Bootstrapping

Paper • 2402.07610 • Published Feb 12, 2024 • 7

WARM: On the Benefits of Weight Averaged Reward Models

Paper • 2401.12187 • Published Jan 22, 2024 • 18