dolphinlee
's Collections
System 2 Attention (is something you might need too)
Paper
•
2311.11829
•
Published
•
39
TPTU-v2: Boosting Task Planning and Tool Usage of Large Language
Model-based Agents in Real-world Systems
Paper
•
2311.11315
•
Published
•
6
Paper
•
2312.07000
•
Published
•
11
Steering Llama 2 via Contrastive Activation Addition
Paper
•
2312.06681
•
Published
•
11
Llama Guard: LLM-based Input-Output Safeguard for Human-AI Conversations
Paper
•
2312.06674
•
Published
•
7
Controlled Decoding from Language Models
Paper
•
2310.17022
•
Published
•
14
Vision-Language Models as a Source of Rewards
Paper
•
2312.09187
•
Published
•
11
Secrets of RLHF in Large Language Models Part II: Reward Modeling
Paper
•
2401.06080
•
Published
•
26
Contrastive Preference Optimization: Pushing the Boundaries of LLM
Performance in Machine Translation
Paper
•
2401.08417
•
Published
•
34
Weaver: Foundation Models for Creative Writing
Paper
•
2401.17268
•
Published
•
43
MobileLLM: Optimizing Sub-billion Parameter Language Models for
On-Device Use Cases
Paper
•
2402.14905
•
Published
•
126
StarCoder 2 and The Stack v2: The Next Generation
Paper
•
2402.19173
•
Published
•
136
MathScale: Scaling Instruction Tuning for Mathematical Reasoning
Paper
•
2403.02884
•
Published
•
15
GaLore: Memory-Efficient LLM Training by Gradient Low-Rank Projection
Paper
•
2403.03507
•
Published
•
183
User-LLM: Efficient LLM Contextualization with User Embeddings
Paper
•
2402.13598
•
Published
•
19
CodecLM: Aligning Language Models with Tailored Synthetic Data
Paper
•
2404.05875
•
Published
•
16