zerozeyi's Collections
OpenMoE: An Early Effort on Open Mixture-of-Experts Language Models
Paper
•
2402.01739
•
Published
•
26
Rethinking Interpretability in the Era of Large Language Models
Paper
•
2402.01761
•
Published
•
22
Self-Discover: Large Language Models Self-Compose Reasoning Structures
Paper
•
2402.03620
•
Published
•
114
Aya Model: An Instruction Finetuned Open-Access Multilingual Language Model
Paper
•
2402.07827
•
Published
•
45
Chain-of-Thought Reasoning Without Prompting
Paper
•
2402.10200
•
Published
•
104
Generative Representational Instruction Tuning
Paper
•
2402.09906
•
Published
•
53
BitDelta: Your Fine-Tune May Only Be Worth One Bit
Paper
•
2402.10193
•
Published
•
19
SOLAR 10.7B: Scaling Large Language Models with Simple yet Effective Depth Up-Scaling
Paper
•
2312.15166
•
Published
•
56
SPAR: Personalized Content-Based Recommendation via Long Engagement Attention
Paper
•
2402.10555
•
Published
•
34
Learning to Learn Faster from Human Feedback with Language Model Predictive Control
Paper
•
2402.11450
•
Published
•
21
Paper
•
2402.12219
•
Published
•
16
MobiLlama: Towards Accurate and Lightweight Fully Transparent GPT
Paper
•
2402.16840
•
Published
•
23
The Era of 1-bit LLMs: All Large Language Models are in 1.58 Bits
Paper
•
2402.17764
•
Published
•
605
Can large language models explore in-context?
Paper
•
2403.15371
•
Published
•
32
OpenELM: An Efficient Language Model Family with Open-source Training and Inference Framework
Paper
•
2404.14619
•
Published
•
126
FLAME: Factuality-Aware Alignment for Large Language Models
Paper
•
2405.01525
•
Published
•
24
Octopus v4: Graph of language models
Paper
•
2404.19296
•
Published
•
116
KAN: Kolmogorov-Arnold Networks
Paper
•
2404.19756
•
Published
•
108
LoRA Land: 310 Fine-tuned LLMs that Rival GPT-4, A Technical Report
Paper
•
2405.00732
•
Published
•
118
From Loops to Oops: Fallback Behaviors of Language Models Under Uncertainty
Paper
•
2407.06071
•
Published
•
7
Human-like Episodic Memory for Infinite Context LLMs
Paper
•
2407.09450
•
Published
•
60
LLMs + Persona-Plug = Personalized LLMs
Paper
•
2409.11901
•
Published
•
31