- A Survey on Evaluation of Large Language Models
  Paper • 2307.03109 • Published • 42
- SemScore: Automated Evaluation of Instruction-Tuned LLMs based on Semantic Textual Similarity
  Paper • 2401.17072 • Published • 25
- LLM Comparator: Visual Analytics for Side-by-Side Evaluation of Large Language Models
  Paper • 2402.10524 • Published • 23
Collections including paper arxiv:2401.17072
- Can Large Language Models Understand Context?
  Paper • 2402.00858 • Published • 23
- OLMo: Accelerating the Science of Language Models
  Paper • 2402.00838 • Published • 83
- Self-Rewarding Language Models
  Paper • 2401.10020 • Published • 146
- SemScore: Automated Evaluation of Instruction-Tuned LLMs based on Semantic Textual Similarity
  Paper • 2401.17072 • Published • 25

- Attention Is All You Need
  Paper • 1706.03762 • Published • 50
- LoRA: Low-Rank Adaptation of Large Language Models
  Paper • 2106.09685 • Published • 31
- Direct Preference Optimization: Your Language Model is Secretly a Reward Model
  Paper • 2305.18290 • Published • 52
- Lost in the Middle: How Language Models Use Long Contexts
  Paper • 2307.03172 • Published • 37