7 8 170

Junyeong Song

junyeong-nero

junyeong-nero

AI & ML interests

None yet

Recent Activity

liked a model about 12 hours ago

hexgrad/Kokoro-82M

liked a dataset about 24 hours ago

HuggingFaceM4/ChartQA

liked a dataset about 24 hours ago

MMVP/MMVP

View all activity

Organizations

None yet

junyeong-nero's activity

liked a model about 12 hours ago

hexgrad/Kokoro-82M

Text-to-Speech • Updated 2 days ago • 20.6k • 1.85k

liked 2 datasets about 24 hours ago

HuggingFaceM4/ChartQA

Viewer • Updated Mar 5, 2024 • 32.7k • 2.47k • 19

MMVP/MMVP

Viewer • Updated Jun 1, 2024 • 300 • 405 • 13

liked a model 1 day ago

prometheus-eval/prometheus-13b-v1.0

Text2Text Generation • Updated Oct 14, 2023 • 3.77k • 134

liked 2 datasets 3 days ago

Idavidrein/gpqa

Viewer • Updated Mar 28, 2024 • 1.25k • 17.8k • 106

allenai/winogrande

Updated Jan 18, 2024 • 81.1k • 59

upvoted a paper 4 days ago

O1 Replication Journey -- Part 3: Inference-time Scaling for Medical Reasoning

Paper • 2501.06458 • Published 7 days ago • 29

liked a model 4 days ago

NovaSky-AI/Sky-T1-32B-Preview

Text Generation • Updated 5 days ago • 4.89k • 444

liked 2 datasets 5 days ago

Lin-Chen/ShareGPT4V

Viewer • Updated Jun 6, 2024 • 1.35M • 549 • 277

MoritzLaurer/facts-grounding-prompts

Updated about 15 hours ago • 71 • 3

reacted to MoritzLaurer's post with ❤️ 5 days ago

Post

2913

FACTS is a great paper from @GoogleDeepMind on measuring the factuality of LLM outputs. You can now download their prompt templates from @huggingface to improve LLM-based fact-checking yourself!

📏 The paper introduces the FACTS Grounding benchmark for evaluating the factuality of LLM outputs.

🤖 Fact-checking is automated by an ensemble of LLM judges that verify if a response is fully grounded in a factual reference document.

🧪 The authors tested different prompt templates on held-out data to ensure their generalization.

📚 It's highly educational to read these templates to learn how frontier labs design prompts and understand their limitations.

💾 You can now download and reuse these prompt templates via the prompt-templates library!

🔄 The library simplifies sharing prompt templates on the HF hub or locally via standardized YAML files. Let’s make LLM work more transparent and reproducible by sharing more templates like this!

Links 👇
- prompt-templates docs: https://moritzlaurer.github.io/prompt_templates/
- all templates on the HF Hub: MoritzLaurer/facts-grounding-prompts
- FACTS paper: https://storage.googleapis.com/deepmind-media/FACTS/FACTS_grounding_paper.pdf