factbench / _header.md
farimafatahi's picture
Upload _header.md
0b8b19e verified

A newer version of the Streamlit SDK is available: 1.41.1

Upgrade

πŸ”Ž FactBench: A Dynamic Benchmark for In-the-Wild Language Model Factuality Evaluation

πŸ“‘ Paper | πŸ’» GitHub | | 🐦 X | πŸ’¬ Discussion | βš™οΈ Version: V1 | # Models: {model_num} | Updated: 10/26/2024