Nexusflow
/

Athene-V2-Chat

Text Generation

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

banghua commited on Nov 26, 2024

Commit

b27d6d0

•

1 Parent(s): c0e2375

Update README.md

Files changed (1) hide show

README.md +7 -2

README.md CHANGED Viewed

@@ -18,9 +18,14 @@ base_model:
 </p>
-We introduce Athene-V2-Chat-72B, an open-weights LLM on-par with GPT-4o across benchmarks. It is trained through RLHF with Qwen-2.5-72B-Instruct as base model.
-Athene-V2-Chat-72B excels in chat, math, and coding. Its sister model, [Athene-V2-Agent-72B](https://huggingface.co/Nexusflow/Athene-V2-Agent), surpasses GPT-4o in complex function calling and agentic applications.
 <p align="center" width="100%">
 <a><img src="benchmark.png" alt="Benchmark" style="width: 100%; min-width: 300px; display: block; margin: auto;"></a>

 </p>
+We introduce Athene-V2-Chat-72B, an open-weights LLM on-par with GPT-4o across benchmarks. It is currently the best open model according to [Chatbot Arena](https://lmarena.ai/?leaderboard), where it beats GPT-4o-0513 (the best GPT-4o model on Arena) in hard and math category, and is on-par with GPT-4o-0513 in coding, instruction following, longer query and multi-turn.
+It is trained through RLHF with Qwen-2.5-72B-Instruct as base model. Athene-V2-Chat-72B excels in chat, math, and coding. Its sister model, [Athene-V2-Agent-72B](https://huggingface.co/Nexusflow/Athene-V2-Agent), surpasses GPT-4o in complex function calling and agentic applications.
+<p align="center" width="100%">
+<a><img src="arena.png" alt="Arena" style="width: 100%; min-width: 300px; display: block; margin: auto;"></a>
+</p>
 <p align="center" width="100%">
 <a><img src="benchmark.png" alt="Benchmark" style="width: 100%; min-width: 300px; display: block; margin: auto;"></a>