Update README.md
Browse files
README.md
CHANGED
@@ -18,9 +18,14 @@ base_model:
|
|
18 |
</p>
|
19 |
|
20 |
|
21 |
-
We introduce Athene-V2-Chat-72B, an open-weights LLM on-par with GPT-4o across benchmarks. It is
|
22 |
-
Athene-V2-Chat-72B excels in chat, math, and coding. Its sister model, [Athene-V2-Agent-72B](https://huggingface.co/Nexusflow/Athene-V2-Agent), surpasses GPT-4o in complex function calling and agentic applications.
|
23 |
|
|
|
|
|
|
|
|
|
|
|
|
|
24 |
|
25 |
<p align="center" width="100%">
|
26 |
<a><img src="benchmark.png" alt="Benchmark" style="width: 100%; min-width: 300px; display: block; margin: auto;"></a>
|
|
|
18 |
</p>
|
19 |
|
20 |
|
21 |
+
We introduce Athene-V2-Chat-72B, an open-weights LLM on-par with GPT-4o across benchmarks. It is currently the best open model according to [Chatbot Arena](https://lmarena.ai/?leaderboard), where it beats GPT-4o-0513 (the best GPT-4o model on Arena) in hard and math category, and is on-par with GPT-4o-0513 in coding, instruction following, longer query and multi-turn.
|
|
|
22 |
|
23 |
+
It is trained through RLHF with Qwen-2.5-72B-Instruct as base model. Athene-V2-Chat-72B excels in chat, math, and coding. Its sister model, [Athene-V2-Agent-72B](https://huggingface.co/Nexusflow/Athene-V2-Agent), surpasses GPT-4o in complex function calling and agentic applications.
|
24 |
+
|
25 |
+
|
26 |
+
<p align="center" width="100%">
|
27 |
+
<a><img src="arena.png" alt="Arena" style="width: 100%; min-width: 300px; display: block; margin: auto;"></a>
|
28 |
+
</p>
|
29 |
|
30 |
<p align="center" width="100%">
|
31 |
<a><img src="benchmark.png" alt="Benchmark" style="width: 100%; min-width: 300px; display: block; margin: auto;"></a>
|