CulturalBench / data /CulturalBench_Easy_average.csv
kellycyy's picture
first init
c76f18f
raw
history blame contribute delete
699 Bytes
model,accuracy
gpt-3-5-turbo-1106,71.7196414
gpt35turbo,71.96414018
gpt4omini,83.21108394
gpt4o,88.83455583
gpt-4o-2024-08-06,89.73105134
gpt-4-0125-preview,88.50855746
gpt-4-1106-preview,88.59005705
haiku,61.69519152
sonnet3,60.06519967
opus,81.01059495
sonnet35,79.95110024
mistralnemo,71.47514262
mistralsmall,68.4596577
mistral-large-2402,56.72371638
mistrallarge,85.8190709
llama3-8b,70.25264874
llama3-70b,83.04808476
llama3-1-8b,71.23064385
llama3-1-70b,84.10757946
llama3-1-405b,85.65607172
gemma2-9b,76.20211899
gemma2-27b,79.21760391
mistral-7b-v1,58.10920945
mistral-7b-v2,54.44172779
mixtral-8x22B,74.00162999
qwen1-5-72b-chat,80.11409943
qwen2-72b,83.21108394