Eunsu Kim's picture

3 5 2

Eunsu Kim

EunsuKim

·

AI & ML interests

None yet

Recent Activity

authored a paper about 15 hours ago

BLEnD: A Benchmark for LLMs on Everyday Knowledge in Diverse Cultures and Languages

authored a paper about 15 hours ago

LLM-as-an-Interviewer: Beyond Static Testing Through Dynamic LLM Evaluation

upvoted a paper 1 day ago

LLM-as-an-Interviewer: Beyond Static Testing Through Dynamic LLM Evaluation

View all activity

Organizations

EunsuKim's activity

authored 2 papers about 15 hours ago

BLEnD: A Benchmark for LLMs on Everyday Knowledge in Diverse Cultures and Languages

Paper • 2406.09948 • Published Jun 14, 2024

LLM-as-an-Interviewer: Beyond Static Testing Through Dynamic LLM Evaluation

Paper • 2412.10424 • Published 27 days ago • 2

upvoted a paper 1 day ago

LLM-as-an-Interviewer: Beyond Static Testing Through Dynamic LLM Evaluation

Paper • 2412.10424 • Published 27 days ago • 2

updated 2 Spaces 15 days ago

Configuration Card Sharing Space

No application file

README

upvoted a paper about 2 months ago

Prometheus 2: An Open Source Language Model Specialized in Evaluating Other Language Models

Paper • 2405.01535 • Published May 2, 2024 • 119

upvoted a paper 2 months ago

Survey of Cultural Awareness in Language Models: Text and Beyond

Paper • 2411.00860 • Published Oct 30, 2024 • 23

updated a dataset 2 months ago

interview-eval/DepthQA

Viewer • Updated Oct 30, 2024 • 847

updated 2 collections 3 months ago

olmoe-STEM-case-{1,2,3,5,7}

5 items • Updated Nov 24, 2024

olmoe-MATH-case-{1-8}

8 items • Updated Oct 22, 2024