multilingual-reward-bench

community

Activity Feed

AI & ML interests

None defined yet.

Recent Activity

seungone authored a paper about 17 hours ago

LLM-as-an-Interviewer: Beyond Static Testing Through Dynamic LLM Evaluation

seungone authored a paper 4 days ago

Bridging the Data Provenance Gap Across Text, Speech and Video

shayekh authored a paper 22 days ago

bbOCR: An Open-source Multi-domain OCR Pipeline for Bengali Documents

View all activity

multilingual-reward-bench's activity

seungone

authored a paper about 17 hours ago

LLM-as-an-Interviewer: Beyond Static Testing Through Dynamic LLM Evaluation

Paper • 2412.10424 • Published 25 days ago • 1

seungone

authored a paper 4 days ago

Bridging the Data Provenance Gap Across Text, Speech and Video

Paper • 2412.17847 • Published 16 days ago • 7

shayekh

authored 2 papers 22 days ago

bbOCR: An Open-source Multi-domain OCR Pipeline for Bengali Documents

Paper • 2308.10647 • Published Aug 21, 2023

Maya: An Instruction Finetuned Multilingual Multimodal Model

Paper • 2412.07112 • Published 25 days ago • 25

shivi

authored a paper 26 days ago

Aya Expanse: Combining Research Breakthroughs for a New Multilingual Frontier

Paper • 2412.04261 • Published 30 days ago • 1

scottsuk0306

authored a paper 29 days ago

Evaluating Language Models as Synthetic Data Generators

Paper • 2412.03679 • Published about 1 month ago • 45

shivi

authored 2 papers 29 days ago

INCLUDE: Evaluating Multilingual Language Understanding with Regional Knowledge

Paper • 2411.19799 • Published Nov 29, 2024 • 11

Global MMLU: Understanding and Addressing Cultural and Linguistic Biases in Multilingual Evaluation

Paper • 2412.03304 • Published about 1 month ago • 17

seungone

authored a paper 29 days ago

Evaluating Language Models as Synthetic Data Generators

Paper • 2412.03679 • Published about 1 month ago • 45

shayekh

authored a paper about 1 month ago

INCLUDE: Evaluating Multilingual Language Understanding with Regional Knowledge

Paper • 2411.19799 • Published Nov 29, 2024 • 11

seungone

authored 2 papers 2 months ago

MM-Eval: A Multilingual Meta-Evaluation Benchmark for LLM-as-a-Judge and Reward Models

Paper • 2410.17578 • Published Oct 23, 2024 • 1

Better Instruction-Following Through Minimum Bayes Risk

Paper • 2410.02902 • Published Oct 3, 2024

shayekh

authored 2 papers 2 months ago

MM-Eval: A Multilingual Meta-Evaluation Benchmark for LLM-as-a-Judge and Reward Models

Paper • 2410.17578 • Published Oct 23, 2024 • 1

M-RewardBench: Evaluating Reward Models in Multilingual Settings

Paper • 2410.15522 • Published Oct 20, 2024 • 11

lintang

authored a paper 2 months ago

Pangea: A Fully Open Multilingual Multimodal LLM for 39 Languages

Paper • 2410.16153 • Published Oct 21, 2024 • 44

seungone

authored a paper 2 months ago

Pangea: A Fully Open Multilingual Multimodal LLM for 39 Languages

Paper • 2410.16153 • Published Oct 21, 2024 • 44

hyungjoochae

authored a paper 2 months ago

Web Agents with World Models: Learning and Leveraging Environment Dynamics in Web Navigation

Paper • 2410.13232 • Published Oct 17, 2024 • 41

amphora

updated 3 datasets 3 months ago

AI & ML interests

Recent Activity

Team members 15

multilingual-reward-bench's activity