Boren Hong

hongb2

hongb2's activity

upvoted a collection 5 months ago

LLMs

370 items • Updated 1 day ago • 25

upvoted 2 papers 5 months ago

WildBench: Benchmarking LLMs with Challenging Tasks from Real Users in the Wild

Paper • 2406.04770 • Published Jun 7, 2024 • 28

WildVis: Open Source Visualizer for Million-Scale Chat Logs in the Wild

Paper • 2409.03753 • Published Sep 5, 2024 • 19