arxiv:2412.20993
Hao Zhang
zhisbug
AI & ML interests
None yet
Recent Activity
authored a paper 4 days ago
LightSeq: Sequence Level Parallelism for Distributed Training of Long Context Transformers
authored a paper 4 days ago
Online Speculative Decoding
authored a paper 4 days ago
Judging LLM-as-a-judge with MT-Bench and Chatbot Arena
Organizations
Papers: 18
models
None public yet
datasets
None public yet