COAT: Compressing Optimizer States and Activations for Memory-Efficient FP8 Training Paper • 2410.19313 • Published Oct 25, 2024
On Memorization of Large Language Models in Logical Reasoning Paper • 2410.23123 • Published Oct 30, 2024
Qwen2.5 Collection Qwen2.5 language models, comprising pretrained and instruction-tuned variants in 7 sizes: 0.5B, 1.5B, 3B, 7B, 14B, 32B, and 72B. • 45 items • Updated Nov 28, 2024
Xwin-LM: Strong and Scalable Alignment Practice for LLMs Paper • 2405.20335 • Published May 30, 2024
Common 7B Language Models Already Possess Strong Math Capabilities Paper • 2403.04706 • Published Mar 7, 2024