Yixin Song's picture

Yixin Song

yixinsong

·

AI & ML interests

None yet

Recent Activity

new activity about 2 hours ago

PowerInfer/SmallThinker-3B-Preview:example use colab?

new activity about 13 hours ago

PowerInfer/SmallThinker-3B-Preview:Update README.md

liked a dataset about 17 hours ago

PowerInfer/LONGCOT-Refine-500K

View all activity

Organizations

yixinsong's activity

New activity in PowerInfer/SmallThinker-3B-Preview about 2 hours ago

example use colab?

#3 opened 1 day ago by

New activity in PowerInfer/SmallThinker-3B-Preview about 13 hours ago

Update README.md

#4 opened about 14 hours ago by

New activity in PowerInfer/SmallThinker-3B-Preview 1 day ago

Training: Second Phase

#2 opened 4 days ago by

New activity in PowerInfer/QWQ-LONGCOT-500K 2 days ago

[bot] Conversion to Parquet

#1 opened 24 days ago by

parquet-converter

New activity in PowerInfer/LONGCOT-Refine-500K 2 days ago

[bot] Conversion to Parquet

#1 opened 3 days ago by

parquet-converter

Librarian Bot: Add language metadata for dataset

#2 opened 3 days ago by

New activity in PowerInfer/SmallThinker-3B-Preview 4 days ago

Evaluation

#1 opened 4 days ago by

New activity in PowerInfer/TurboSparse-Mistral-Instruct 4 months ago

problems about sample strategies

#1 opened 4 months ago by

New activity in yixinsong/persona 4 months ago

[bot] Conversion to Parquet

#1 opened 4 months ago by

parquet-converter

New activity in BAAI/Infinity-Instruct 5 months ago

0729聊天数据集有计划开源吗？

#16 opened 5 months ago by

New activity in HuggingFaceTB/SmolLM-1.7B 6 months ago

MMLU doesn't match on lm-evaluation-harness

#2 opened 6 months ago by

New activity in SparseLLM/relu2-5B 7 months ago

Inference API not working properly. Lack of proper modeling file?

#1 opened 7 months ago by

New activity in SparseLLM/relu-5B 7 months ago

Difference between SparseLLM/relu and SparseLLM/reglu - lack of modeling file?

#1 opened 7 months ago by

commented 3 papers 7 months ago

PowerInfer-2: Fast Large Language Model Inference on a Smartphone

Paper • 2406.06282 • Published Jun 10, 2024 • 36 •

Turbo Sparse: Achieving LLM SOTA Performance with Minimal Activated Parameters

Paper • 2406.05955 • Published Jun 10, 2024 • 22 •

PowerInfer-2: Fast Large Language Model Inference on a Smartphone

Paper • 2406.06282 • Published Jun 10, 2024 • 36 •

New activity in migtissera/Tess-v2.5-Qwen2-72B 7 months ago

Nice work! Do we have plan for opening source the datasets?

#1 opened 7 months ago by

New activity in TIGER-Lab/MMLU-Pro 8 months ago

Script for evaluation?

#7 opened 8 months ago by

New activity in Vezora/Mistral-22B-v0.1 8 months ago

Any update about the merge method?

#8 opened 8 months ago by

commented a paper 9 months ago

The Era of 1-bit LLMs: All Large Language Models are in 1.58 Bits

Paper • 2402.17764 • Published Feb 27, 2024 • 605 •