Haihao Shen

Haihao

AI & ML interests

LLM quantization, sparsity, and acceleration

Organizations

Intel, Need4Speed, Qwen, Open Platform for Enterprise AI

Haihao's activity

reacted to wenhuach's post with 🚀 14 days ago
This week, OPEA Space released several new INT4 models, including:
nvidia/Llama-3.1-Nemotron-70B-Instruct-HF
allenai/OLMo-2-1124-13B-Instruct
THUDM/glm-4v-9b
AIDC-AI/Marco-o1
and several others.
Let us know which models you'd like prioritized for quantization, and we'll do our best to make it happen!

https://huggingface.co/OPEA
New activity in Intel/neural-chat-7b-v3 about 2 months ago
New activity in Intel/neural-chat-7b-v3-3 about 2 months ago
upvoted an article 4 months ago

Building Cost-Efficient Enterprise RAG applications with Intel Gaudi 2 and Intel Xeon

upvoted an article 7 months ago

Accelerate StarCoder with 🤗 Optimum Intel on Xeon: Q8/Q4 and Speculative Decoding
