Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
9
87
587
Anthonny OLIME
Citaman
Follow
KnutJaegersberg's profile picture
BK-Lee's profile picture
ltim's profile picture
11 followers
·
83 following
Citaman
AI & ML interests
None yet
Recent Activity
updated
a collection
1 day ago
Keep in Mind's Model
updated
a collection
3 days ago
omni models
upvoted
a
paper
3 days ago
Demons in the Detail: On Implementing Load Balancing Loss for Training Specialized Mixture-of-Expert Models
View all activity
Organizations
Citaman
's activity
All
Models
Datasets
Spaces
Papers
Collections
Community
Posts
Upvotes
Likes
liked
12 models
4 days ago
m-a-p/YuE-s1-7B-anneal-en-cot
Text Generation
•
Updated
1 day ago
•
7.01k
•
275
meta-llama/Llama-3.3-70B-Instruct
Text Generation
•
Updated
Dec 21, 2024
•
604k
•
•
1.81k
jinaai/ReaderLM-v2
Text Generation
•
Updated
10 days ago
•
21.5k
•
458
HuggingFaceTB/SmolVLM-256M-Instruct
Image-Text-to-Text
•
Updated
8 days ago
•
12.5k
•
112
Qwen/Qwen2.5-7B-Instruct-1M
Text Generation
•
Updated
2 days ago
•
17.6k
•
155
Qwen/Qwen2.5-VL-72B-Instruct
Image-Text-to-Text
•
Updated
4 days ago
•
10.8k
•
175
Qwen/Qwen2.5-VL-7B-Instruct
Image-Text-to-Text
•
Updated
4 days ago
•
60.7k
•
241
Qwen/Qwen2.5-VL-3B-Instruct
Image-Text-to-Text
•
Updated
4 days ago
•
31.2k
•
118
unsloth/DeepSeek-R1-GGUF
Text Generation
•
Updated
about 21 hours ago
•
227k
•
411
deepseek-ai/Janus-Pro-7B
Any-to-Any
•
Updated
4 days ago
•
97.5k
•
2.2k
deepseek-ai/Janus-Pro-1B
Any-to-Any
•
Updated
4 days ago
•
22.7k
•
295
THUDM/glm-4-9b-chat-1m-hf
Text Generation
•
Updated
5 days ago
•
81
•
6
liked
a dataset
4 days ago
THUDM/T1
Viewer
•
Updated
11 days ago
•
10k
•
24
•
2
liked
2 models
5 days ago
baichuan-inc/Baichuan-M1-14B-Instruct
Updated
7 days ago
•
12.7k
•
27
baichuan-inc/Baichuan-Omni-1d5
Updated
6 days ago
•
86
•
28
liked
a model
7 days ago
deepseek-ai/DeepSeek-V3
Text Generation
•
Updated
7 days ago
•
804k
•
•
2.94k
liked
a model
8 days ago
prithivMLmods/SmolLM2-CoT-360M
Text Generation
•
Updated
26 days ago
•
692
•
14
liked
2 models
12 days ago
deepseek-ai/DeepSeek-R1-Zero
Text Generation
•
Updated
6 days ago
•
18k
•
644
deepseek-ai/DeepSeek-R1
Text Generation
•
Updated
6 days ago
•
674k
•
•
5.59k
liked
a model
18 days ago
openbmb/MiniCPM-o-2_6
Any-to-Any
•
Updated
5 days ago
•
191k
•
892
Load more