忍者's picture

忍者

byteprobe

AI & ML interests

RL | NLP | LLM | LMM | agent

Recent Activity

Organizations

LocalLLaMA's profile picture MLX Community's profile picture Hugging Face 1Bit LLMs's profile picture Hugging Face Discord Community's profile picture open/ acc's profile picture

byteprobe's activity

upvoted an article 2 days ago
view article
Article

πŸΊπŸ¦β€β¬› LLM Comparison/Test: DeepSeek-V3, QVQ-72B-Preview, Falcon3 10B, Llama 3.3 70B, Nemotron 70B in my updated MMLU-Pro CS benchmark

By wolfram β€’
β€’ 30
upvoted an article 6 days ago
view article
Article

πŸΊπŸ¦β€β¬› LLM Comparison/Test: 25 SOTA LLMs (including QwQ) through 59 MMLU-Pro CS benchmark runs

By wolfram β€’
β€’ 75
upvoted an article 6 days ago