Evan

evdcush

AI & ML interests

None yet

Recent Activity

liked a model 1 day ago

deepseek-ai/DeepSeek-V3-Base

liked a model 1 day ago

deepseek-ai/DeepSeek-V3

liked a model 1 day ago

StephanST/WALDO30

evdcush's activity

upvoted 2 papers 22 days ago

I Don't Know: Explicit Modeling of Uncertainty with an [IDK] Token

Paper • 2412.06676 • Published 28 days ago • 9

POINTS1.5: Building a Vision-Language Model towards Real World Applications

Paper • 2412.08443 • Published 26 days ago • 38

upvoted 4 collections 23 days ago

Open LLM Leaderboard best models ❤️‍🔥

A daily uploaded list of the models with the best evaluations on the LLM leaderboard • 64 items • Updated 13 minutes ago • 497

Gemma 2 JPN Release

A Gemma 2 2B model fine-tuned on Japanese text. It supports Japanese at the same level of performance as English-only queries on Gemma 2. • 3 items • Updated 24 days ago • 26

PaliGemma 2 Release

Vision-Language Models available in multiple 3B, 10B and 28B variants. • 23 items • Updated 24 days ago • 123

Meta Motivo

A first-of-its-kind behavioral foundation model to control a virtual physics-based humanoid agent for a wide range of whole-body tasks. • 6 items • Updated 27 days ago • 9

upvoted a collection 25 days ago

Llama 3.3

This collection hosts the transformers and original repos of the Llama 3.3 • 1 item • Updated about 1 month ago • 102

upvoted a paper about 2 months ago

RedPajama: an Open Dataset for Training Large Language Models

Paper • 2411.12372 • Published Nov 19, 2024 • 48

upvoted 3 collections about 2 months ago

Llama-3.1-Nemotron-70B

SOTA models on Arena Hard and RewardBench as of 1 Oct 2024. • 6 items • Updated about 12 hours ago • 149

Qwen2.5-Coder

Code-specific model series based on Qwen2.5 • 40 items • Updated Nov 28, 2024 • 259

Common Corpus

Largest multilingual pretraining data. • 1 item • Updated Nov 13, 2024 • 8

upvoted a paper about 2 months ago

Add-it: Training-Free Object Insertion in Images With Pretrained Diffusion Models

Paper • 2411.07232 • Published Nov 11, 2024 • 63

upvoted a collection about 2 months ago

LLM2CLIP

LLM2CLIP makes SOTA pretrained CLIP models more SOTA than ever. • 10 items • Updated 26 days ago • 50

upvoted a paper about 2 months ago

LLM2CLIP: Powerful Language Model Unlock Richer Visual Representation

Paper • 2411.04997 • Published Nov 7, 2024 • 37

upvoted a collection about 2 months ago

OpenCoder

OpenCoder is an open and reproducible code LLM family which matches the performance of top-tier code LLMs. • 8 items • Updated Nov 23, 2024 • 79

upvoted a paper about 2 months ago

Training Language Models to Self-Correct via Reinforcement Learning

Paper • 2409.12917 • Published Sep 19, 2024 • 136

upvoted a collection 3 months ago

Llama 3.2

This collection hosts the transformers and original repos of the Llama 3.2 and Llama Guard 3 • 15 items • Updated about 1 month ago • 551

upvoted a collection 4 months ago

Moshi v0.1 Release

MLX, Candle & PyTorch model checkpoints released as part of the Moshi release from Kyutai. Run inference via: https://github.com/kyutai-labs/moshi • 13 items • Updated Sep 18, 2024 • 225

upvoted 2 papers 4 months ago

Source2Synth: Synthetic Data Generation and Curation Grounded in Real Data Sources

Paper • 2409.08239 • Published Sep 12, 2024 • 16

Can LLMs Generate Novel Research Ideas? A Large-Scale Human Study with 100+ NLP Researchers

Paper • 2409.04109 • Published Sep 6, 2024 • 43