rinoa

AI & ML interests

None yet

Recent Activity

liked a Space 2 days ago

llamameta/OpenAi-O3-Preview-Mini

liked a Space 2 days ago

llamameta/Google-Gemini-Pro-2-latest-2025

liked a model 2 days ago

prithivMLmods/QwQ-4B-Instruct

Organizations

None yet

rinoa's activity

upvoted a collection 12 days ago

Prompt-collection

1 item • Updated 13 days ago • 1

upvoted a collection 18 days ago

Granite 3.1 Language Models

A series of language models with 128K context length, trained by IBM and licensed under the Apache 2.0 license. • 8 items • Updated 19 days ago • 46

upvoted an article about 1 month ago

Article

🐺🐦‍⬛ LLM Comparison/Test: 25 SOTA LLMs (including QwQ) through 59 MMLU-Pro CS benchmark runs

Dec 4, 2024 • 75

upvoted a paper about 1 month ago

BloombergGPT: A Large Language Model for Finance

Paper • 2303.17564 • Published Mar 30, 2023 • 21

upvoted a collection about 1 month ago

🧠 Reasoning Models

7 items • Updated 6 days ago • 36

upvoted 2 collections about 2 months ago

🍓 Ichigo v0.4

The experimental family designed to train LLMs to understand sound natively. • 2 items • Updated Nov 11, 2024 • 7

OpenCoder

OpenCoder is an open and reproducible code LLM family which matches the performance of top-tier code LLMs. • 8 items • Updated Nov 23, 2024 • 79

upvoted an article 2 months ago

Article

Visually Multilingual: Introducing mcdse-2b

Oct 27, 2024 • 37

upvoted 2 collections 3 months ago

Llama 3.2 3B & 1B GGUF Quants

Llama.cpp compatible quants for Llama 3.2 3B and 1B Instruct models. • 4 items • Updated Sep 26, 2024 • 46

Moshi v0.1 Release

MLX, Candle & PyTorch model checkpoints released as part of the Moshi release from Kyutai. Run inference via: https://github.com/kyutai-labs/moshi • 13 items • Updated Sep 18, 2024 • 225

upvoted a paper 4 months ago

Self-Harmonized Chain of Thought

Paper • 2409.04057 • Published Sep 6, 2024 • 16

upvoted a collection 5 months ago

Jamba-1.5

The AI21 Jamba family of models are state-of-the-art, hybrid SSM-Transformer instruction-following foundation models. • 2 items • Updated Aug 22, 2024 • 83

upvoted an article 5 months ago

Article

Tool Use, Unified

Aug 12, 2024 • 70

upvoted 2 papers 5 months ago

Transformer Explainer: Interactive Learning of Text-Generative Models

Paper • 2408.04619 • Published Aug 8, 2024 • 155

OpenDevin: An Open Platform for AI Software Developers as Generalist Agents

Paper • 2407.16741 • Published Jul 23, 2024 • 68

upvoted a collection 6 months ago

H2O Danube3

7 items • Updated Nov 30, 2024 • 56

upvoted a paper 6 months ago

Agentless: Demystifying LLM-based Software Engineering Agents

Paper • 2407.01489 • Published Jul 1, 2024 • 42

upvoted 2 papers 7 months ago

DeepSeek-Coder-V2: Breaking the Barrier of Closed-Source Models in Code Intelligence

Paper • 2406.11931 • Published Jun 17, 2024 • 58

Chameleon: Mixed-Modal Early-Fusion Foundation Models

Paper • 2405.09818 • Published May 16, 2024 • 126

upvoted a collection 8 months ago

🚀GGUF

Llama.cpp-compatible models that can be used on CPUs and GPUs! • 987 items • Updated about 8 hours ago • 35