忍者

byteprobe

AI & ML interests

RL | NLP | LLM | LMM | agent

Recent Activity

reacted to Severian's post with 🚀 1 day ago

GraphRAG-Ollama-UI I've been working on a local version of Microsoft's GraphRAG that uses Ollama for everything. It's got a new interactive UI built with Gradio that makes it easier to manage data, run queries, and visualize results. It's not fully featured or set up to harness the entire GraphRAG library yet but it allows you to run all the standard commands for Indexing/Processing and chatting with your graph. Some key features: Uses local models via Ollama for LLM and embeddings 3D graph visualization of the knowledge graph using Plotly File management through the UI (upload, view, edit, delete) Settings management in the interface Real-time logging for debugging https://github.com/severian42/GraphRAG-Ollama-UI

upvoted a paper 1 day ago

Next Token Prediction Towards Multimodal Intelligence: A Comprehensive Survey

upvoted a paper 1 day ago

1.58-bit FLUX

View all activity

Organizations

byteprobe's activity

upvoted 3 papers 1 day ago

upvoted 2 collections 1 day ago

Reasoning Datasets

Collection

Reasoning datasets that are trending 🔥 • 10 items • Updated 3 days ago • 17

OLMo 2

Collection

Artifacts for the second set of OLMo models. • 20 items • Updated 3 days ago • 66

upvoted a paper 1 day ago

2 OLMo 2 Furious

Paper • 2501.00656 • Published 5 days ago • 12

upvoted an article 2 days ago

Article

🐺🐦‍⬛ LLM Comparison/Test: DeepSeek-V3, QVQ-72B-Preview, Falcon3 10B, Llama 3.3 70B, Nemotron 70B in my updated MMLU-Pro CS benchmark

•

4 days ago

• 30

upvoted an article 6 days ago

Article

🐺🐦‍⬛ LLM Comparison/Test: 25 SOTA LLMs (including QwQ) through 59 MMLU-Pro CS benchmark runs

•

Dec 4, 2024

• 75

upvoted a collection 6 days ago

Common Models

Collection

The first generation of models pretrained on Common Corpus. • 5 items • Updated Dec 5, 2024 • 28

upvoted an article 6 days ago

Article

They Said It Couldn’t Be Done

•

Dec 5, 2024

• 76

upvoted 10 papers 7 days ago

LoftQ: LoRA-Fine-Tuning-Aware Quantization for Large Language Models

Paper • 2310.08659 • Published Oct 12, 2023 • 25

Smarter, Better, Faster, Longer: A Modern Bidirectional Encoder for Fast, Memory Efficient, and Long Context Finetuning and Inference

Paper • 2412.13663 • Published 19 days ago • 117

Are Your LLMs Capable of Stable Reasoning?

Paper • 2412.13147 • Published 20 days ago • 91

Compressed Chain of Thought: Efficient Reasoning Through Dense Representations

Paper • 2412.13171 • Published 20 days ago • 31

GenEx: Generating an Explorable World

Paper • 2412.09624 • Published 25 days ago • 87

TheAgentCompany: Benchmarking LLM Agents on Consequential Real World Tasks

Paper • 2412.14161 • Published 19 days ago • 48

Large Action Models: From Inception to Implementation

Paper • 2412.10047 • Published 24 days ago • 31

Apollo: An Exploration of Video Understanding in Large Multimodal Models

Paper • 2412.10360 • Published 24 days ago • 136

Byte Latent Transformer: Patches Scale Better Than Tokens

Paper • 2412.09871 • Published 24 days ago • 83

Qwen2.5 Technical Report

Paper • 2412.15115 • Published 18 days ago • 336