Joseph
Joseph717171
AI & ML interests
None yet
Recent Activity
reacted to tegridydev's post with 🚀 · 1 day ago
So, what is #MechanisticInterpretability 🤔
Mechanistic Interpretability (MI) is the discipline of opening the black box of large language models (and other neural networks) to understand the underlying circuits, features, and mechanisms that give rise to specific behaviours.
Instead of treating a model as a monolithic function, we can:
1. Trace how input tokens propagate through attention heads and MLP layers.
2. Identify localized “circuit motifs”.
3. Systematically break down or “edit” these circuits to confirm we understand the causal structure.
Mechanistic Interpretability aims to yield human-understandable explanations of how advanced models represent and manipulate concepts, which hopefully leads to:
1. Trust & Reliability
2. Safety & Alignment
3. Better Debugging / Development Insights
https://bsky.app/profile/mechanistics.bsky.social/post/3lgvvv72uls2x
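The "break down or edit circuits to confirm causal structure" step above is usually done via ablation or activation patching. A minimal sketch of the idea, using a hypothetical two-unit toy network rather than a real transformer (the weights, the zero-ablation choice, and the unit wiring are all assumptions for illustration):

```python
import numpy as np

# Toy "model": one linear hidden layer. By construction, only hidden
# unit 0 is wired to the output, so zero-ablating unit 0 should destroy
# the behaviour while ablating unit 1 should leave it untouched.
# This mirrors the MI workflow: hypothesize a circuit, intervene on it,
# and check that the causal effect matches the hypothesis.
rng = np.random.default_rng(0)

W_in = np.array([[1.0, 0.0],
                 [0.0, 1.0]])      # input -> hidden
W_out = np.array([[1.0],
                  [0.0]])          # hidden -> output: only unit 0 matters

def forward(x, ablate_unit=None):
    h = x @ W_in
    if ablate_unit is not None:
        h = h.copy()
        h[:, ablate_unit] = 0.0    # zero-ablate one hidden unit
    return h @ W_out

x = rng.normal(size=(8, 2))
baseline = forward(x)

# Ablate each hidden unit in turn and measure the causal effect on the output.
for unit in range(2):
    patched = forward(x, ablate_unit=unit)
    effect = np.abs(baseline - patched).mean()
    print(f"unit {unit}: mean |delta output| = {effect:.3f}")
```

Running this shows a large effect for unit 0 and zero effect for unit 1, confirming the hypothesized circuit. In practice the same intervene-and-measure loop is applied to attention heads and MLP neurons inside real models, often patching in activations from a counterfactual prompt instead of zeros.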
reacted to tegridydev's post with 👀 · 1 day ago
liked a model · 1 day ago
bartowski/Virtuoso-Lite-GGUF
Organizations
Joseph717171's activity
Nice! Any chance we can have access to the unquantized model files?
2
#1 opened 13 days ago by Joseph717171
Awesome work, Undi95! This looks great!
1
#1 opened 22 days ago by Joseph717171
Great Model Base for ERP!
#1 opened 26 days ago by Joseph717171
New activity in Joseph717171/Llama-3.1-SuperNova-Lite-8.0B-OQ8_0-F32.EF32.IQ4_K-Q8_0-GGUF · about 1 month ago
Good for summarization.
6
#1 opened 2 months ago by ThiloteE
Rejected and No Way of Resubmitting?
18
#13 opened about 2 months ago by BuildBackBuehler
Release of an 8B model?
3
#11 opened about 2 months ago by Joseph717171
Update SuperNova-Medius with a merge with Qwen/Qwen2.5-Coder-14B-Instruct + Further Training 😋
11
#12 opened 3 months ago by Joseph717171
Wicked cool experiment!
3
#1 opened 2 months ago by Joseph717171
Best prose in a model I've ever seen.
5
#5 opened 3 months ago by Dsol58
New activity in Joseph717171/Hermes-3-Llama-3.1-8B_TIES_with_Base_Embeds_Initialized_to_Special_Instruct_Toks_dtypeF32 · 3 months ago
This LLM is hallucinating like crazy. Can someone verify these prompts?
28
#3 opened 4 months ago by phil111
Ideal quantization levels
2
#6 opened 4 months ago by jadbox
That was fast!
3
#1 opened 4 months ago by rollercoasterX
different Q4 models
1
#1 opened 4 months ago by animax
what is your "continuous finetuning"
7
#2 opened 4 months ago by MaziyarPanahi
Explain these Benchmark Results
2
#2 opened 4 months ago by Joseph717171
Distill Llama-3.2-1B-Instruct from Llama-405B-Instruct to make SuperNova-Pico
1
#14 opened 4 months ago by Joseph717171
Paper? 👀
1
#1 opened 4 months ago by Joseph717171
This repo revision has at least one file that has been marked as unsafe.
2
#11 opened 5 months ago by MayensGuds
Why is the tokenizer.json not the same as LLaMa-3.1-8B-Instruct
1
#6 opened 5 months ago by Joseph717171