namgoodfire
commited on
Update README.md
Browse files
README.md
CHANGED
@@ -6,4 +6,12 @@ base_model:
|
|
6 |
- meta-llama/Llama-3.3-70B-Instruct
|
7 |
---
|
8 |
|
9 |
-
### Model Information
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
6 |
- meta-llama/Llama-3.3-70B-Instruct
|
7 |
---
|
8 |
|
9 |
+
### Model Information
|
10 |
+
|
11 |
+
The Goodfire SAE (Sparse Autoencoder) for Llama 3.3 70B is an interpreter model designed to analyze and understand
|
12 |
+
the internal representations of Llama-3.3-70B-Instruct. This SAE model is trained specifically on layer 50 of
|
13 |
+
Llama 3.3 70B and achieves an L0 count of 121, enabling the decomposition of complex neural activations
|
14 |
+
into interpretable features. The model is optimized for interpretability tasks and model steering applications,
|
15 |
+
allowing researchers and developers to gain insights into the model's internal processing and behavior patterns.
|
16 |
+
As an open-source tool, it serves as a foundation for advancing interpretability research and enhancing control
|
17 |
+
over large language model operations.
|