namgoodfire commited on
Commit
e0ece42
·
verified ·
1 Parent(s): a73b7ee

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +9 -1
README.md CHANGED
@@ -6,4 +6,12 @@ base_model:
6
  - meta-llama/Llama-3.3-70B-Instruct
7
  ---
8
 
9
- ### Model Information
 
 
 
 
 
 
 
 
 
6
  - meta-llama/Llama-3.3-70B-Instruct
7
  ---
8
 
9
+ ### Model Information
10
+
11
+ The Goodfire SAE (Sparse Autoencoder) for Llama 3.3 70B is an interpreter model designed to analyze and understand
12
+ the internal representations of Llama-3.3-70B-Instruct. This SAE model is trained specifically on layer 50 of 
13
+ Llama 3.3 70B and achieves an L0 count of 121, enabling the decomposition of complex neural activations
14
+ into interpretable features. The model is optimized for interpretability tasks and model steering applications,
15
+ allowing researchers and developers to gain insights into the model's internal processing and behavior patterns.
16
+ As an open-source tool, it serves as a foundation for advancing interpretability research and enhancing control
17
+ over large language model operations.