michaelfeil
committed
Update README.md
README.md CHANGED
@@ -19,7 +19,7 @@ This model extends LLama-3 8B's context length from 8k to > 160K, developed by G
 
 **Infra:**
 
-We build on top of the EasyContext Blockwise RingAttention library [3] to scalably and efficiently train on contexts up to
+We build on top of the EasyContext Blockwise RingAttention library [3] to scalably and efficiently train on contexts up to 262144 tokens on [Crusoe Energy](https://huggingface.co/crusoeai) high performance L40S cluster.
 
 **Data:**
 
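For context on the Blockwise RingAttention approach referenced in the changed line: the key idea is that no single device ever materializes attention scores over the full 262144-token sequence; key/value blocks are streamed (around a ring of devices in the distributed case) and attention is accumulated with an online softmax. The sketch below is a minimal single-process, non-causal NumPy illustration of that accumulation step; it is an assumption-laden teaching aid, not the EasyContext API or the training code used here.

```python
import numpy as np

def blockwise_attention(q, k, v, block_size):
    """Non-causal attention computed over key/value blocks with online-softmax accumulation."""
    seq_len, dim = q.shape
    scale = 1.0 / np.sqrt(dim)
    out = np.zeros((seq_len, v.shape[1]))
    running_max = np.full((seq_len, 1), -np.inf)
    running_sum = np.zeros((seq_len, 1))

    for start in range(0, k.shape[0], block_size):
        kb = k[start:start + block_size]   # the key block "arriving on the ring"
        vb = v[start:start + block_size]
        scores = (q @ kb.T) * scale        # (seq_len, block_size); the full score matrix is never built
        block_max = scores.max(axis=-1, keepdims=True)
        new_max = np.maximum(running_max, block_max)
        correction = np.exp(running_max - new_max)   # rescale earlier blocks to the new running max
        probs = np.exp(scores - new_max)
        out = out * correction + probs @ vb
        running_sum = running_sum * correction + probs.sum(axis=-1, keepdims=True)
        running_max = new_max

    return out / running_sum

# Sanity check against full (materialized) attention on a toy sequence.
rng = np.random.default_rng(0)
q, k, v = (rng.standard_normal((64, 16)) for _ in range(3))
scores = (q @ k.T) / np.sqrt(16)
weights = np.exp(scores - scores.max(axis=-1, keepdims=True))
reference = (weights / weights.sum(axis=-1, keepdims=True)) @ v
assert np.allclose(blockwise_attention(q, k, v, block_size=8), reference)
```

In the distributed ring-parallel setting, each rank would hold one query block and the loop body would correspond to receiving the next key/value block from its neighbor, which is what allows training on context lengths far beyond a single device's memory.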