justheuristic committed `f23d189` (parent: `636b67f`): Update README.md

README.md CHANGED

@@ -1,3 +1,8 @@
+Note: this model was superseded by the [`load_in_8bit=True` feature in transformers](https://github.com/huggingface/transformers/pull/17901)
+by Younes Belkada and Tim Dettmers. Please see [this usage example](https://colab.research.google.com/drive/1qOjXfQIAULfKvZqwCen8-MoWKGdSatZ4#scrollTo=W8tQtyjp75O).
+This legacy model was built for [transformers v4.15.0](https://github.com/huggingface/transformers/releases/tag/v4.15.0) and PyTorch 1.11. Newer versions could work, but are not supported.
+
+
### Quantized EleutherAI/gpt-j-6b with 8-bit weights

This is a version of EleutherAI's GPT-J with 6 billion parameters that is modified so you can generate **and fine-tune the model in Colab or on an equivalent desktop GPU (e.g. a single 1080Ti)**.
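The point of storing weights in 8 bits is to cut memory roughly fourfold versus fp32 while keeping a float scale to recover approximate values. As a rough, self-contained sketch of the absmax quantization idea — an illustration only, not necessarily the exact scheme used by this model or by `load_in_8bit`:

```python
# Minimal sketch of absmax 8-bit weight quantization (hypothetical example):
# store int8 codes plus one float scale per row, dequantize on the fly.

def quantize_8bit(row):
    """Map a list of floats to (int8 codes, scale) using absmax scaling."""
    scale = max(abs(x) for x in row) / 127 or 1.0  # avoid 0 scale for all-zero rows
    codes = [round(x / scale) for x in row]        # each code fits in int8
    return codes, scale

def dequantize_8bit(codes, scale):
    """Recover approximate float weights from int8 codes and the scale."""
    return [c * scale for c in codes]

weights = [0.5, -1.0, 0.25, 0.0]
codes, scale = quantize_8bit(weights)
restored = dequantize_8bit(codes, scale)
# Each restored weight is within one quantization step of the original.
assert all(abs(a - b) <= scale for a, b in zip(weights, restored))
```

Real implementations (e.g. bitsandbytes) operate on tensors and handle outlier features separately; this only shows the basic round-trip.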