Update README.md
README.md
CHANGED
@@ -39,44 +39,6 @@ The model is designed to respond to general instructions and can be used to buil
 * Multilingual dialog use cases
 * Long-context tasks including long document/meeting summarization, long document QA, etc.

-**Generation:**
-This is a simple example of how to use the Granite-3.1-8B-Instruct model.
-
-Install the following libraries:
-
-```shell
-pip install torch torchvision torchaudio
-pip install accelerate
-pip install transformers
-```
-Then, copy the snippet from the section that is relevant for your use case.
-
-```python
-import torch
-from transformers import AutoModelForCausalLM, AutoTokenizer
-
-device = "cuda"  # use "cpu" if no GPU is available
-model_path = "ibm-granite/granite-3.1-8b-instruct"
-tokenizer = AutoTokenizer.from_pretrained(model_path)
-# drop device_map if running on CPU
-model = AutoModelForCausalLM.from_pretrained(model_path, device_map=device)
-model.eval()
-# change input text as desired
-chat = [
-    { "role": "user", "content": "Please list one IBM Research laboratory located in the United States. You should only output its name and location." },
-]
-chat = tokenizer.apply_chat_template(chat, tokenize=False, add_generation_prompt=True)
-# tokenize the text
-input_tokens = tokenizer(chat, return_tensors="pt").to(device)
-# generate output tokens
-output = model.generate(**input_tokens, max_new_tokens=100)
-# decode output tokens into text
-output = tokenizer.batch_decode(output)
-# print output
-print(output)
-```
-
 **Model Architecture:**
 Granite-3.1-8B-Instruct is based on a decoder-only dense transformer architecture. Core components of this architecture are: GQA and RoPE, MLP with SwiGLU, RMSNorm, and shared input/output embeddings.
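The architecture components named in the README (RMSNorm and an MLP with SwiGLU) can be illustrated numerically. The sketch below is an assumption-laden toy, not the model's actual implementation: the dimensions and weights are made up, and real Granite layers use learned parameters at much larger sizes.

```python
import numpy as np

def rms_norm(x, gain, eps=1e-6):
    # RMSNorm: rescale by the reciprocal root-mean-square (no mean subtraction,
    # unlike LayerNorm), then apply a learned per-channel gain
    return x / np.sqrt(np.mean(x**2, axis=-1, keepdims=True) + eps) * gain

def silu(x):
    # SiLU (swish) activation: x * sigmoid(x)
    return x / (1.0 + np.exp(-x))

def swiglu_mlp(x, w_gate, w_up, w_down):
    # SwiGLU MLP: a SiLU-activated gate multiplies the up-projection
    # elementwise before the down-projection back to model width
    return (silu(x @ w_gate) * (x @ w_up)) @ w_down

rng = np.random.default_rng(0)
d_model, d_ff = 8, 32  # toy sizes, not Granite's
x = rng.standard_normal((1, d_model))
h = rms_norm(x, gain=np.ones(d_model))
y = swiglu_mlp(h,
               rng.standard_normal((d_model, d_ff)),
               rng.standard_normal((d_model, d_ff)),
               rng.standard_normal((d_ff, d_model)))
print(y.shape)  # → (1, 8): the block maps back to model width
```

In the full model each such block sits inside a residual connection, with RMSNorm applied before attention and before the MLP.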