Update README.md
The accelerated partition is composed of 1,120 nodes with the following specifications […]
---

## How to use

This section offers examples of how to perform inference using various methods.

### Inference

You'll find different techniques for running inference, including Huggingface's Text Generation Pipeline, multi-GPU configurations, and vLLM for scalable and efficient generation.

#### Inference with Huggingface's Text Generation Pipeline

The Huggingface Text Generation Pipeline provides a straightforward way to run inference using the Salamandra-7b model.

```bash
pip install -U transformers
```

[...]

</details>
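The pipeline example itself is elided in this diff, so here is a minimal sketch of the workflow it describes. The repository id `BSC-LT/salamandra-7b`, the prompt, and the generation settings are assumptions for illustration, not taken from this diff:

```python
from transformers import pipeline


def build_generator(model_id: str = "BSC-LT/salamandra-7b"):
    """Create a text-generation pipeline.

    device_map="auto" places the weights on the available GPU(s),
    falling back to CPU when none is present.
    The default model_id is an assumption; substitute the actual repository.
    """
    return pipeline("text-generation", model=model_id, device_map="auto")


if __name__ == "__main__":
    generator = build_generator()
    # Illustrative prompt; any text works.
    outputs = generator("The Mediterranean Sea is", max_new_tokens=50)
    for output in outputs:
        print(output["generated_text"])
```

Running the script downloads the model weights on first use, so expect the initial call to take a while.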
#### Inference with single / multi GPU

This section provides a simple example of how to run inference using Huggingface's AutoModel class.

```bash
pip install transformers torch accelerate sentencepiece protobuf
```
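As a hedged sketch of the AutoModel workflow, after installing the dependencies above: the repository id `BSC-LT/salamandra-7b`, the dtype, and the sampling settings below are illustrative assumptions, not prescribed by this diff.

```python
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

# Assumed repository id; substitute the actual one from the model card.
MODEL_ID = "BSC-LT/salamandra-7b"


def generate(prompt: str, max_new_tokens: int = 50) -> str:
    """Load the model and generate a continuation for `prompt`.

    device_map="auto" shards the model across all visible GPUs (single- or
    multi-GPU) or falls back to CPU; bfloat16 halves memory versus float32.
    """
    tokenizer = AutoTokenizer.from_pretrained(MODEL_ID)
    model = AutoModelForCausalLM.from_pretrained(
        MODEL_ID, device_map="auto", torch_dtype=torch.bfloat16
    )
    inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
    with torch.no_grad():
        output_ids = model.generate(
            **inputs, max_new_tokens=max_new_tokens, do_sample=True, temperature=0.7
        )
    return tokenizer.decode(output_ids[0], skip_special_tokens=True)


if __name__ == "__main__":
    print(generate("The capital of Catalonia is"))
```

With more than one GPU visible, the same script runs unchanged: `device_map="auto"` splits the layers across devices, which is what makes this a single *and* multi-GPU example.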