MiniMaxAI
/

MiniMax-Text-01

Text Generation

minimax_text_01

Model card Files Files and versions Community

MiniMax-AI commited on about 17 hours ago

Commit

4ca1b78

·

1 Parent(s): 372fb1d

Fix world_size in QuickStart Code

Files changed (1) hide show

README.md +4 -4

README.md CHANGED Viewed

@@ -170,15 +170,15 @@ quantization_config =  QuantoConfig(
             + [f"model.layers.{i}.block_sparse_moe.gate" for i in range(hf_config.num_hidden_layers)]
         )
 # set device map
 device_map = {
     'model.embed_tokens': 'cuda:0',
     'model.norm': f'cuda:{world_size - 1}',
     'lm_head': f'cuda:{world_size - 1}'
 }
-# assume 8 GPUs
-world_size = 8
-layers_per_device = hf_config.num_hidden_layers // world_size
 for i in range(world_size):
     for j in range(layers_per_device):
         device_map[f'model.layers.{i * layers_per_device + j}'] = f'cuda:{i}'
@@ -239,4 +239,4 @@ response = tokenizer.batch_decode(generated_ids, skip_special_tokens=True)[0]
 ## 6. Chatbot & API
 For general use and evaluation, we provide a [Chatbot](https://www.hailuo.ai/) with online search capabilities and the [online API](https://intl.minimaxi.com) for developers.
-Contact us at [[email protected]](mailto:[email protected]).

             + [f"model.layers.{i}.block_sparse_moe.gate" for i in range(hf_config.num_hidden_layers)]
         )
+# assume 8 GPUs
+world_size = 8
+layers_per_device = hf_config.num_hidden_layers // world_size
 # set device map
 device_map = {
     'model.embed_tokens': 'cuda:0',
     'model.norm': f'cuda:{world_size - 1}',
     'lm_head': f'cuda:{world_size - 1}'
 }
 for i in range(world_size):
     for j in range(layers_per_device):
         device_map[f'model.layers.{i * layers_per_device + j}'] = f'cuda:{i}'
 ## 6. Chatbot & API
 For general use and evaluation, we provide a [Chatbot](https://www.hailuo.ai/) with online search capabilities and the [online API](https://intl.minimaxi.com) for developers.
+Contact us at [[email protected]](mailto:[email protected]).