[ERROR] safetensors_rust.SafetensorError: Error while deserializing header: MetadataIncompleteBuffer

#3
by luckychao - opened

Hi there :)

[root] Loading local model /mnt/petrelfs/share_data/quxiaoye/models/QVQ-72B-Preview
`Qwen2VLRotaryEmbedding` can now be fully parameterized by passing the model config through the `config` argument. All other arguments will be removed in v4.46

Loading checkpoint shards:   0%|          | 0/38 [00:00<?, ?it/s]
Loading checkpoint shards:   3%|β–Ž         | 1/38 [00:04<02:37,  4.26s/it]
Traceback (most recent call last):
...
  File "generate_response.py", line 57, in main
    model = qwen.Qwen_Model(args.model_path, temperature=args.temperature, max_tokens=args.max_tokens)
  File "qwen.py", line 58, in __init__
    self.model = Qwen2VLForConditionalGeneration.from_pretrained(self.model_path, torch_dtype=torch.bfloat16,
  File "/lib/python3.9/site-packages/transformers/modeling_utils.py", line 4225, in from_pretrained
    ) = cls._load_pretrained_model(
  File "lib/python3.9/site-packages/transformers/modeling_utils.py", line 4706, in _load_pretrained_model
    state_dict = load_state_dict(
  File "lib/python3.9/site-packages/transformers/modeling_utils.py", line 555, in load_state_dict
    with safe_open(checkpoint_file, framework="pt") as f:
safetensors_rust.SafetensorError: Error while deserializing header: MetadataIncompleteBuffer

This error occurs when loading the QVQ-72B model with Transformers. My code and environment load the qwen2-vl-72b model without any problem.

Environment:

python 3.9
torch 2.4.0+cu118
transformers 4.46.1

Thanks for the reply!

I just put your error into Claude. Try this:

Based on the error message, it seems you're encountering an issue while loading the QVQ-72B-Preview model, specifically a SafetensorError related to an incomplete metadata buffer. Let me help you troubleshoot this.

There are a few potential causes and solutions:

  1. Incomplete or corrupted download:

    • The error suggests that the safetensors file might be corrupted or incompletely downloaded
    • Try removing the downloaded model files and re-downloading them
    • Verify the checksums of the downloaded files if they're provided
  2. Memory issues:

    • Loading a 72B parameter model requires significant memory
    • Try increasing your system's swap space
    • Consider using model sharding or loading in 8-bit precision if available
  3. Version compatibility:

    • The warning about Qwen2VLRotaryEmbedding suggests you might be using an older version of the library
    • Try updating your transformers library:
    pip install --upgrade transformers
    
  4. File permissions:

    • Check if your process has proper read permissions for the model files
    • Verify the ownership and permissions of the files in /mnt/petrelfs/share_data/quxiaoye/models/QVQ-72B-Preview
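On point 1, you can find out which shard is truncated without re-downloading everything. The `MetadataIncompleteBuffer` error fires when a file is shorter than its header claims, so you can check that invariant directly with the standard library. A sketch, assuming the documented safetensors layout (an 8-byte little-endian header length, a JSON header of that length, then tensor data addressed by each entry's `data_offsets`):

```python
import json
import struct
import sys
from pathlib import Path

def check_safetensors(path: Path):
    """Return None if the shard looks intact, else a description of the problem.

    Checks the invariant behind MetadataIncompleteBuffer: the file must hold
    the 8-byte length prefix, the full JSON header, and every byte range the
    header's data_offsets point into.
    """
    size = path.stat().st_size
    with path.open("rb") as f:
        prefix = f.read(8)
        if len(prefix) < 8:
            return "shorter than the 8-byte header-length prefix"
        (header_len,) = struct.unpack("<Q", prefix)
        if 8 + header_len > size:
            return "header runs past end of file (truncated download?)"
        header = json.loads(f.read(header_len))
    # Largest byte offset any tensor claims to occupy in the data section.
    data_end = max((entry["data_offsets"][1]
                    for name, entry in header.items()
                    if name != "__metadata__"), default=0)
    if 8 + header_len + data_end > size:
        return (f"data section truncated: header expects at least "
                f"{8 + header_len + data_end} bytes, file has {size}")
    return None

if __name__ == "__main__" and len(sys.argv) > 1:
    for shard in sorted(Path(sys.argv[1]).glob("*.safetensors")):
        problem = check_safetensors(shard)
        print(f"{shard.name}: {'OK' if problem is None else problem}")
```

Run it with the model directory as the argument; any shard that doesn't report OK is the one to re-download.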

Could you try these steps and let me know:

  1. Which version of transformers are you using?
  2. Do you have enough system memory available?
  3. Are all the model shard files present in the directory?
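For question 2, a quick back-of-the-envelope check of the weight memory alone (a lower bound, since it ignores activations and the KV cache) shows the scale involved and why 8-bit loading halves the footprint:

```python
def weight_memory_gib(n_params: float, bytes_per_param: int) -> float:
    """Lower bound on model memory: weights only, ignoring activations,
    optimizer state, and the KV cache."""
    return n_params * bytes_per_param / 2**30

N = 72e9  # 72B parameters
print(f"bf16 (2 bytes/param): {weight_memory_gib(N, 2):.0f} GiB")  # 134 GiB
print(f"int8 (1 byte/param):  {weight_memory_gib(N, 1):.0f} GiB")  # 67 GiB
```

Since qwen2-vl-72b loads fine in the same environment, memory is probably not the culprit here, but the numbers are worth keeping in mind.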

Anyway, see if this helps.


Thanks for your reply!!
I verified the checksums of the downloaded files and they are fine. Memory is also sufficient, since qwen2-vl-72b loads in the same environment.
I suspect the problem is the transformers version. Has anyone been able to load this model successfully? If so, could you share which transformers version you used? Thanks a lot!!
