How to see which version of Transformers library is needed to get access to this model

#3
by aero-artem - opened

I am using version transformers==4.41.0 and get this error:

E               ValueError: The checkpoint you are trying to load has model type `modernbert` but Transformers does not recognize this architecture. This could be because of an issue with the checkpoint, or because your version of Transformers is out of date.

Is latest version required? Can't use it with my existing setup.

Thank you!

Answer.AI org

Hey!

As of right now, ModernBERT requires transformers to be installed from the (stable) main branch of the transformers repository. After the next transformers release (4.48.x), it will be supported in the python package available everywhere.

Sadly it can't be used on older versions of transformers, as it's a whole new model type internally so it needs the new code, sorry about that.

For anyone wondering run the following command:

pip install git+https://github.com/huggingface/transformers

running o

For anyone wondering run the following command:

pip install git+https://github.com/huggingface/transformers

tried to run with pip install git+https://github.com/huggingface/transformers on python 3.9
lib/python3.9/site-packages/transformers/utils/import_utils.py", line 1805, in _get_module
raise RuntimeError(
RuntimeError: Failed to import transformers.models.modernbert.modeling_modernbert because of the following error (look up to see its traceback):
unsupported operand type(s) for |: '_GenericAlias' and '_GenericAlias'

simple script

from transformers import AutoTokenizer, AutoModelForMaskedLM

tokenizer = AutoTokenizer.from_pretrained("answerdotai/ModernBERT-base")
model = AutoModelForMaskedLM.from_pretrained("answerdotai/ModernBERT-base")
input_text = "He walked to the [MASK]."
print(model(input_text))

Answer.AI org

I believe this is due to Python 3.9. Using a newer version should fix it.
As a workaround, you can add from __future__ import annotations to the top of transformers/models/modernbert/modeling_modernbert.py but I wouldn't go that route.

  • Tom Aarsen

I believe this is due to Python 3.9. Using a newer version should fix it.
As a workaround, you can add from __future__ import annotations to the top of transformers/models/modernbert/modeling_modernbert.py but I wouldn't go that route.

  • Tom Aarsen
    # Check only the first and last input IDs to reduce overhead.
    if self.config.pad_token_id in input_ids[:, [-1, 0]]:
        warn_string = (
return forward_call(*args, **kwargs)

File "/opt/homebrew/anaconda3/envs/market_prediction_cpu/lib/python3.10/site-packages/transformers/models/modernbert/modeling_modernbert.py", line 871, in forward
self.warn_if_padding_and_no_attention_mask(input_ids, attention_mask)
File "/opt/homebrew/anaconda3/envs/market_prediction_cpu/lib/python3.10/site-packages/transformers/modeling_utils.py", line 5033, in warn_if_padding_and_no_attention_mask
if self.config.pad_token_id in input_ids[:, [-1, 0]]:
TypeError: unhashable type: 'slice'
python-BaseException

python3.10
running on mac os
inputs = tokenizer(
'He walked to the',
return_tensors="pt"
)
model(inputs['input_ids'], inputs['attention_mask'])

nvm, worked after some fixes

I believe this is due to Python 3.9. Using a newer version should fix it.
As a workaround, you can add from __future__ import annotations to the top of transformers/models/modernbert/modeling_modernbert.py but I wouldn't go that route.

  • Tom Aarsen
    # Check only the first and last input IDs to reduce overhead.
    if self.config.pad_token_id in input_ids[:, [-1, 0]]:
        warn_string = (
return forward_call(*args, **kwargs)

File "/opt/homebrew/anaconda3/envs/market_prediction_cpu/lib/python3.10/site-packages/transformers/models/modernbert/modeling_modernbert.py", line 871, in forward
self.warn_if_padding_and_no_attention_mask(input_ids, attention_mask)
File "/opt/homebrew/anaconda3/envs/market_prediction_cpu/lib/python3.10/site-packages/transformers/modeling_utils.py", line 5033, in warn_if_padding_and_no_attention_mask
if self.config.pad_token_id in input_ids[:, [-1, 0]]:
TypeError: unhashable type: 'slice'
python-BaseException

python3.10
running on mac os
inputs = tokenizer(
'He walked to the',
return_tensors="pt"
)
model(inputs)

Works with my current python version : Python 3.10.14
OS : Windows 11 and Ubuntu 24.04 LTS

This comment has been hidden

I wonder whether using mean pooling is enough for ModernBERT.

No sentence-transformers model found with name answerdotai/ModernBERT-base. Creating a new one with mean pooling.

running o

For anyone wondering run the following command:

pip install git+https://github.com/huggingface/transformers

tried to run with pip install git+https://github.com/huggingface/transformers on python 3.9
lib/python3.9/site-packages/transformers/utils/import_utils.py", line 1805, in _get_module
raise RuntimeError(
RuntimeError: Failed to import transformers.models.modernbert.modeling_modernbert because of the following error (look up to see its traceback):
unsupported operand type(s) for |: '_GenericAlias' and '_GenericAlias'

simple script

from transformers import AutoTokenizer, AutoModelForMaskedLM

tokenizer = AutoTokenizer.from_pretrained("answerdotai/ModernBERT-base")
model = AutoModelForMaskedLM.from_pretrained("answerdotai/ModernBERT-base")
input_text = "He walked to the [MASK]."
print(model(input_text))

It works when I change my python version from 3.9 to 3.10

Hi
I have installed the transformer from git as was suggested here, and indeed it solved the problem and helped me to be able to load the model.
When I am trying to use the model, again with sentance transformer, I got the following error exception:

raise BackendCompilerFailed(self.compiler_fn, e) from etorch._dynamo.exc.BackendCompilerFailed: backend='inductor' raised:
CalledProcessError: Command '['/usr/bin/gcc', '/tmp/tmpts7u207v/main.c', '-O3', '-shared', '-fPIC', '-o', '/tmp/tmpts7u207v/cuda_utils.cpython-312-x86_64-linux-gnu.so', '-lcuda', '-L/home/ubuntu/cto-embedding/ehud/lib/python3.12/site-packages/triton/backends/nvidia/lib', '-L/lib/x86_64-linux-gnu', '-I/home/ubuntu/cto-embedding/ehud/lib/python3.12/site-packages/triton/backends/nvidia/include', '-I/tmp/tmpts7u207v', '-I/usr/include/python3.12']' returned non-zero exit status 1.

Set TORCH_LOGS="+dynamo" and TORCHDYNAMO_VERBOSE=1 for more information

You can suppress this exception and fall back to eager by setting:
import torch._dynamo
torch._dynamo.config.suppress_errors = True

Any idea for how to slove this issue?

I am running on ubunto on AWS machine GPU CUDA Version: 12.2 and Python 3.12

Answer.AI org

Hello!
I would try loading this model with from_pretrained(" answerdotai/ModernBERT-base", reference_compile=False) to disable some compilation options.
That might help avoid this crash.

  • Tom Aarsen

Hi again
Thanks for your prompt response.
I am using the SentenceTransformer class to load the model (and the other various operations follow like encode, train etc.). Can I do similar disabling of compilation options while using the SentenceTransformer class?

Ehud

Answer.AI org

Certainly, you can pass the arguments that go to from_pretrained to the model_kwargs parameter in SentenceTransformer:

model = SentenceTransformer("answerdotai/ModernBERT-base", model_kwargs={"reference_compile": False})
  • Tom Aarsen

Thanks a lot again Tom !
I tried your suggestion and got the following error:
TypeError: ModernBertModel.init() got an unexpected keyword argument 'reference_compile'
Any idea ?
Ehud

Answer.AI org

Oh, apologies, perhaps you have to use config_kwargs instead of model_kwargs.

That was helpful, thanks a lot.
However now I run into other issue, I got:
No sentence-transformers model found with name answerdotai/ModernBERT-base. Creating a new one with mean pooling.

I see this issue in another thread (How to use ModernBERT as a sentence transformer? ) and in Training issues latest version #35407.

Sign up or log in to comment