unknown pre-tokenizer type: 'deepseek-r1-qwen'

#1
by Neman - opened

What version of llama.cpp did you use? I get this error:
llama_model_load: error loading model: error loading model vocabulary: unknown pre-tokenizer type: 'deepseek-r1-qwen'
with the latest version:
From https://github.com/ggerganov/llama.cpp

  • [new tag] b4516 -> b4516

EDIT: The Llama distill (this repo) works. Sorry, I should have posted this in the Qwen distill repo.

Same error. How do I solve it? Thanks.

FYI, newer versions of llama.cpp build the executables (llama-server, llama-cli, ...) into /llama.cpp/build/bin. That was the issue on my side.
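
If the same thing might be happening to you, a quick way to check is to list every copy of an executable that could be picked up and see which one is newest. The sketch below is only an illustration: the helper name `find_candidates` is made up, and it assumes the checkout has a `build/bin` directory as described above, possibly an older copy left in the repo root, and possibly another copy on your `PATH`. It only looks at file timestamps and does not run anything.

```python
import os
import shutil
from pathlib import Path


def find_candidates(name, repo_dir):
    """Return existing executables called `name`, newest first.

    Hypothetical helper: it looks in <repo_dir>/build/bin, in <repo_dir>
    itself (an assumed location for older builds), and on PATH.
    """
    repo = Path(repo_dir)
    candidates = [repo / "build" / "bin" / name, repo / name]

    # shutil.which returns the first match on PATH, or None if there is none.
    on_path = shutil.which(name)
    if on_path:
        candidates.append(Path(on_path))

    # Keep only real, executable files; de-duplicate by resolved path.
    mtimes = {}
    for path in candidates:
        if path.is_file() and os.access(path, os.X_OK):
            mtimes[path.resolve()] = path.stat().st_mtime

    # Most recently modified file first.
    return sorted(mtimes, key=mtimes.get, reverse=True)


if __name__ == "__main__":
    # "llama.cpp" is an assumed checkout location; adjust to your own path.
    for path in find_candidates("llama-cli", "llama.cpp"):
        print(path)
```

If the first path printed is not the one you are actually launching, you are probably running a stale build that predates support for the model.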

Neman changed discussion status to closed
