Can I use any Inference Engine(like vllm、ollama) applicable to qwen2.5 to infer Athene-V2-Chat?
1
#5 opened 24 days ago
by
wangdafa
32 B coding model please
3
#4 opened about 2 months ago
by
gopi87
inference api not working
2
#3 opened about 2 months ago
by
llamameta
Smaller versions incoming?
#2 opened about 2 months ago
by
phly95