This model is a quantized version of Sidrap-V2.
~/llama.cpp/main -ngl 32 -m sidrap-7b-v2.q4_K_M.gguf --color -c 4096 --temp 0.7 --repeat_penalty 1.1 -n -1 -i -ins
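The same invocation can be sketched as a small wrapper script with each flag annotated. This is an illustrative sketch, not part of the original instructions: the variable names and the `DRY_RUN` switch are hypothetical, the binary and model paths simply mirror the command above, and the flag descriptions reflect the llama.cpp builds that ship the `main` binary (check `main --help` for your build, since flags have changed between versions).

```shell
#!/bin/sh
# Hypothetical wrapper around the command above; variable names are illustrative.
LLAMA_BIN="${LLAMA_BIN:-$HOME/llama.cpp/main}"   # path to the llama.cpp binary
MODEL="${MODEL:-sidrap-7b-v2.q4_K_M.gguf}"       # 4-bit (Q4_K_M) GGUF file
GPU_LAYERS=32        # -ngl: number of layers offloaded to the GPU
CTX_SIZE=4096        # -c: context window size in tokens
TEMP=0.7             # --temp: sampling temperature
REPEAT_PENALTY=1.1   # --repeat_penalty: penalty applied to repeated tokens

# Build the argument list once so it can be either printed or executed.
set -- -ngl "$GPU_LAYERS" -m "$MODEL" --color -c "$CTX_SIZE" \
    --temp "$TEMP" --repeat_penalty "$REPEAT_PENALTY" \
    -n -1 -i -ins    # -n -1: no token limit; -i -ins: interactive instruction mode

CMD="$LLAMA_BIN $*"
if [ "${DRY_RUN:-1}" = "1" ]; then
    echo "$CMD"              # default: only print the command
else
    exec "$LLAMA_BIN" "$@"   # DRY_RUN=0 launches llama.cpp
fi
```

Running the script as-is only prints the assembled command; setting `DRY_RUN=0` launches the interactive session. If you have less GPU memory, lowering `GPU_LAYERS` keeps more layers on the CPU.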
4-bit
8-bit