This is a derived model of chatglm3-6b-32k, has been converted to TensorRT LLM checkpoint for further usage. The model is presented in different quantizations.
This model does not have enough activity to be deployed to Inference API (serverless) yet. Increase its social
visibility and check back later, or deploy to Inference Endpoints (dedicated)
instead.
Model tree for Tridefender/chatglm3_6b_32k_TensorRTReady
Base model
THUDM/chatglm3-6b-32k