Anthonyg5005's picture
Update README.md
d045899 verified
|
raw
history blame
741 Bytes
metadata
license: llama3
language:
  - en
library_name: CTranslate2
pipeline_tag: text-generation
tags:
  - facebook
  - meta
  - llama
  - llama-3
  - kaltcit
  - cat
  - ct2
  - quantized model
  - int8
base_model: turboderp/unknown

CTranslate2 int8 version of turbcat

This is a int8_float16 quantization of [turbcat](not released yet)
See more on CTranslate2: Docs | Github

This model was converted to ct2 format using the following commnd:

ct2-transformers-converter --model kat_turbcat --output_dir turbcat-ct2 --quantization int8_float16 --low_cpu_mem_usage

no converstion needed using the model from this repository as it is already in ct2 format.