GALAXY-16B-v1.0

image/png

Technical notes

  • 72 layers,DUS procedure, mistral(32)->SOLAR(48)->GALAXY(72)
  • 16B parameters
  • model created as an extension of depth upscaling procedure used for SOLAR by upstage

Results

  • model can and will produce NSFW content
  • waiting for eval results

Prompt template

  • Alpaca
  • chat template is embedded in tokenizer config, should load automatically

Context size

  • 4096

All comments are greatly appreciated, download, test and if you appreciate my work, consider buying me my fuel: Buy Me A Coffee

Downloads last month
11
Safetensors
Model size
16B params
Tensor type
BF16
·
Inference Examples
This model does not have enough activity to be deployed to Inference API (serverless) yet. Increase its social visibility and check back later, or deploy to Inference Endpoints (dedicated) instead.

Model tree for TeeZee/GALAXY-16B-v1.0

Quantizations
2 models

Datasets used to train TeeZee/GALAXY-16B-v1.0