SHRDFU-7b LoRA

  • Developed by: maldv
  • License: cc-by-nc-4.0
  • Finetuned from model: cgato/Thespis-CurtainCall-7b-v0.3
  • Methodology: Conditioning the model by targeting the attention layers with peft, followed by a small amount of full-layer tuning; extending intelligence and problem solving with crabcanon
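
The attention-targeting step in the methodology can be sketched roughly as follows. This is an illustration, not the configuration used to train this adapter: the rank, alpha, and dropout values are hypothetical, and the projection names (`q_proj`, `k_proj`, `v_proj`, `o_proj`) assume the base model follows the Llama/Mistral module naming used in transformers. The helper `select_attention_modules` is made up for this example; it only shows which submodules a suffix-based target list would pick up.

```python
# Assumption: Llama/Mistral-style attention projection names.
ATTENTION_PROJECTIONS = ("q_proj", "k_proj", "v_proj", "o_proj")

# Hypothetical hyperparameters; the model card does not state the real ones.
# The keys match field names of peft's LoraConfig.
lora_kwargs = {
    "r": 16,
    "lora_alpha": 32,
    "lora_dropout": 0.05,
    "target_modules": list(ATTENTION_PROJECTIONS),
    "task_type": "CAUSAL_LM",
}


def select_attention_modules(module_names):
    """Return the names whose last dotted component is an attention projection."""
    return [
        name
        for name in module_names
        if name.rsplit(".", 1)[-1] in ATTENTION_PROJECTIONS
    ]


# Module names as they would appear in one decoder layer (illustrative only).
example_names = [
    "model.layers.0.self_attn.q_proj",
    "model.layers.0.self_attn.k_proj",
    "model.layers.0.self_attn.v_proj",
    "model.layers.0.self_attn.o_proj",
    "model.layers.0.mlp.gate_proj",
    "model.layers.0.mlp.up_proj",
    "model.layers.0.mlp.down_proj",
]

# Only the attention projections are selected; the MLP layers are left alone.
selected = select_attention_modules(example_names)
```

With peft installed, the same dictionary could be passed as `LoraConfig(**lora_kwargs)` and applied to a loaded base model with `get_peft_model(model, config)`, which would leave the MLP weights frozen and train adapters only on the attention projections.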

As I work on understanding how to layer information into the model, this dataset proved to be very interesting. It showed that it is very easy to overbake a model with an inadequate sample size, but it sure thinks it is a smart model now.

