πŸ‹ Humback

The proposed Humback is a novel framework that can augment the instruction data for supervised fine-tuning with high quality.

This is a SFT (supervised fine-tuning) model $M_{0}$ for Humback reproduction.

This model is trained on the seed data.

The seed data is a sampled dataset from oasst1.

You may find more details and usage examples in Spico197/Humback .

πŸ“œ Reference

@misc{li2023selfalignment,
    title={Self-Alignment with Instruction Backtranslation},
    author={Xian Li and Ping Yu and Chunting Zhou and Timo Schick and Luke Zettlemoyer and Omer Levy and Jason Weston and Mike Lewis},
    year={2023},
    eprint={2308.06259},
    archivePrefix={arXiv},
    primaryClass={cs.CL}
}
Downloads last month
28
Inference Examples
This model does not have enough activity to be deployed to Inference API (serverless) yet. Increase its social visibility and check back later, or deploy to Inference Endpoints (dedicated) instead.

Dataset used to train Spico/Humback-M0