File size: 641 Bytes
98230d0 cf690c3 98230d0 cf690c3 |
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 |
---
license: apache-2.0
language:
- en
---
# 4season/final_model_test_v2
# **Introduction**
Supervised fine-tuning refers to a machine learning technique where a pre-trained model is further trained on a specific task or dataset with labeled examples (supervised learning). The process involves taking a model that has been pre-trained on a large general dataset and then adapting it to a more focused task by continuing the training using task-specific data.
This model is test version, alignment-tuned model.
We utilize state-of-the-art instruction fine-tuning methods including direct preference optimization (DPO).
|