---
library_name: transformers
license: mit
base_model: MoritzLaurer/deberta-v3-large-zeroshot-v2.0-c
tags:
- generated_from_trainer
model-index:
- name: zsc_finetune_results
  results: []
---

# zsc_finetune_results

This model is a fine-tuned version of [MoritzLaurer/deberta-v3-large-zeroshot-v2.0-c](https://huggingface.co/MoritzLaurer/deberta-v3-large-zeroshot-v2.0-c) on an unspecified dataset.
It achieves the following results on the evaluation set:
- Loss: 0.9992

## Model description

More information needed

## Intended uses & limitations

More information needed

## Training and evaluation data

More information needed

## Training procedure

### Training hyperparameters

The following hyperparameters were used during training:
- learning_rate: 2e-05
- train_batch_size: 1
- eval_batch_size: 1
- seed: 42
- optimizer: adamw_torch with betas=(0.9, 0.999) and epsilon=1e-08; no additional optimizer arguments
- lr_scheduler_type: linear
- num_epochs: 1
- mixed_precision_training: Native AMP

A `TrainingArguments` sketch mirroring these settings is given below, after the framework versions.

### Training results

| Training Loss | Epoch  | Step | Validation Loss |
|:-------------:|:------:|:----:|:---------------:|
| 1.0003        | 0.0978 | 500  | 0.9619          |
| 0.8545        | 0.1956 | 1000 | 1.2778          |
| 1.0119        | 0.2934 | 1500 | 0.9095          |
| 0.9421        | 0.3912 | 2000 | 1.0197          |
| 1.062         | 0.4890 | 2500 | 1.0226          |
| 0.9204        | 0.5869 | 3000 | 0.9799          |
| 0.9559        | 0.6847 | 3500 | 1.0734          |
| 1.0267        | 0.7825 | 4000 | 1.0456          |
| 1.0528        | 0.8803 | 4500 | 1.0008          |
| 0.9935        | 0.9781 | 5000 | 0.9992          |

### Framework versions

- Transformers 4.46.2
- Pytorch 2.5.0+cu121
- Datasets 3.1.0
- Tokenizers 0.20.3
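
### Training configuration sketch

A minimal sketch of a `TrainingArguments` object matching the hyperparameters listed above. The `output_dir` value, the evaluation interval of 500 steps (inferred from the results table), and the choice of `fp16` for Native AMP are assumptions; the original training script was not published with this card.

```python
from transformers import TrainingArguments

# Sketch mirroring the hyperparameters listed above; values marked "assumed"
# do not appear in the card itself.
training_args = TrainingArguments(
    output_dir="zsc_finetune_results",  # assumed output directory
    learning_rate=2e-05,
    per_device_train_batch_size=1,
    per_device_eval_batch_size=1,
    seed=42,
    optim="adamw_torch",                # betas=(0.9, 0.999) and epsilon=1e-08 are the defaults
    lr_scheduler_type="linear",
    num_train_epochs=1,
    fp16=True,                          # Native AMP; assumed fp16 rather than bf16
    eval_strategy="steps",              # evaluation every 500 steps, matching the results table
    eval_steps=500,
)
```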
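
## Usage

Because the base model is a zero-shot NLI classifier, the fine-tuned checkpoint should be usable through the `zero-shot-classification` pipeline. A minimal sketch, assuming the weights are available under this repo id or a local path; the input text and candidate labels are placeholders:

```python
from transformers import pipeline

# Load the fine-tuned checkpoint; replace the model id with your local path
# or Hub repo id for this model.
classifier = pipeline(
    "zero-shot-classification",
    model="zsc_finetune_results",  # placeholder model id
)

result = classifier(
    "The new update completely broke my workflow.",  # placeholder input
    candidate_labels=["bug report", "feature request", "praise"],
)
print(result["labels"][0], result["scores"][0])
```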