208k-MarianMT-Nepali-to-English

This model is a fine-tuned version of iamTangsang/MarianMT-Nepali-to-English-Synthetic-Pretrain-Continued on an unknown dataset. It achieves the following results on the evaluation set (a brief usage sketch follows the list):

  • Loss: 1.7458
  • Bleu: 23.0471
  • Gen Len: 26.8454
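Since the usage sections below are still marked "More information needed", here is a minimal inference sketch. It assumes the checkpoint is available on the Hugging Face Hub under the repo ID in this card's title; the sample sentence and the generation settings (num_beams, max_length) are illustrative assumptions, not values documented for this model.

```python
from transformers import AutoModelForSeq2SeqLM, AutoTokenizer

model_id = "iamTangsang/208k-MarianMT-Nepali-to-English"
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForSeq2SeqLM.from_pretrained(model_id)

# Sample Nepali input (illustrative): "Hello, how are you?"
nepali_text = "नमस्ते, तपाईंलाई कस्तो छ?"
inputs = tokenizer(nepali_text, return_tensors="pt")

# Beam size and length cap are assumptions, not documented settings.
generated = model.generate(**inputs, num_beams=4, max_length=128)
print(tokenizer.decode(generated[0], skip_special_tokens=True))
```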

Model description

More information needed

Intended uses & limitations

More information needed

Training and evaluation data

More information needed

Training procedure

Training hyperparameters

The following hyperparameters were used during training (a reproduction sketch follows the list):

  • learning_rate: 3e-05
  • train_batch_size: 16
  • eval_batch_size: 16
  • seed: 42
  • optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
  • lr_scheduler_type: linear
  • num_epochs: 1
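The optimizer values above are the Transformers defaults for Adam, so a reproduction only needs to set the listed arguments explicitly. Below is a sketch using the Seq2SeqTrainer API; since the training data is undocumented, the dataset objects are placeholders.

```python
from transformers import (
    AutoModelForSeq2SeqLM,
    AutoTokenizer,
    Seq2SeqTrainer,
    Seq2SeqTrainingArguments,
)

base_id = "iamTangsang/MarianMT-Nepali-to-English-Synthetic-Pretrain-Continued"
tokenizer = AutoTokenizer.from_pretrained(base_id)
model = AutoModelForSeq2SeqLM.from_pretrained(base_id)

args = Seq2SeqTrainingArguments(
    output_dir="208k-MarianMT-Nepali-to-English",
    learning_rate=3e-5,
    per_device_train_batch_size=16,
    per_device_eval_batch_size=16,
    seed=42,
    lr_scheduler_type="linear",
    num_train_epochs=1,
    predict_with_generate=True,  # required to report Bleu / Gen Len at eval time
)

# train_dataset / eval_dataset are placeholders: the card does not document
# the corpus, so supply your own tokenized Nepali-English parallel data.
# trainer = Seq2SeqTrainer(
#     model=model,
#     args=args,
#     train_dataset=train_dataset,
#     eval_dataset=eval_dataset,
#     tokenizer=tokenizer,
# )
# trainer.train()
```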

Training results

Training Loss   Epoch    Step    Validation Loss   Bleu      Gen Len
-------------   ------   -----   ---------------   -------   -------
2.1080          0.2500   3230    1.9853            15.2869   30.3236
1.9731          0.4999   6460    1.8364            21.4787   27.7903
1.8547          0.7499   9690    1.7712            27.1901   25.8550
1.8446          0.9998   12920   1.7458            23.0471   26.8454
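For reference, here is a sketch of how the Bleu and Gen Len columns are typically computed with the evaluate library when predict_with_generate is enabled. The prediction and reference strings are placeholders, since the evaluation split is not documented here.

```python
import evaluate

sacrebleu = evaluate.load("sacrebleu")

# Placeholders: decoded model outputs and gold translations.
predictions = ["The weather is nice today."]
references = [["The weather is pleasant today."]]

result = sacrebleu.compute(predictions=predictions, references=references)
print(result["score"])  # corpus-level BLEU, comparable to the Bleu column

# Gen Len is conventionally the mean token count of the generated sequences.
```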

Framework versions

  • Transformers 4.45.2
  • Pytorch 2.5.1+cu121
  • Datasets 3.2.0
  • Tokenizers 0.20.3
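A quick way to check that a local environment matches the versions listed above:

```python
import datasets
import tokenizers
import torch
import transformers

print(transformers.__version__)  # expected: 4.45.2
print(torch.__version__)         # expected: 2.5.1+cu121
print(datasets.__version__)      # expected: 3.2.0
print(tokenizers.__version__)    # expected: 0.20.3
```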