This is a smaller version of facebook/mbart-large-50 that keeps only the Russian and English embeddings.

The SentencePiece vocabulary was shrunk from 250k to 25k tokens (the 10k most common English tokens and the 15k most common Russian tokens). The creation of this model is heavily based on David Dale's method described here, with some additions specific to MBart.
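The general idea of vocabulary shrinking can be sketched as follows. This is a minimal illustration on toy data, not the actual script used to build this model: the function names `select_kept_ids` and `shrink_embeddings`, the token counts, and the special-token ids are all hypothetical. It assumes that token frequencies have already been counted per language and that a set of special ids (standing in for control tokens and, in an MBart-style model, language codes) must always be kept.

```python
from collections import Counter


def select_kept_ids(en_counts, ru_counts, n_en, n_ru, special_ids):
    """Return the sorted old token ids to keep.

    Keeps every special id plus the n_en most common English tokens
    and the n_ru most common Russian tokens. Tokens shared by both
    languages are kept once, so the result can be smaller than
    len(special_ids) + n_en + n_ru.
    """
    keep = set(special_ids)
    keep.update(tok for tok, _ in en_counts.most_common(n_en))
    keep.update(tok for tok, _ in ru_counts.most_common(n_ru))
    return sorted(keep)


def shrink_embeddings(embeddings, kept_ids):
    """Keep only the embedding rows for kept_ids.

    Row i of the result is the old row kept_ids[i].
    """
    return [embeddings[i] for i in kept_ids]


# Toy setup (all values hypothetical): a 12-token vocabulary with 2-dim embeddings.
old_vocab_size = 12
embeddings = [[float(i), float(i) * 0.5] for i in range(old_vocab_size)]

# Hypothetical per-language token frequencies, keyed by old token id.
en_counts = Counter({4: 90, 5: 70, 6: 10, 7: 1})
ru_counts = Counter({8: 80, 9: 60, 10: 40, 5: 3})
special_ids = [0, 1, 2, 3]

kept_ids = select_kept_ids(en_counts, ru_counts, n_en=2, n_ru=3, special_ids=special_ids)

# Mapping from old ids to new, contiguous ids in the smaller vocabulary.
old_to_new = {old: new for new, old in enumerate(kept_ids)}
small_embeddings = shrink_embeddings(embeddings, kept_ids)
```

In a real model, the same row selection would have to be applied consistently to every weight tied to the vocabulary (input embeddings, output projection, and any per-token bias), and the tokenizer would need to be rebuilt so that its ids match `old_to_new`.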

Model size: 380M params (Safetensors, tensor type F32)

Model tree for sn4kebyt3/ru-bart-large: 2 finetuned models