Levanti (Hebrew -> colloquial Levantine Arabic) translator

Trained on the Levanti dataset by fine-tuning Helsinki-NLP/opus-mt-he-ar for 8 epochs. This model is trained to support dialect conditional generation by utilizing the first token (followed by a space) as an indicator of the desired dialect:

  • P for Palestinian
  • L for Lebanese
  • S for Syrian
  • E for Egyptian

Example usage

from transformers import pipeline
trans = pipeline("translation", "guymorlan/levanti_translate_he_ar")
trans("P אני רוצה ללכת מחר לחנות")
Out[1]: [{'translation_text': 'بدي أروح ع الدكان بكرا'}]

Attribution

Created by Guy Mor-Lan.
Contact: guy.mor AT mail.huji.ac.il

Downloads last month
63
Safetensors
Model size
76.6M params
Tensor type
F32
·
Inference Providers NEW
This model is not currently available via any of the supported third-party Inference Providers, and the model is not deployed on the HF Inference API.

Dataset used to train guymorlan/levanti_translate_he_ar