File size: 626 Bytes
429d619 23f75fc |
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 |
---
language:
- he
tags:
- language model
datasets:
- responsa
---
**AlephBERT-base-finetuned-for-shut**
**Hebrew Language Model**
Based on alephbert-base: https://huggingface.co/onlplab/alephbert-base#alephbert
**How to use:**
from transformers import AutoModelForMaskedLM, AutoTokenizer
checkpoint = 'ysnow9876/alephbert-base-finetuned-for-shut'
tokenizer = AutoTokenizer.from_pretrained(checkpoint)
model= AutoModelForMaskedLM.from_pretrained(checkpoint)
#if not finetuning - disable dropout
model.eval()
**Training Data**
about 26,000 different responsa from different rabbis from the past few hundred years
|