aamodthakur
commited on
Update README.md
Browse files
README.md
CHANGED
@@ -14,10 +14,7 @@ base_model: LingoIITGN/ganga-1b
|
|
14 |
|
15 |
# Model Card for Ganga-1b! ๐
|
16 |
|
17 |
-
The
|
18 |
-
|
19 |
-
*(The first pre-trained Hindi model by any academic research lab in India ๐ฎ๐ณ!)**
|
20 |
-
|
21 |
|
22 |
![image/png](https://cdn-uploads.huggingface.co/production/uploads/667b8f8ba271fc5a8e6929de/jG3tZnGPvH6vcGrvxO-YC.png)
|
23 |
|
@@ -58,28 +55,6 @@ print(tokenizer.decode(output[0][input_ids[0].shape[0]:], skip_special_tokens=Tr
|
|
58 |
|
59 |
```
|
60 |
|
61 |
-
## Technical Specifications ๐ค
|
62 |
-
|
63 |
-
- **Precision**: *Float32*
|
64 |
-
- **Context Length**: *2,048*
|
65 |
-
- **Learning Rate**: *4e-4*
|
66 |
-
- **Optimizer**: *AdamW*
|
67 |
-
- **LR Scheduler**: *Cosine*
|
68 |
-
|
69 |
-
### Model Architecture and Objective
|
70 |
-
|
71 |
-
|
72 |
-
Ganga-1b is a decoder-only transformer model, featuring the following specifications:
|
73 |
-
|
74 |
-
|
75 |
-
* Layers: 16
|
76 |
-
* Attention heads: 32
|
77 |
-
* Embedding dimension: 2,048
|
78 |
-
* Vocabulary size: 30,000
|
79 |
-
* Sliding window: 512
|
80 |
-
* Intermediate dimension: 7,168
|
81 |
-
|
82 |
-
|
83 |
## Evaluation
|
84 |
[More Information Needed]
|
85 |
|
@@ -104,7 +79,6 @@ Ganga-1b is a decoder-only transformer model, featuring the following specificat
|
|
104 |
## Summary
|
105 |
|
106 |
|
107 |
-
|
108 |
## Bias, Risks, and Limitations ๐จ
|
109 |
|
110 |
|
|
|
14 |
|
15 |
# Model Card for Ganga-1b! ๐
|
16 |
|
17 |
+
The model **``Ganga-en-hi-1b``** is a fine-tuned version of **``Ganga-1b``** for the English to Hindi Translation Task.
|
|
|
|
|
|
|
18 |
|
19 |
![image/png](https://cdn-uploads.huggingface.co/production/uploads/667b8f8ba271fc5a8e6929de/jG3tZnGPvH6vcGrvxO-YC.png)
|
20 |
|
|
|
55 |
|
56 |
```
|
57 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
58 |
## Evaluation
|
59 |
[More Information Needed]
|
60 |
|
|
|
79 |
## Summary
|
80 |
|
81 |
|
|
|
82 |
## Bias, Risks, and Limitations ๐จ
|
83 |
|
84 |
|