Safetensors
Hindi
English
mistral
aamodthakur commited on
Commit
c3c9482
ยท
verified ยท
1 Parent(s): 1330fe5

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +1 -27
README.md CHANGED
@@ -14,10 +14,7 @@ base_model: LingoIITGN/ganga-1b
14
 
15
  # Model Card for Ganga-1b! ๐ŸŒŠ
16
 
17
- The base model **``Ganga-1b``** trained on a monolingual **Hindi** language dataset as part of ***Project Unity***. We propose the name *Ganga* ๐ŸŒŠ to honor the longest river flowing through the Hindi-speaking region of India ๐Ÿ‡ฎ๐Ÿ‡ณ.
18
-
19
- *(The first pre-trained Hindi model by any academic research lab in India ๐Ÿ‡ฎ๐Ÿ‡ณ!)**
20
-
21
 
22
  ![image/png](https://cdn-uploads.huggingface.co/production/uploads/667b8f8ba271fc5a8e6929de/jG3tZnGPvH6vcGrvxO-YC.png)
23
 
@@ -58,28 +55,6 @@ print(tokenizer.decode(output[0][input_ids[0].shape[0]:], skip_special_tokens=Tr
58
 
59
  ```
60
 
61
- ## Technical Specifications ๐Ÿค–
62
-
63
- - **Precision**: *Float32*
64
- - **Context Length**: *2,048*
65
- - **Learning Rate**: *4e-4*
66
- - **Optimizer**: *AdamW*
67
- - **LR Scheduler**: *Cosine*
68
-
69
- ### Model Architecture and Objective
70
-
71
-
72
- Ganga-1b is a decoder-only transformer model, featuring the following specifications:
73
-
74
-
75
- * Layers: 16
76
- * Attention heads: 32
77
- * Embedding dimension: 2,048
78
- * Vocabulary size: 30,000
79
- * Sliding window: 512
80
- * Intermediate dimension: 7,168
81
-
82
-
83
  ## Evaluation
84
  [More Information Needed]
85
 
@@ -104,7 +79,6 @@ Ganga-1b is a decoder-only transformer model, featuring the following specificat
104
  ## Summary
105
 
106
 
107
-
108
  ## Bias, Risks, and Limitations ๐Ÿšจ
109
 
110
 
 
14
 
15
  # Model Card for Ganga-1b! ๐ŸŒŠ
16
 
17
+ The model **``Ganga-en-hi-1b``** is a fine-tuned version of **``Ganga-1b``** for the English to Hindi Translation Task.
 
 
 
18
 
19
  ![image/png](https://cdn-uploads.huggingface.co/production/uploads/667b8f8ba271fc5a8e6929de/jG3tZnGPvH6vcGrvxO-YC.png)
20
 
 
55
 
56
  ```
57
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
58
  ## Evaluation
59
  [More Information Needed]
60
 
 
79
  ## Summary
80
 
81
 
 
82
  ## Bias, Risks, and Limitations ๐Ÿšจ
83
 
84