JunxiongWang commited on
Commit
533247d
1 Parent(s): a312bc7

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +12 -0
README.md CHANGED
@@ -73,3 +73,15 @@ The following hyperparameters were used during training:
73
  - Pytorch 2.1.0+cu118
74
  - Datasets 2.20.0
75
  - Tokenizers 0.19.1
 
 
 
 
 
 
 
 
 
 
 
 
 
73
  - Pytorch 2.1.0+cu118
74
  - Datasets 2.20.0
75
  - Tokenizers 0.19.1
76
+
77
+
78
+ [MambaInLlama](arxiv.org/abs/2408.15237)
79
+
80
+ ```
81
+ @article{junxiongdaniele2024mambainllama,
82
+ title = {The Mamba in the Llama: Distilling and Accelerating Hybrid Models},
83
+ author = {Junxiong Wang and Daniele Paliotta and Avner May and Alexander M. Rush and Tri Dao},
84
+ journal = {arXiv preprint arXiv:2408.15237},
85
+ year = {2024}
86
+ }
87
+ ```