JunxiongWang commited on
Commit
7c055bb
1 Parent(s): 7ca5856

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +11 -0
README.md CHANGED
@@ -72,3 +72,14 @@ The following hyperparameters were used during training:
72
  - Pytorch 2.1.0+cu118
73
  - Datasets 2.20.0
74
  - Tokenizers 0.19.1
 
 
 
 
 
 
 
 
 
 
 
 
72
  - Pytorch 2.1.0+cu118
73
  - Datasets 2.20.0
74
  - Tokenizers 0.19.1
75
+
76
+ [MambaInLlama](arxiv.org/abs/2408.15237)
77
+
78
+ ```
79
+ @article{junxiongdaniele2024mambainllama,
80
+ title = {The Mamba in the Llama: Distilling and Accelerating Hybrid Models},
81
+ author = {Junxiong Wang and Daniele Paliotta and Avner May and Alexander M. Rush and Tri Dao},
82
+ journal = {arXiv preprint arXiv:2408.15237},
83
+ year = {2024}
84
+ }
85
+ ```