Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
ldwang
/
mamba-1.4b-aquila-400b
like
0
Transformers
PyTorch
Inference Endpoints
arxiv:
2312.00752
Model card
Files
Files and versions
Community
Train
Deploy
Use this model
eda73a2
mamba-1.4b-aquila-400b
/
config.json
ldwang
Upload config.json with huggingface_hub
65275f6
verified
12 months ago
raw
Copy download link
history
blame
Safe
187 Bytes
{
"d_model"
:
2048
,
"fused_add_norm"
:
true
,
"n_layer"
:
48
,
"pad_vocab_size_multiple"
:
1
,
"residual_in_fp32"
:
true
,
"rms_norm"
:
false
,
"ssm_cfg"
:
{
}
,
"vocab_size"
:
100008
}