Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
deepseek-ai
/
DeepSeek-V2-Lite
like
105
Follow
DeepSeek
10.1k
Text Generation
Transformers
Safetensors
deepseek_v2
conversational
custom_code
text-generation-inference
Inference Endpoints
arxiv:
2405.04434
License:
deepseek
Model card
Files
Files and versions
Community
8
Train
Deploy
Use this model
75f2232
DeepSeek-V2-Lite
5 contributors
History:
22 commits
LiangliangMa
Use try-except for flash_attn import
75f2232
verified
4 months ago
.gitattributes
1.52 kB
initial commit
8 months ago
LICENSE
13.8 kB
Upload LICENSE
8 months ago
README.md
15.5 kB
docs: update README.md
8 months ago
config.json
1.52 kB
Upload folder using huggingface_hub
8 months ago
configuration_deepseek.py
10.3 kB
Upload folder using huggingface_hub
8 months ago
generation_config.json
181 Bytes
Upload folder using huggingface_hub
8 months ago
model-00001-of-000004.safetensors
8.59 GB
LFS
Upload folder using huggingface_hub
8 months ago
model-00002-of-000004.safetensors
8.59 GB
LFS
Upload folder using huggingface_hub
8 months ago
model-00003-of-000004.safetensors
8.59 GB
LFS
Upload folder using huggingface_hub
8 months ago
model-00004-of-000004.safetensors
5.64 GB
LFS
Upload folder using huggingface_hub
8 months ago
model.safetensors.index.json
480 kB
Upload folder using huggingface_hub
8 months ago
modeling_deepseek.py
78.6 kB
Use try-except for flash_attn import
4 months ago
tokenization_deepseek_fast.py
1.37 kB
Upload folder using huggingface_hub
8 months ago
tokenizer.json
4.61 MB
Upload folder using huggingface_hub
8 months ago
tokenizer_config.json
1.28 kB
Upload folder using huggingface_hub
8 months ago