Commit History
Rename inference-cache-config/llama2.json to inference-cache-config/llama2-7b-13b.json
be28bda
verified
Create llama2-70b.json
6fe6ee4
verified
Rename inference-cache-config/llama3.json to inference-cache-config/llama3-8b.json
06bc70d
verified
Create llama3-70b.json
2695ea9
verified
Create mixtral.json
57652e6
verified
Add more batch_size for mistral on smaller instances
545cd4d
verified
Update Mistral cached configurations
ee458f5
verified
Use princeton-nlp/Sheared-LLaMA-1.3B as a test model
695b341
verified
Remove llama2 7B config for 24 cores
17e7257
verified
Update inference-cache-config/llama3.json
5d8c4f2
verified
Update inference-cache-config/llama3.json
f5aae68
verified
Create llama3.json
f93cadb
verified
Rename inference-cache-config/llama.json to inference-cache-config/llama2.json
f06a55a
verified
Add more gpt2 configurations
3fbf810
verified
Add more llama config
2d87237
verified
Add Mistral-v2
20e585f
verified
Create stable-diffusion.json (#43)
32561fe
verified
Remove SalesForce embedding model
1cd13f9
verified
Add Zephyr to mistral variants
9164704
verified
Remove variants from main mistral config
ef07aca
verified
Add mistral most popular variants
d3983e8
verified
Add most popular llama variants
594abb2
verified
Added teknium/OpenHermes-2.5-Mistral-7B
1518247
verified
Added Llama-70b batch_size 4 to inference cache
593822e
verified
Create mistral.json
b5d0afd
verified
philschmid
committed on
Create gpt2.json
3bdb891
verified
philschmid
committed on
Create inference-cache-config/llama.json
1960ccb
verified
philschmid
committed on