Commit History
Remove obsolete llama variants
eee32f0
verified
Rename inference-cache-config/Llama3.1-70b.json to inference-cache-config/llama3.1-70b.json
563ba38
verified
Update inference-cache-config/Llama3.1-70b.json
7b0370b
verified
Update inference-cache-config/mistral.json
8ea3b57
verified
Update inference-cache-config/llama.json
d05f579
verified
Rename inference-cache-config/Llama3.1-70B.json to inference-cache-config/Llama3.1-70b.json
a92cfe3
verified
Update inference-cache-config/mixtral.json
7342c16
verified
Rename inference-cache-config/Llama-3.1-70B.json to inference-cache-config/Llama3.1-70B.json
b41e94c
verified
Create Llama-3.1-70B.json
b1279f9
verified
Delete inference-cache-config/llama3-8b.json
5b0b2de
verified
Update inference-cache-config/llama.json
0548cd2
verified
Delete inference-cache-config/llama2-7b-13b.json
219c5fd
verified
Update inference-cache-config/llama.json
afb9fe6
verified
Rename inference-cache-config/llama-3.1-8B.json to inference-cache-config/llama.json
14844a0
verified
Update inference-cache-config/mistral.json
6c4c814
verified
Create llama-3.1-8B.json
320841a
verified
Update inference-cache-config/llama3-8b.json
de9e259
verified
Update inference-cache-config/llama3-70b.json
5694f75
verified
Update inference-cache-config/stable-diffusion.json
5272eb2
verified
Temporarily remove SD 1.5 from Runway
a74d412
verified
Update inference-cache-config/llama-variants.json
e7179a3
verified
Rename inference-cache-config/llama2.json to inference-cache-config/llama2-7b-13b.json
be28bda
verified
Create llama2-70b.json
6fe6ee4
verified
Rename inference-cache-config/llama3.json to inference-cache-config/llama3-8b.json
06bc70d
verified
Create llama3-70b.json
2695ea9
verified
Create mixtral.json
57652e6
verified
Add more batch_size for mistral on smaller instances
545cd4d
verified
Update Mistral cached configurations
ee458f5
verified
Use princeton-nlp/Sheared-LLaMA-1.3B as a test model
695b341
verified
Remove llama2 7B config for 24 cores
17e7257
verified
Update inference-cache-config/llama3.json
5d8c4f2
verified
Update inference-cache-config/llama3.json
f5aae68
verified
Create llama3.json
f93cadb
verified
Rename inference-cache-config/llama.json to inference-cache-config/llama2.json
f06a55a
verified
Add more gpt2 configurations
3fbf810
verified
Add more llama config
2d87237
verified
Add Mistral-v2
20e585f
verified
Create stable-diffusion.json (#43)
32561fe
verified
Remove SalesForce embedding model
1cd13f9
verified
Add Zephyr to mistral variants
9164704
verified
Remove variants from main mistral config
ef07aca
verified
Add mistral most popular variants
d3983e8
verified
Add most popular llama variants
594abb2
verified
Added teknium/OpenHermes-2.5-Mistral-7B
1518247
verified
Added Llama-70b batch_size 4 to inference cache
593822e
verified
Create mistral.json
b5d0afd
verified
philschmid
commited on
Create gpt2.json
3bdb891
verified
philschmid
commited on
Create inference-cache-config/llama.json
1960ccb
verified
philschmid
commited on