-
-
-
-
-
-
Inference status
Active filters:
rl
ContextualAI/Contextual_KTO_Mistral_PairRM
Text Generation
•
Updated
•
219
•
31
sryu1/jsbgym_models
Reinforcement Learning
•
Updated
d-byrne/snake-v1_training_state
Updated
InstaDeepAI/jumanji-benchmark-a2c-BinPack-v2
Updated
InstaDeepAI/jumanji-benchmark-a2c-CVRP-v1
ContextualAI/archangel_sft_pythia1-4b
Text Generation
•
Updated
•
51
ContextualAI/archangel_sft_pythia2-8b
Text Generation
•
Updated
•
18
•
1
ContextualAI/archangel_sft_pythia6-9b
Text Generation
•
Updated
•
28
ContextualAI/archangel_sft_pythia12-0b
Text Generation
•
Updated
•
24
ContextualAI/archangel_sft_llama7b
Text Generation
•
Updated
•
432
•
1
ContextualAI/archangel_sft_llama13b
Text Generation
•
Updated
•
229
ContextualAI/archangel_sft_llama30b
Text Generation
•
Updated
•
26
ContextualAI/archangel_slic_llama30b
Text Generation
•
Updated
•
25
ContextualAI/archangel_slic_pythia1-4b
Text Generation
•
Updated
•
20
ContextualAI/archangel_slic_pythia2-8b
Text Generation
•
Updated
•
18
ContextualAI/archangel_slic_pythia6-9b
Text Generation
•
Updated
•
21
ContextualAI/archangel_slic_pythia12-0b
Text Generation
•
Updated
•
21
ContextualAI/archangel_slic_llama7b
Text Generation
•
Updated
•
20
•
1
ContextualAI/archangel_slic_llama13b
Text Generation
•
Updated
•
20
ContextualAI/archangel_dpo_pythia1-4b
Text Generation
•
Updated
•
20
ContextualAI/archangel_dpo_pythia2-8b
Text Generation
•
Updated
•
19
ContextualAI/archangel_dpo_pythia6-9b
Text Generation
•
Updated
•
19
ContextualAI/archangel_dpo_pythia12-0b
Text Generation
•
Updated
•
23
ContextualAI/archangel_dpo_llama7b
Text Generation
•
Updated
•
288
ContextualAI/archangel_dpo_llama13b
Text Generation
•
Updated
•
85
ContextualAI/archangel_dpo_llama30b
Text Generation
•
Updated
•
31
ContextualAI/archangel_kto_pythia1-4b
Text Generation
•
Updated
•
24
ContextualAI/archangel_kto_pythia2-8b
Text Generation
•
Updated
•
24
ContextualAI/archangel_kto_pythia6-9b
Text Generation
•
Updated
•
24
ContextualAI/archangel_kto_pythia12-0b
Text Generation
•
Updated
•
41