arxiv:2410.12491
Jared Joselowitz
jaredjoss
·
AI & ML interests
None yet
Recent Activity
authored
a paper
about 2 months ago
Insights from the Inverse: Reconstructing LLM Training Goals Through
Inverse RL
Organizations
None yet
Papers
1
spaces
2
models
22
jaredjoss/pythia-70m-irl-29eps-01-rlhf-model
Updated
jaredjoss/pythia-70m-irl-29eps-0035-rlhf-model
Updated
•
2
jaredjoss/pythia-410m-irl-6eps-15reps-rlhf-model
Updated
jaredjoss/pythia-70m-dahoas-hh-1-epoch-1000-steps-1e-7-lr-sft
Updated
jaredjoss/pythia-160m-dahoas-hh-1-epoch-1000-steps-1e-7-lr-sft
Updated
jaredjoss/pythia-410m-dahoas-hh-1-epoch-10000-steps-sft
Updated
jaredjoss/pythia-160m-dahoas-hh-1-epoch-10000-steps-sft
Updated
jaredjoss/pythia-70m-dahoas-hh-1-epoch-10000-steps-sft
Updated
jaredjoss/pythia-70m-irl-10eps-58reps-rlhf-model
Updated
jaredjoss/pythia-410m-roberta-lr_8e7-kl_01-steps_12000-rlhf-model
Text Generation
•
Updated
•
18
datasets
6
jaredjoss/jaredjoss-jigsaw-long-2000_160M_toxic
Viewer
•
Updated
•
1k
•
30
jaredjoss/jaredjoss-jigsaw-long-2000_160M_non_toxic
Viewer
•
Updated
•
1k
•
30
jaredjoss/jaredjoss-jigsaw-long-2000_410M_toxic
Viewer
•
Updated
•
1k
•
32
jaredjoss/jaredjoss-jigsaw-long-2000_410M_non_toxic
Viewer
•
Updated
•
1k
•
30
jaredjoss/jigsaw-long-2000
Viewer
•
Updated
•
2k
•
8
jaredjoss/allenai-combined
Viewer
•
Updated
•
999
•
2