Mikhail Terekhov
terekhov
AI & ML interests
Reinforcement Learning, Multi-objective Reinforcement Learning, RLHF
Recent Activity
liked
a dataset
3 days ago
Rapidata/text-2-image-Rich-Human-Feedback
liked
a dataset
2 months ago
Rapidata/117k_human_coherence_flux1.0_V_flux1.1Blueberry
liked
a model
4 months ago
allenai/open-instruct-pythia-6.9b-tulu
Organizations
models
None public yet
datasets
None public yet