arxiv:2501.03271
Basab Ghosh
basab1142
AI & ML interests
Computer Vision NLP, and RL
Recent Activity
updated
a model
4 days ago
basab1142/FPO_Gemma_7b_it_with_human_fetched_article
updated
a model
4 days ago
basab1142/gemma-2b_it_fpo
Organizations
None yet
Papers
1
models
8
basab1142/FPO_Gemma_7b_it_with_human_fetched_article
Updated
•
10
basab1142/gemma-2b_it_fpo
Updated
basab1142/FPO_Gemma_7b_it
Updated
•
17
basab1142/a2c-PandaReachDense-v3
Reinforcement Learning
•
Updated
basab1142/dqn-SpaceInvadersNoFrameskip-v4
Reinforcement Learning
•
Updated
•
1
basab1142/Taxi-v3-CQ
Reinforcement Learning
•
Updated
basab1142/q-FrozenLake-v1-4x4-noSlippery
Reinforcement Learning
•
Updated
basab1142/ppo-LunarLander-v2
Reinforcement Learning
•
Updated
•
1