Alex J. Chan's picture

Alex J. Chan

XanderJC

·

https://alexjchan.com/

AI & ML interests

None yet

Recent Activity

authored a paper 17 days ago

How to Catch an AI Liar: Lie Detection in Black-Box LLMs by Asking Unrelated Questions

authored a paper 17 days ago

Dense Reward for Free in Reinforcement Learning from Human Feedback

updated a model 5 months ago

XanderJC/sft-llava-1.5-7b-hf

View all activity

Organizations

XanderJC's activity

authored 2 papers 17 days ago

How to Catch an AI Liar: Lie Detection in Black-Box LLMs by Asking Unrelated Questions

Paper • 2309.15840 • Published Sep 26, 2023

Dense Reward for Free in Reinforcement Learning from Human Feedback

Paper • 2402.00782 • Published Feb 1, 2024

updated 2 models 5 months ago

XanderJC/sft-llava-1.5-7b-hf

Image-Text-to-Text • Updated Sep 14, 2024 • 4

XanderJC/sft_openassistant-guanaco

Text Generation • Updated Sep 12, 2024 • 108

updated 3 models 6 months ago

XanderJC/llama-3-8b-orca-abc

Reinforcement Learning • Updated Aug 11, 2024 • 2

XanderJC/llama-3-8b-orca-rlhf

Reinforcement Learning • Updated Aug 11, 2024 • 2

XanderJC/llama-3-8b-orca-rm

Updated Aug 10, 2024 • 3

authored a paper 8 months ago

Discovering Preference Optimization Algorithms with and for Large Language Models

Paper • 2406.08414 • Published Jun 12, 2024 • 14

updated 7 models about 1 year ago

XanderJC/phi2-sft-tldr-merged

Text Generation • Updated Jan 24, 2024 • 3

XanderJC/phi2-sft-tldr

Updated Jan 24, 2024 • 4

XanderJC/gptj-rm-tldr-merged

Text Classification • Updated Jan 21, 2024 • 2

XanderJC/gptj-sft-tldr-merged

Text Generation • Updated Jan 21, 2024 • 153

XanderJC/gptj-sft-tldr

Updated Jan 21, 2024 • 2

XanderJC/gpt2-rm-tldr

Text Classification • Updated Jan 20, 2024 • 2

XanderJC/gpt2-rm-imdb

Text Classification • Updated Dec 6, 2023 • 2.87k