arxiv:2402.05000
Kangqi (Kevin) Ni
kangqi-ni
AI & ML interests
NLP, CV, RLHF
Organizations
None yet
Papers
1
models
11
kangqi-ni/zephyr-7b-beta_bio-tutor_kto
Updated
•
2
kangqi-ni/Mistral-7B-Instruct-v0.2_bio-tutor_kto
Updated
•
2
kangqi-ni/Llama-3.1-8b-Instruct_bio-tutor_kto
Updated
•
5
kangqi-ni/Llama-3.1-8B-Instruct_bio-tutor_dpo
Updated
•
10
kangqi-ni/Llama-3.1-8B-Instruct_bio-tutor_sft
Updated
•
2
kangqi-ni/zephyr-7b-beta_bio-tutor_sft
Text Generation
•
Updated
•
8
kangqi-ni/zephyr-7b-beta_bio-tutor_dpo
Text Generation
•
Updated
•
18
kangqi-ni/Mistral-7B-Instruct-v0.2_bio-tutor_sft
Text Generation
•
Updated
•
19
kangqi-ni/Mistral-7B-Instruct-v0.2_bio-tutor_dpo
Text Generation
•
Updated
•
11
kangqi-ni/roberta-large-mnli-ricechem
Text Classification
•
Updated
•
107
datasets
None public yet