AronXiang
/

RetrospexLLaMA3

Model card Files Files and versions Community

Model Card for Model ID

This model is trained by lora for Retrospex based on AgentInstruct and ShareGPT datasets. The base model is Llama-3-8B-Instruct.

Model Details

Model Description

Developed by: Convai NJU
Shared by [optional]: Convai NJU
Model type: Llama model
Language(s) (NLP): en
License: llama3
Finetuned from model [optional]: Llama-3-8B-Instruct

Model Sources

Repository: https://github.com/Yufei-Xiang/Retrospex.git

Training Details

Training Data

AgentInstruct: https://huggingface.co/datasets/THUDM/AgentInstruct

ShareGPT: https://huggingface.co/datasets/anon8231489123/ShareGPT_Vicuna_unfiltered

Training Hyperparameters

fp16: True
lr: 2e-5
batch size: 8
lora r: 16
lora alpha: 64

Downloads last month: 504

Safetensors

Model size

8.03B params

Tensor type

F32

·

Inference Providers NEW

This model is not currently available via any of the supported third-party Inference Providers, and HF Inference API was unable to determine this model's library.

Datasets used to train AronXiang/RetrospexLLaMA3