Fine-tuning LLaMA2 model using AutoTrain Advanced

Gloria79 · February 15, 2024, 8:47am

Hello, I am trying to fine-tune the Llama 2 7B model on my own dataset.
I am stuck on setting --data-path correctly. Here are my attempts:

1st command:
autotrain llm --train --data-path ./data --text-column text --peft --auto_find_batch_size --epochs 3 --trainer sft --model meta-llama/Llama-2-7b-hf --project-name ftllama2

=> Somehow, the data_path got changed to “ftllama2/autotrain-data”

2nd command:
autotrain llm --train --data-path ./data/train.csv --text-column text --peft --auto_find_batch_size --epochs 3 --trainer sft --model meta-llama/Llama-2-7b-hf --project-name ftllama2

=> The data_path is correct, but I am getting this error:
ERROR | 2024-02-15 08:44:06 | autotrain.trainers.common:wrapper:92 - Couldn’t find a dataset script at /home/ubuntu/workspace/git/language-agnostic-embedding/data/train.csv/train.csv.py or any data file in the same directory.

Any comments are welcome.

maneln · May 5, 2024, 2:14pm

Hello, I ran into this error before. You need to upload the dataset to your Hugging Face account and then pass the name of that dataset in the data-path parameter. For instance, it will look like this: your-user-name/the-name-of-your-created-dataset.
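A minimal sketch of that upload step, in case it is useful. It assumes the `datasets` library is installed and that you are already logged in (for example via `huggingface-cli login`). The helper names `check_text_column` and `push_csv_to_hub` are made up for this example, and the repo id is a placeholder:

```python
import csv
from pathlib import Path


def check_text_column(csv_path, text_column="text"):
    """Confirm the CSV has the expected column and return its number of data rows."""
    with Path(csv_path).open(newline="", encoding="utf-8") as f:
        reader = csv.DictReader(f)
        if text_column not in (reader.fieldnames or []):
            raise ValueError(f"{csv_path} has no '{text_column}' column")
        return sum(1 for _ in reader)


def push_csv_to_hub(csv_path, repo_id, text_column="text"):
    """Upload a local CSV as a dataset on the Hub (hypothetical helper).

    Assumes `pip install datasets` and a valid login token on this machine.
    """
    # Imported here so the local check above works without `datasets` installed.
    from datasets import load_dataset

    check_text_column(csv_path, text_column)
    # Load the CSV as a dataset with a single "train" split.
    dataset = load_dataset("csv", data_files={"train": str(csv_path)})
    # Pushes to the dataset repo named by repo_id under your account.
    dataset.push_to_hub(repo_id)


if __name__ == "__main__":
    # Placeholder repo id: replace with your own user name and dataset name.
    push_csv_to_hub("./data/train.csv", "your-user-name/the-name-of-your-created-dataset")
```

After the upload, the same repo id is what goes into --data-path in the autotrain command.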
I hope this helps!
