monsterapi
/

Falcon_180B_dolly15k

tiiuae-falcon-180B

databricks-dolly-15k

Model card Files Files and versions Community

Falcon_180B_dolly15k / README.md

Zangs3011's picture

Update README.md

1eba9b4 over 1 year ago

|

1.97 kB

	---
	library_name: peft
	tags:
	- tiiuae-falcon-180B
	- code
	- instruct
	- databricks-dolly-15k
	- falcon-180B
	datasets:
	- databricks/databricks-dolly-15k
	base_model: tiiuae/falcon-180B
	---

	### Finetuning Overview:

	Model Used: tiiuae/falcon-180B
	Dataset: Databricks-dolly-15k

	#### Dataset Insights:

	The Databricks-dolly-15k dataset represents a substantial collection of over 15,000 records, curated through the dedicated and collective efforts of numerous Databricks professionals. It's meticulously designed to:

	- Enhance the magical interactivity of ChatGPT-like models.
	- Offer prompt/response pairs across eight different instruction categories, comprising the seven categories from the InstructGPT paper and an added open-ended category.
	- Ensure authenticity with restrictions against online sourcing (with the exception of Wikipedia for some categories) and the use of generative AI in crafting content.

	During the dataset's creation, contributors responded to peer questions. A focus was placed on rephrasing the original queries and emphasizing accurate responses. Furthermore, certain data subsets incorporate Wikipedia references, identifiable by bracketed citation numbers like [42].
	#### Finetuning Details:

	Our finetuning harnessed the capabilities of [MonsterAPI](https://monsterapi.ai)'s no-code [LLM finetuner](https://docs.monsterapi.ai/fine-tune-a-large-language-model-llm):

	- Duration: The session spanned 41.7 hours.
	- Cost: The entire process cost `$184.314`.
	- Hardware Utilized: 2x A100 80GB GPUs.

	#### Hyperparameters & Additional Details:

	- Model Path: tiiuae/falcon-180B
	- Learning Rate: 0.0002
	- Epochs: 1
	- Data Split: Training 90% / Validation 10%
	- Gradient Accumulation Steps: 1

	---

	### Prompt Used:

	```
	### INSTRUCTION:
	[instruction]

	[context]

	### RESPONSE:
	[response]
	```

	Loss metrics

	Training loss:
	![training loss](train-loss.png "Training loss")



	---

	license: apache-2.0