OpenLLM-France
/

Claire-7B-FR-Instruct-0.1

@@ -6,7 +6,7 @@ language:
 base_model: OpenLLM-France/Claire-7B-0.1
 ---
-# Model Card for Model ID
 <!-- Provide a quick summary of what the model is/does. -->
@@ -16,24 +16,25 @@ base_model: OpenLLM-France/Claire-7B-0.1
 ### Model Description
-This is the instruction-finetuned model based on ([OpenLLM-France/Claire-7B-EN-0.1](https://huggingface.co/OpenLLM-France/Claire-7B-EN-0.1)), using the [Vigogne dataset](https://github.com/bofenghuang/vigogne).
 Note: This is not a chat model. The finetuning was done on instruction-following data, and the model should be used with the template as shown in "How to Get Started with the Model".
-- **Developed by:** OpenLLM-France
 - **Language(s) (NLP):** French
 - **License:** CC-BY-NC-SA 4.0
-- **Finetuned from model: [OpenLLM-France/Claire-7B-EN-0.1](https://huggingface.co/OpenLLM-France/Claire-7B-EN-0.1)
 ### Model Sources
-- **Repository:** [OpenLLM-France/Claire-7B-EN-0.1](https://huggingface.co/OpenLLM-France/Claire-7B-EN-0.1)
 - **Paper:** [Claire: Large Language Models for Spontaneous French Dialogue](https://aclanthology.org/2024.jeptalnrecital-taln.36/)
 ## Uses
-This instruction-finetuned model is designed for tasks requiring detailed responses to user instructions.
-It can be used for generating natural language responses, content creation, answering queries, and other instruction-based tasks.
 ## Bias, Risks, and Limitations
@@ -85,7 +86,7 @@ print(decoded_output[0])
 ### Training Data
-The model was finetuned on the [Vigogne dataset](https://github.com/bofenghuang/vigogne), which is a translation of the [Alpaca dataset](https://huggingface.co/datasets/tatsu-lab/alpaca).
 ### Training Procedure
@@ -104,45 +105,3 @@ lora_task_type: CAUSAL_LM
 num_train_epochs: 1
 ```
-## Evaluation
-<!-- This section describes the evaluation protocols and provides the results. -->
-### Results
-#### Summary
-## Citation [optional]
-<!-- If there is a paper or blog post introducing the model, the APA and Bibtex information for that should go in this section. -->
-**BibTeX:**
-[More Information Needed]
-**APA:**
-[More Information Needed]
-## Glossary [optional]
-<!-- If relevant, include terms and calculations in this section that can help readers understand the model or model card. -->
-[More Information Needed]
-## More Information [optional]
-[More Information Needed]
-## Model Card Authors [optional]
-[More Information Needed]
-## Model Card Contact
-[More Information Needed]

 base_model: OpenLLM-France/Claire-7B-0.1
 ---
+# Model Card for Claire-7B-FR-Instruct
 <!-- Provide a quick summary of what the model is/does. -->
 ### Model Description
+This is the instruction-finetuned model based on [OpenLLM-France/Claire-7B-0.1](https://huggingface.co/OpenLLM-France/Claire-7B-0.1), using the [Vigogne dataset](https://github.com/bofenghuang/vigogne).
 Note: This is not a chat model. The finetuning was done on instruction-following data, and the model should be used with the template as shown in "How to Get Started with the Model".
+- **Developed by:** LINAGORA with the support of OpenLLM-France
 - **Language(s) (NLP):** French
 - **License:** CC-BY-NC-SA 4.0
+- **Finetuned from model: [OpenLLM-France/Claire-7B-0.1](https://huggingface.co/OpenLLM-France/Claire-7B-0.1)
 ### Model Sources
+- **Repository:** [OpenLLM-France/Claire-7B-0.1](https://huggingface.co/OpenLLM-France/Claire-7B-EN-0.1)
 - **Paper:** [Claire: Large Language Models for Spontaneous French Dialogue](https://aclanthology.org/2024.jeptalnrecital-taln.36/)
 ## Uses
+The base model, [Claire-7B-0.1](https://huggingface.co/OpenLLM-France/Claire-7B-0.1), results from continuing the pretraining of [Falcon-7B](https://huggingface.co/tiiuae/falcon-7b) on French conversation transcripts and theater plays. The idea was to attune the base model to features of spontaneous conversation so that it could be more efficiently fine-tuned for downstream tasks requiring understanding of spoken conversation.
+This instruction-finetuned model serves as a first level of fine-tuning for such tasks. It is designed to provide detailed responses to user instructions. It can be used for generating natural language responses, content creation, answering queries, and other instruction-based tasks.
 ## Bias, Risks, and Limitations
 ### Training Data
+The model was finetuned on the [Vigogne dataset](https://github.com/bofenghuang/vigogne), which is a cleaned version of the [Alpaca dataset](https://huggingface.co/datasets/tatsu-lab/alpaca), translated by `gpt-3.5-turbo`.
 ### Training Procedure
 num_train_epochs: 1
 ```