SmolLM CPT
Collection
Continued Pre-Training of SmolLM models on the Fineweb-2 portions of Scandinavian languages.
•
6 items
•
Updated
This is a SmolLM2-135M-Instruct model fine-tuned first on the Icelandic and then on the Faroese portion of Fineweb-2. It is intended for my research and has not been evaluated more broadly yet.
Training:
Base model
HuggingFaceTB/SmolLM2-135M