ChessLlama

image/png

Generated by DALL-E 3.

Model Details

This pre-trained model has been trained on the Llama architecture with the games of grand master chess players.

Model Description

How to Get Started with the Model

This notebook is created to test the model's capabilities. You can use it to evaluate performance of the model.

Open In Colab

Challenge

You can use this model or dataset to train your own models as well, and challenge me in this new field.

Training Details

Training Data

Q-bert/Elite-Chess-Games

Training Procedure

This model was fully trained from scratch with random weights. It was created from the ground up with a new configuration and model, and trained using the Hugging Face Trainer for 1200 steps. There is still potential for further training. You can see the training code below. Open In Colab

Training Loss Graph: image/png

Downloads last month
11
Safetensors
Model size
143M params
Tensor type
F32
·
Inference Examples
This model does not have enough activity to be deployed to Inference API (serverless) yet. Increase its social visibility and check back later, or deploy to Inference Endpoints (dedicated) instead.

Dataset used to train Q-bert/ChessLlama