AmelieSchreiber
/

esm2_t12_35M_lora_binding_sites_v2_cp3

Token Classification

protein language model

Model card Files Files and versions Community

AmelieSchreiber commited on Sep 15, 2023

Commit

3db1969

·

1 Parent(s): 1e414d1

Update README.md

Files changed (1) hide show

README.md +9 -0

README.md CHANGED Viewed

@@ -43,6 +43,15 @@ dataset [found here](https://huggingface.co/datasets/AmelieSchreiber/binding_sit
 this model has a high recall, meaning it is likely to detect binding sites, but it has a precision score that is somewhat lower than the SOTA
 structural models mentioned above, meaning the model may return some false positives as well.
 ## Training procedure
 This model was finetuned with LoRA on ~549K protein sequences from the UniProt database. The dataset can be found

 this model has a high recall, meaning it is likely to detect binding sites, but it has a precision score that is somewhat lower than the SOTA
 structural models mentioned above, meaning the model may return some false positives as well.
+## Running Inference
+You can download and run [this notebook](https://huggingface.co/AmelieSchreiber/esm2_t12_35M_lora_binding_sites_v2_cp3/blob/main/testing_and_inference.ipynb)
+to test out any of the ESMB models. Note, if you would like to run the models on the train/test split to get the metrics, you may need to do
+locally or in a Colab Pro instance as the datasets are quite large and will not run in a standard Colab (you can still run inference on your own
+protein sequences though).
 ## Training procedure
 This model was finetuned with LoRA on ~549K protein sequences from the UniProt database. The dataset can be found