AmelieSchreiber commited on
Commit
b3cf64d
·
1 Parent(s): 5c28821

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +0 -2
README.md CHANGED
@@ -30,8 +30,6 @@ This model is trained to predict general binding sites of proteins using on the
30
  `esm2_t6_8M_UR50D`, trained on [this dataset](https://huggingface.co/datasets/AmelieSchreiber/general_binding_sites). The data is
31
  not filtered by family, and thus the model may be overfit to some degree. In the Hugging Face Inference API widget to the right
32
  there are three protein sequence examples. The first is a DNA binding protein ([see UniProt entry here](https://www.uniprot.org/uniprotkb/D3ZG52/entry)).
33
- Note there is nontrivial (GMPGTGK) overlap in the predicted binding sites and the binding sites given in UniProt. Note also that
34
- some of the extraneous predictions are near misses and are very close to the binding sites given in UniProt.
35
 
36
  The second and third were obtained using [EvoProtGrad](https://github.com/Amelie-Schreiber/sampling_protein_language_models/blob/main/EvoProtGrad_copy.ipynb)
37
  a Markov Chain Monte Carlo method of (in silico) directed evolution of proteins based on a form of Gibbs sampling. The mutatant-type
 
30
  `esm2_t6_8M_UR50D`, trained on [this dataset](https://huggingface.co/datasets/AmelieSchreiber/general_binding_sites). The data is
31
  not filtered by family, and thus the model may be overfit to some degree. In the Hugging Face Inference API widget to the right
32
  there are three protein sequence examples. The first is a DNA binding protein ([see UniProt entry here](https://www.uniprot.org/uniprotkb/D3ZG52/entry)).
 
 
33
 
34
  The second and third were obtained using [EvoProtGrad](https://github.com/Amelie-Schreiber/sampling_protein_language_models/blob/main/EvoProtGrad_copy.ipynb)
35
  a Markov Chain Monte Carlo method of (in silico) directed evolution of proteins based on a form of Gibbs sampling. The mutatant-type