tasksource/ModernBERT-large-nli

Zero-Shot Classification

text-classification

natural-language-inference

Inference Endpoints

sileod committed 18 days ago

Commit 7df9c7c · verified · 1 Parent(s): ffc6bf7

Update README.md

Files changed (1)

README.md +2 -1

@@ -23,7 +23,8 @@ The model was trained for 200k steps on an Nvidia A30 GPU.
 It is very good at reasoning tasks (better than llama 3.1 8B Instruct on ANLI and FOLIO), long context reasoning, sentiment analysis and zero-shot classification with new labels.
-The following table shows model test accuracy. It is the accuracy of the same single model with different classification heads, further gains can be obtained by fine-tuning on a single-task, e.g. SST, but it this checkpoint is very hard to beat for zero-shot classification, NLI generalization).
+The following table shows model test accuracy. It is the accuracy of the same single transformer with different classification heads on top.
+Further gains can be obtained by fine-tuning on a single-task, e.g. SST, but it this checkpoint is great for zero-shot classification and natural language inference (contradiction/entailment/neutral classification).
 | test_name                             |   test_accuracy |
 |:--------------------------------------|----------------:|