sentence-transformers/all-MiniLM-L6-v2 · Semantic similarity between two texts !

May 14, 2024

Hello, I have created a small PHP script that utilizes the API with the model /static-proxy?url=https%3A%2F%2Fapi-inference.huggingface.co%2Fmodels%2Fsentence-transformers%2Fall-MiniLM-L6-v2%3C%2Fa%3E%2C designed to evaluate the percentage of semantic similarity between two French texts. However, the results I often receive are around 50%, which doesn't seem very relevant, especially when the texts express the same idea with different words. Typically, I would expect scores between 75 and 100%. Do you think it would be better to use a different model, or are there adjustments that could be made to this model to improve the results?

tomaarsen

Sentence Transformers org May 14, 2024

Hello!

This model has been trained specifically on English texts, so I think you will get better performance with a model trained for multilinguality or for French in specific, such as:

(Note: I used https://huggingface.co/spaces/mteb/leaderboard?task=sts&language=french and filtered away the larger models)

Tom Aarsen

Systeme

May 14, 2024

Hello Tom,
Thank you, Tom, for your response. I have chosen these two models for their good efficiency.

/static-proxy?url=https%3A%2F%2Fapi-inference.huggingface.co%2Fmodels%2Fsentence-transformers%2Fparaphrase-multilingual-MiniLM-L12-v2%3C%2Fa%3E%3Cbr%3E%3Ca rel="nofollow" href="/static-proxy?url=https%3A%2F%2Fapi-inference.huggingface.co%2Fmodels%2Fintfloat%2Fmultilingual-e5-small">/static-proxy?url=https%3A%2F%2Fapi-inference.huggingface.co%2Fmodels%2Fintfloat%2Fmultilingual-e5-small%3C%2Fa%3E%3C%2Fp%3E

$text1 = "Au dernier trimestre, il y eu de nouvelles technologies.";
$text2 = "Au cours des trois derniers mois, des technologies innovantes sont apparues.";

When applying the same text to these models, I still observe differences in the results obtained.
The intfloat/multilingual-e5-small model displays a similarity score of 94%, while the sentence-transformers/paraphrase-multilingual-MiniLM-L12-v2 model reaches a score of 78%.

Unfortunately, the precision of interpretation remains inferior to that achieved by human analysis.
Thanks again for your help.
Marc