Aurélien-Morgan CLAUDON's picture

Aurélien-Morgan CLAUDON

Aurelien-Morgan

·

https://huggingface.co/retrain-pipelines

AI & ML interests

None yet

Recent Activity

liked a model about 12 hours ago

nomic-ai/modernbert-embed-base

reacted to tomaarsen's post with ❤️ about 12 hours ago

That didn't take long! Nomic AI has finetuned the new ModernBERT-base encoder model into a strong embedding model for search, classification, clustering and more! Details: 🤖 Based on ModernBERT-base with 149M parameters. 📊 Outperforms both nomic-embed-text-v1 and nomic-embed-text-v1.5 on MTEB! 🏎️ Immediate FA2 and unpacking support for super efficient inference. 🪆 Trained with Matryoshka support, i.e. 2 valid output dimensionalities: 768 and 256. ➡️ Maximum sequence length of 8192 tokens! 2️⃣ Trained in 2 stages: unsupervised contrastive data -> high quality labeled datasets. ➕ Integrated in Sentence Transformers, Transformers, LangChain, LlamaIndex, Haystack, etc. 🏛️ Apache 2.0 licensed: fully commercially permissible Try it out here: https://huggingface.co/nomic-ai/modernbert-embed-base Very nice work by Zach Nussbaum and colleagues at Nomic AI.

upvoted a collection about 12 hours ago

View all activity

Articles

Fancy Stateful Metaflow Service + UI on Google Colab ?

Organizations

Aurelien-Morgan's activity

upvoted a collection about 12 hours ago

OLMo 2

Artifacts for the second set of OLMo models. • 20 items • Updated about 14 hours ago • 62

upvoted a paper 13 days ago

Qwen2.5 Technical Report

Paper • 2412.15115 • Published 15 days ago • 334

upvoted a paper 30 days ago

Mooncake: A KVCache-centric Disaggregated Architecture for LLM Serving

Paper • 2407.00079 • Published Jun 24, 2024 • 5

upvoted a paper about 1 month ago

GRAPE: Generalizing Robot Policy via Preference Alignment

Paper • 2411.19309 • Published Nov 28, 2024 • 42

upvoted an article about 1 month ago

Article

Let’s make a generation of amazing image generation models

By

•

Nov 26, 2024

• 34

upvoted a collection about 2 months ago

🚀 Trending Demo

13 items • Updated 10 days ago • 9

upvoted 3 papers about 2 months ago

The Surprising Effectiveness of Test-Time Training for Abstract Reasoning

Paper • 2411.07279 • Published Nov 11, 2024 • 3

OpenCoder: The Open Cookbook for Top-Tier Code Large Language Models

Paper • 2411.04905 • Published Nov 7, 2024 • 111

SALSA: Soup-based Alignment Learning for Stronger Adaptation in RLHF

Paper • 2411.01798 • Published Nov 4, 2024 • 8

upvoted 3 collections 2 months ago

SmolLM2

State-of-the-art compact LLMs for on-device applications: 1.7B, 360M, 135M • 15 items • Updated 12 days ago • 197

📑 Trending Papers - October 🔟

10 items • Updated 10 days ago • 6

MobileLLM

Optimizing Sub-billion Parameter Language Models for On-Device Use Cases (ICML 2024) https://arxiv.org/abs/2402.14905 • 9 items • Updated Nov 27, 2024 • 101

upvoted 3 articles 3 months ago

Article

Fancy Stateful Metaflow Service + UI on Google Colab ?

By

•

Oct 14, 2024

• 4

Article

Fixing Gradient Accumulation

Oct 16, 2024

• 44

Article

Model Card Generator Interface: Crafting Clear Insights into AI Models

By

•

Sep 27, 2024

• 4

upvoted a paper 3 months ago

Pixtral 12B

Paper • 2410.07073 • Published Oct 9, 2024 • 62

upvoted 2 articles 3 months ago

Article

A Short Summary of Chinese AI Global Expansion

Oct 3, 2024

• 20

Article

Llama can now see and run on your device - welcome Llama 3.2

Sep 25, 2024

• 180

upvoted a collection 3 months ago

Llama 3.2

This collection hosts the transformers and original repos of the Llama 3.2 and Llama Guard 3 • 15 items • Updated 29 days ago • 551

upvoted an article 4 months ago

Article

Introducing Community Tools on HuggingChat

Sep 16, 2024

• 34