s3nh's picture

s3nh

s3nh

AI & ML interests

Quantization, LLMs, Deep Learning for good. Follow me if you like my work. Patreon.com/s3nh

Recent Activity

Organizations

ESPnet's profile picture Gradio-Blocks-Party's profile picture Lajonbot's profile picture The Waifu Research Department's profile picture AblateIt's profile picture Blog-explorers's profile picture BangumiBase's profile picture CyberHarem's profile picture HydraLM's profile picture GOAT.AI's profile picture That Time I got Reincarnated as a Hugging Face Organization's profile picture ZeroGPU Explorers's profile picture Social Post Explorers's profile picture Spinner-GPT-4's profile picture Hugging Face Discord Community's profile picture open/ acc's profile picture Smol Community's profile picture

s3nh's activity

reacted to sayakpaul's post with 🔥 9 days ago
updated a Space 10 days ago
New activity in SmolTuners/README 10 days ago

Gh organization

6
#3 opened 11 days ago by
s3nh
New activity in SmolTuners/README 11 days ago

Optimizers

#2 opened 11 days ago by
s3nh
New activity in SmolTuners/README 13 days ago

Datasets

3
#1 opened 15 days ago by
s3nh
reacted to merve's post with 🧠 15 days ago
view post
Post
1739
A complete RAG pipeline includes a reranker, which ranks the documents to find the best document 📓
Same goes for multimodal RAG, multimodal rerankers which we can integrate to multimodal RAG pipelines!
Learn how to build a complete multimodal RAG pipeline with vidore/colqwen2-v1.0 as retriever, lightonai/MonoQwen2-VL-v0.1 as reranker, Qwen/Qwen2-VL-7B-Instruct as VLM in this notebook that runs on a GPU as small as L4 🔥 https://huggingface.co/learn/cookbook/multimodal_rag_using_document_retrieval_and_reranker_and_vlms
reacted to fdaudens's post with 🤗 15 days ago
view post
Post
1207
🤝 Want to share your AI models while protecting your work? Licenses are key!

Fascinating to see that nearly 60% of models on the Hub use Apache & MIT licenses.

Explore the viz here: huggingface/open-source-ai-year-in-review-2024
reacted to Lewdiculous's post with 15 days ago
reacted to fdaudens's post with 👍 15 days ago
view post
Post
1244
🔍 From instruction-following to creative storytelling, dive into 2024's most impactful AI datasets! These gems are shaping everything from scientific research to video understanding.

Check it out: huggingface/open-source-ai-year-in-review-2024
replied to louisbrulenaudet's post 15 days ago
reacted to louisbrulenaudet's post with 🤗 15 days ago
view post
Post
1785
I’ve published a new dataset to simplify model merging 🤗

This dataset facilitates the search for compatible architectures for model merging with @arcee_ai’s mergekit, streamlining the automation of high-performance merge searches 📖

Dataset : louisbrulenaudet/mergekit-configs
  • 1 reply
·
reacted to nyuuzyou's post with 👍 15 days ago
view post
Post
1509
✈️ Aircraft Dataset & Generation Model nyuuzyou/aircraft-images & nyuuzyou/AircraftFLUX-LoRA

Dataset Features:
• 165,340 high-res aircraft images with metadata
• Machine-generated English captions
• Detailed aircraft specs, registration & flight info
• Environmental context descriptions

LoRA model specializes in:
• Realistic aircraft generation
• Accurate technical details for unpopular airplanes compared to black-forest-labs/FLUX.1-schnell
• Proper airline liveries
• Contextual aviation scenes
replied to danielhanchen's post 15 days ago
reacted to danielhanchen's post with 🤗👍 15 days ago
reacted to stefan-it's post with ❤️ 15 days ago
view post
Post
1183
My latest project is the outcome of the last 2+ years working with TPUs from the amazing TPU Research Cloud (TRC) program and training Encoder-only LMs with the TensorFlow Model Garden library.

👉 Link: https://github.com/stefan-it/model-garden-lms

An overview of some features:

- Cheatsheet for setting-up a TPU VM Pod (with all necessary dependencies) to pretrain LMs with TF Model Garden
- Conversion scripts that convert TF Model Garden weights to Hugging Face Transformers-compatible models
- Supported architectures include BERT, BERT with Token Dropping and TEAMS

I also released BERT-based models pretrained on the great Hugging Face FineWeb and FineWeb-Edu datasets (10BT subset). With more to come!

👉 Model Hub Link: https://huggingface.co/model-garden-lms

If you find these resources useful, please give them a like!

Made from Bavarian Oberland with ❤️ and 🥨.