blog-explorers (Blog-explorers)

wolfram

in blog-explorers/README about 15 hours ago

[Support] Community Articles

67

#5 opened 10 months ago by

victor

nroggendorff

posted an update 2 days ago

Post

4246

hey nvidia, can you send me a gpu?
comment or react if you want ~~me~~ to get one too. 👉👈

22 replies

·

roseking

posted an update 3 days ago

Post

2435

🤗 Hugging Face Download Tool

The Hugging Face Download Tool is a sophisticated graphical user interface application designed to simplify the process of downloading resources from Hugging Face repositories. This tool addresses common challenges in model and file downloads through its intelligent features and user-friendly interface.

✨ Key Features
- 🖥️ Intuitive graphical interface for easy operation
- 🔄 Advanced retry mechanism with smart error handling
- ⏸️ Resume capability for interrupted downloads
- 📊 Real-time download status monitoring
- 🔐 Secure access to private repositories via token authentication

🛠️ Technical Highlights
The tool implements several advanced features to ensure reliable downloads:
- 📦 Chunk-based downloading with 1MB segments
- ⚡ Adaptive retry intervals (5-300 seconds) based on error types
- 🔌 Connection pooling for optimized performance
- 🛡️ Built-in rate limiting protection
- 🔑 Secure token handling for private repository access

This tool is ideal for researchers, developers, and AI practitioners who regularly work with Hugging Face resources and need a reliable, user-friendly download solution. 💻 It supports all major operating systems and requires minimal setup, making it accessible to users of all technical levels. 🚀
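
The chunking and retry behavior listed under Technical Highlights can be illustrated with a minimal sketch. This is not the tool's actual code: the function names, the error categories, and the base delays in `BASE_DELAY` are hypothetical, and exponential backoff is assumed as one plausible way to get "adaptive" intervals. Only the 1 MB chunk size and the 5-300 second bounds come from the post.

```python
CHUNK_SIZE = 1024 * 1024       # 1 MB segments, as stated in the post
MIN_DELAY, MAX_DELAY = 5, 300  # retry interval bounds in seconds, from the post

# Hypothetical base delays per error category; the tool's real mapping is not given.
BASE_DELAY = {"timeout": 5, "connection": 10, "rate_limit": 60}


def chunk_ranges(total_size, resume_from=0, chunk_size=CHUNK_SIZE):
    """Yield inclusive (start, end) byte ranges that still need to be fetched."""
    start = resume_from
    while start < total_size:
        end = min(start + chunk_size, total_size) - 1
        yield start, end
        start = end + 1


def retry_delay(error_kind, attempt):
    """Exponential backoff from a per-error base, clamped to [MIN_DELAY, MAX_DELAY]."""
    base = BASE_DELAY.get(error_kind, MIN_DELAY)
    delay = base * (2 ** attempt)
    return max(MIN_DELAY, min(MAX_DELAY, delay))
```

Each `(start, end)` pair corresponds to an HTTP `Range: bytes=start-end` request, so resuming an interrupted download amounts to calling `chunk_ranges` with `resume_from` set to the number of bytes already on disk.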

GitHub: https://github.com/2404589803/hf_downloader

2 replies

·

tomaarsen

posted an update 3 days ago

Post

2344

That didn't take long! Nomic AI has finetuned the new ModernBERT-base encoder model into a strong embedding model for search, classification, clustering and more!

Details:
🤖 Based on ModernBERT-base with 149M parameters.
📊 Outperforms both nomic-embed-text-v1 and nomic-embed-text-v1.5 on MTEB!
🏎️ Immediate FA2 and unpacking support for super efficient inference.
🪆 Trained with Matryoshka support, i.e. 2 valid output dimensionalities: 768 and 256.
➡️ Maximum sequence length of 8192 tokens!
2️⃣ Trained in 2 stages: unsupervised contrastive data -> high quality labeled datasets.
➕ Integrated in Sentence Transformers, Transformers, LangChain, LlamaIndex, Haystack, etc.
🏛️ Apache 2.0 licensed: fully commercially permissible

Try it out here: nomic-ai/modernbert-embed-base
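
Matryoshka support means the 768-dimensional embedding can be shortened to its first 256 components and still be used for similarity search. A minimal, library-free sketch of that truncation step is below; the function name is hypothetical, and re-normalizing after truncation is an assumption that holds when embeddings are compared by cosine similarity. The model card may prescribe additional steps.

```python
import math


def truncate_embedding(vector, dim=256):
    """Keep the first `dim` components and L2-normalize the result."""
    if dim > len(vector):
        raise ValueError("dim must not exceed the embedding length")
    head = list(vector[:dim])
    norm = math.sqrt(sum(x * x for x in head))
    if norm == 0.0:
        # An all-zero prefix cannot be normalized; return it unchanged.
        return head
    return [x / norm for x in head]
```

In Sentence Transformers, passing `truncate_dim=256` when constructing `SentenceTransformer` should give the shortened embeddings directly.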

Very nice work by Zach Nussbaum and colleagues at Nomic AI.

AdinaY

posted an update 9 days ago

Post

3472

The Chinese community is shipping 🚢

DeepSeek V3 (685B MoE) has been quietly released on the Hub!
Base: deepseek-ai/DeepSeek-V3-Base
Instruct: deepseek-ai/DeepSeek-V3

Can’t wait to see what’s next!

1 reply

·

AdinaY

posted an update 10 days ago

Post

2883

QvQ-72B-Preview 🎄, an open-weight model for visual reasoning, was just released by the Alibaba Qwen team
Qwen/qvq-676448c820912236342b9888
✨ Combines visual understanding & language reasoning.
✨ Scores 70.3 on MMMU
✨ Outperforms Qwen2-VL-72B-Instruct in complex problem-solving

nroggendorff

posted an update 12 days ago

Post

3542

im so tired

3 replies

·

nroggendorff

posted an update 14 days ago

Post

843

Has anyone else noticed that ZeroGPU quota is per space, not per user as of a few weeks ago?

2 replies

·

Abhaykoul

posted an update 14 days ago

Post

1616

🔥 BIG ANNOUNCEMENT: THE HELPINGAI API IS LIVE! 🔥

Yo, the moment you’ve all been waiting for is here! 🚀 The HelpingAI API is now LIVE and ready to level up your projects! 🔥 We’re bringing that next-level AI goodness straight to your fingertips. 💯

No more waiting— it’s time to build something epic! 🙌

From now on, you can integrate our cutting-edge AI models into your own applications, workflows, and everything in between. Whether you’re a developer, a creator, or just someone looking to make some serious moves, this is your chance to unlock the full potential of emotional intelligence and adaptive AI.

Check out the docs 🔥 and let’s get to work! 🚀

👉 Check out the docs and start building (https://helpingai.co/docs)
👉 Visit the HelpingAI website (https://helpingai.co/)

6 replies

·

tomaarsen

authored a paper 15 days ago

Smarter, Better, Faster, Longer: A Modern Bidirectional Encoder for Fast, Memory Efficient, and Long Context Finetuning and Inference

Paper • 2412.13663 • Published 17 days ago • 116

nroggendorff

posted an update 16 days ago

Post

1306

Can we please do something about this? It makes everything I do so much harder, and because my local machine is so terrible, I am forced to test in production. This makes debugging so difficult.
nroggendorff/system-exit

cc @victor

1 reply

·

k-l-lambda

posted an update 17 days ago

Post

356

Lotus, my LilyPond editor, is now online. I hope HF Spaces can host it stably. Have fun!
k-l-lambda/lotus

celinah

posted an update 18 days ago

Post

596

🚀 We've just dropped a new release, v0.27.0, of the huggingface_hub Python library!

This release includes:
- 💾 New torch model loading utilities in the serialization module — providing a standardized way to save and load torch models with built-in support for sharding and safe serialization.
- 📦 Tooling for something exciting — if you like single-file formats for models like GGUF, you'll love what we're cooking up 👀 More coming soon!
- 🛠️ Loads of quality-of-life improvements and bug fixes!

Release notes and full details here 👇
Wauplin/huggingface_hub#10

$ pip install -U huggingface_hub

AdinaY

posted an update 19 days ago

Post

535

Megrez-3B-Omni 🔥 an on-device multimodal LLM by Infinigence AI, another startup emerging from the Tsinghua University ecosystem.
Model: Infinigence/Megrez-3B-Omni
Demo: Infinigence/Megrez-3B-Omni
✨Supports analysis of image, text, and audio modalities
✨Leads in bilingual speech (English & Chinese) input, multi-turn conversations, and voice-based queries
✨Outperforms in scene understanding and OCR across major benchmarks

nroggendorff

posted an update 20 days ago

Post

2335

I hate it when I'm debugging a space and I run out of ZeroGPU quota.

Also, I'm totally not biased btw, but I think inference quota should scale with contributions too.

3 replies

·

julien-c

posted an update 24 days ago

Post

7922

After some heated discussion 🔥, we clarify our intent re. storage limits on the Hub

TL;DR:
- public storage is free, and (unless blatant abuse) unlimited. We do ask that you consider upgrading to PRO and/or Enterprise Hub if possible
- private storage is paid above a significant free tier (1TB if you have a paid account, 100GB otherwise)

docs: https://huggingface.co/docs/hub/storage-limits

We optimize our infrastructure continuously to scale our storage for the coming years of growth in machine learning, to the benefit of the community 🔥

cc: @reach-vb @pierric @victor and the HF team

28 replies

·

AdinaY

posted an update 26 days ago

Post

881

Updates from the Chinese community last week 🔥

LLM:
✨ Sailor 2, a multilingual model supporting 10+ Southeast Asian languages, by Sea AI Lab. https://huggingface.co/sailor2

MLLM:
✨InternVL 2.5, a new open multimodal LLM by OpenGVLab
https://huggingface.co/collections/OpenGVLab/internvl-25-673e1019b66e2218f68d7c1c
✨Qwen2-VL 2B/7B/72B base models, the latest iteration of the Qwen-VL model, by Alibaba Qwen
Qwen/qwen2-vl-66cee7455501d7126940800d

Video model:
✨HunyuanVideo, a 13B open video model by Tencent
tencent/HunyuanVideo

Reasoning model:
✨ LLaMA-O1 🦙 base & supervised model; pretrain & finetune datasets and demo all released
zh-ai-community/reasoning-models-67409fb3aa1ed78f10087cd7

Audio model:
✨Fish Speech 1.5, text-to-speech in 13 languages, trained on 1M+ hours of audio, by FishAudio
fishaudio/fish-speech-1.5
✨ClearVoice, an advanced voice processing framework by Alibaba Tongyi SpeechAI https://huggingface.co/alibabasglab

More details 👉 https://huggingface.co/zh-ai-community

BramVanroy

posted an update 29 days ago

Post

460

In the spirit of "Better late than never", I've finally written a brief overview paper for GEITje 7B Ultra. Initially released 10 months ago (oops), but still reaching around 1300 monthly downloads across the HF ecosystem (not including ollama).

GEITje 7B Ultra: A Conversational Model for Dutch (2412.04092)

While the paper discusses the model a little bit, I especially wanted to write about the datasets, which to this day seem an important asset for Dutch LLM training (SFT and preference tuning). We have a long way to go for Dutch, but publishing transparent and reproducible artefacts seems an important step to me, alongside having open discussions about data, bias, architectures.

In that spirit, thanks are in order for the creation of GEITje 7B Ultra and all related datasets:

- Michiel Buisman and UWV for providing the means to create the datasets
- Flemish Supercomputer Center (VSC) for the compute
- The Hugging Face Fellows and rest of the team for their discussions and insights
- The Dutch NLP community, notably @Rijgersberg for building the base GEITje model and the fruitful discussions we've had

More to come, step by step!

BramVanroy/geitje-7b-ultra-65c1ee010ad80fd1f6a8f208

nroggendorff

posted an update about 1 month ago

Post

1952

uhh..

8 replies

·

AdinaY

posted an update about 1 month ago

Post

1578

Sailor 2 🚢 open multilingual model for Southeast Asia by Sea AI Lab🔥
https://huggingface.co/sailor2
sail/Sailor2-20B-Chat

✨ Fully open code & ALL datasets 🙌
✨ 1B/8B/20B base & chat models, expanded from Qwen2.5
✨ Apache 2.0
✨ Supports 15 languages including English, Chinese, Burmese, Cebuano, Ilocano, Indonesian, Javanese, Khmer, Lao, Malay, Sundanese, Tagalog, Thai, Vietnamese, and Waray🇬🇧🇨🇳🇱🇦🇲🇾🇲🇲🇻🇳🇹🇭

Blog-explorers

AI & ML interests

Recent Activity

blog-explorers's activity

[Support] Community Articles

Smarter, Better, Faster, Longer: A Modern Bidirectional Encoder for Fast, Memory Efficient, and Long Context Finetuning and Inference

Team members 670
