Pedro Cuenca

pcuenq

AI & ML interests

None yet

Recent Activity

liked a Space about 17 hours ago

reach-vb/2024-ai-timeline

liked a dataset 2 days ago

code-search-net/code_search_net

liked a dataset 2 days ago

google/code_x_glue_tc_nl_code_search_adv

Articles

Powerful ASR + diarization + speculative decoding with Hugging Face Inference Endpoints

May 1, 2024

• 69

Welcome Llama 3 - Meta's new open LLM

Apr 18, 2024

• 281

CodeGemma - an official Google release for code LLMs

Apr 9, 2024

• 99

Welcome Gemma - Google's new open LLM

Feb 21, 2024

• 21

Welcome Mixtral - a SOTA Mixture of Experts on Hugging Face

Dec 11, 2023

• 11

SDXL in 4 steps with Latent Consistency LoRAs

Nov 9, 2023

• 11

Accelerating Stable Diffusion XL Inference with JAX on Cloud TPU v5e

Oct 3, 2023

• 5

Introducing Würstchen: Fast Diffusion for Image Generation

Sep 13, 2023

• 13

Spread Your Wings: Falcon 180B is here

Sep 6, 2023

• 4

Code Llama: Llama 2 learns to code

Aug 25, 2023

• 9

Releasing Swift Transformers: Run On-Device LLMs in Apple Devices

Aug 8, 2023

• 27

Stable Diffusion XL on Mac with Advanced Core ML Quantization

Jul 27, 2023

• 5

Happy 1st anniversary 🤗 Diffusers!

Jul 20, 2023

• 1

Llama 2 is here - get it on Hugging Face

Jul 18, 2023

• 23

Faster Stable Diffusion with Core ML on iPhone, iPad, and Mac

Jun 15, 2023

• 4

The Falcon has landed in the Hugging Face ecosystem

Jun 5, 2023

• 10

Train your ControlNet with diffusers

Mar 24, 2023

• 18

Swift Diffusers: Fast Stable Diffusion for Mac

Feb 24, 2023

• 4

Using LoRA for Efficient Stable Diffusion Fine-Tuning

Jan 26, 2023

• 42

Using Stable Diffusion with Core ML on Apple Silicon

Dec 1, 2022

• 6

Hugging Face Machine Learning Demos on arXiv

Nov 17, 2022

Training Stable Diffusion with Dreambooth using 🧨 Diffusers

Nov 7, 2022

• 16

Stable Diffusion in JAX/Flax 🚀

Oct 13, 2022

• 2

Stable Diffusion with 🧨 Diffusers

Aug 22, 2022

• 42

Organizations

pcuenq's activity

liked a Space about 17 hours ago

reach-vb/2024-ai-timeline

code-search-net/code_search_net

Updated Jan 18, 2024 • 4.85k • 271

google/code_x_glue_tc_nl_code_search_adv

Viewer • Updated Jan 24, 2024 • 281k • 173 • 9

bigcode/the-stack-v2-dedup

Viewer • Updated Apr 23, 2024 • 2.3B • 2.6k • 71

liked 2 models 4 days ago

Qwen/QVQ-72B-Preview

Image-Text-to-Text • Updated 7 days ago • 41.1k • 419

Qwen/QwQ-32B-Preview

Text Generation • Updated Nov 29, 2024 • 115k • 1.47k

upvoted a collection 4 days ago

Llama 3.3

Collection

This collection hosts the transformers and original repos of the Llama 3.3 • 1 item • Updated 26 days ago • 99

reacted to reach-vb's post with 🚀🔥 4 days ago

Post

3369

VLMs are going through quite an open revolution AND on-device friendly sizes:

1. Google DeepMind w/ PaliGemma2 - 3B, 10B & 28B: google/paligemma-2-release-67500e1e1dbfdd4dee27ba48

2. OpenGVLabs w/ InternVL 2.5 - 1B, 2B, 4B, 8B, 26B, 38B & 78B: https://huggingface.co/collections/OpenGVLab/internvl-25-673e1019b66e2218f68d7c1c

3. Qwen w/ Qwen 2 VL - 2B, 7B & 72B: Qwen/qwen2-vl-66cee7455501d7126940800d

4. Microsoft w/ FlorenceVL - 3B & 8B: https://huggingface.co/jiuhai

5. Moondream2 w/ 0.5B: https://huggingface.co/vikhyatk/

What a time to be alive! 🔥

reacted to thomwolf's post with 🤗🔥🚀 4 days ago

Post

4448

We are proud to announce HuggingFaceFW/fineweb-2: A sparkling update to HuggingFaceFW/fineweb with 1000s of 🗣️ languages.

We applied the same data-driven approach that led to SOTA English performance in 🍷 FineWeb to thousands of languages.

🥂 FineWeb2 has 8TB of compressed text data and outperforms other multilingual datasets in our experiments.

The dataset is released under the permissive 📜 ODC-By 1.0 license, and the 💻 code to reproduce it and our evaluations is public.

We will very soon announce a big community project, and are working on a 📝 blogpost walking you through the entire dataset creation process. Stay tuned!

In the meantime, come ask us questions on our chat place: HuggingFaceFW/discussion

H/t @guipenedo @hynky @lvwerra as well as @vsabolcec Bettina Messmer @negar-foroutan and @mjaggi

2 replies

upvoted 2 papers 4 days ago

ESPnet-EZ: Python-only ESPnet for Easy Fine-tuning and Integration

Paper • 2409.09506 • Published Sep 14, 2024 • 4

Lina-Speech: Gated Linear Attention is a Fast and Parameter-Efficient Learner for text-to-speech synthesis

Paper • 2410.23320 • Published Oct 30, 2024 • 8

upvoted a collection 7 days ago

QVQ

Collection

QVQ: Qwen models for visual reasoning • 7 items • Updated 13 minutes ago • 31

liked a model 8 days ago

Qwen/Qwen2.5-1.5B-Instruct

Text Generation • Updated Sep 25, 2024 • 345k • 260

reacted to julien-c's post with 🤗❤️🔥 9 days ago

Post

7828

After some heated discussion 🔥, we clarify our intent re. storage limits on the Hub

TL;DR:
- public storage is free, and (unless blatant abuse) unlimited. We do ask that you consider upgrading to PRO and/or Enterprise Hub if possible
- private storage is paid above a significant free tier (1TB if you have a paid account, 100GB otherwise)

docs: https://huggingface.co/docs/hub/storage-limits

We optimize our infrastructure continuously to scale our storage for the coming years of growth in Machine learning, to the benefit of the community 🔥

cc: @reach-vb @pierric @victor and the HF team

28 replies

upvoted a collection 12 days ago

GTE models

Collection

General Text Embedding Models Released by Tongyi Lab of Alibaba Group • 19 items • Updated 11 days ago • 18

Articles

Welcome PaliGemma 2 – New vision language models by Google

SmolVLM - small yet mighty Vision Language Model

Faster Text Generation with Self-Speculative Decoding

Fixing Gradient Accumulation

Llama can now see and run on your device - welcome Llama 3.2

FineVideo: behind the scenes

Fine-tuning LLMs to 1.58bit: extreme quantization made easy

Google releases Gemma 2 2B, ShieldGemma and Gemma Scope

Llama 3.1 - 405B, 70B & 8B with multilinguality and long context

WWDC 24: Running Mistral 7B with Core ML

Welcome Gemma 2 - Google's new open LLM

PaliGemma – Google's Cutting-Edge Open Vision Language Model

License to Call: Introducing Transformers Agents 2.0

Mixture of Experts Explained

Inference for PROs

