Santiago Garcia

santyzenith

AI & ML interests

Large language models, natural language processing, computer vision, Spanish large language models.

Recent Activity

updated a model 17 days ago

santyzenith/UDA-LIDI-Whisper-large-ECU-911

updated a model 17 days ago

santyzenith/UDA-LIDI-Whisper-large-v3-ECU-911

liked a model 18 days ago

ibm-granite/granite-3.1-8b-instruct

santyzenith's activity

upvoted a collection 21 days ago

RLHF

Collection

A collection of models trained with Reinforcement Learning from Human Feedback (RLHF). • 4 items • Updated about 12 hours ago • 5

upvoted a collection 3 months ago

LLM2Vec

Collection

16 items • Updated Oct 8, 2024 • 39

upvoted 2 articles 3 months ago

Article

Train a Llama model from scratch

Jul 29, 2024

• 48

Article

Vision Language Models Explained

Apr 11, 2024

• 239

upvoted an article 4 months ago

Article

Fine-tuning 20B LLMs with RLHF on a 24GB consumer GPU

Mar 9, 2023

• 35

upvoted 2 papers 5 months ago

Compact Language Models via Pruning and Knowledge Distillation

Paper • 2407.14679 • Published Jul 19, 2024 • 39

MoLE : Mixture of Language Experts for Multi-Lingual Automatic Speech Recognition

Paper • 2302.13750 • Published Feb 27, 2023 • 2

upvoted 3 articles 5 months ago

Article

Introduction to Graph Machine Learning

Jan 3, 2023

• 20

Article

Llama 3.1 - 405B, 70B & 8B with multilinguality and long context

Jul 23, 2024

• 225

Article

Welcome Gemma 2 - Google's new open LLM

Jun 27, 2024

• 124

upvoted a paper 6 months ago

DataComp-LM: In search of the next generation of training sets for language models

Paper • 2406.11794 • Published Jun 17, 2024 • 50

upvoted 2 articles 6 months ago

Article

SmolLM - blazingly fast and remarkably powerful

Jul 16, 2024

• 295

Article

From PyTorch DDP to 🤗 Accelerate to 🤗 Trainer, mastery of distributed training with ease

Oct 21, 2022

• 17

upvoted a paper 6 months ago

Tuna: Instruction Tuning using Feedback from Large Language Models

Paper • 2310.13385 • Published Oct 20, 2023 • 10

upvoted a collection 6 months ago

Knowledge distillation

Collection

88 items • Updated Feb 7, 2024 • 7

upvoted 2 articles 6 months ago

Article

Putting RL back in RLHF

Jun 12, 2024

• 66

Article

Fine-Tune Whisper with 🤗 Transformers

Nov 3, 2022

• 140

upvoted 3 papers 6 months ago

Datasets: A Community Library for Natural Language Processing

Paper • 2109.02846 • Published Sep 7, 2021 • 11

Estimating Knowledge in Large Language Models Without Generating a Single Token

Paper • 2406.12673 • Published Jun 18, 2024 • 7

A Systematic Survey of Text Summarization: From Statistical Methods to Large Language Models

Paper • 2406.11289 • Published Jun 17, 2024 • 5