Matricardi Fabio

FM-1976

https://medium.com/@fabio.matricardi

AI & ML interests

Control system engineering, AI, and LLMs with Python. ThePoorGPUguy on Substack.

Recent Activity

updated a collection 15 days ago

Organizations

None yet

FM-1976's activity

upvoted a paper 15 days ago

Smarter, Better, Faster, Longer: A Modern Bidirectional Encoder for Fast, Memory Efficient, and Long Context Finetuning and Inference

Paper • 2412.13663 • Published 17 days ago • 116

upvoted a paper 20 days ago

Hermes 3 Technical Report

Paper • 2408.11857 • Published Aug 15, 2024 • 42

upvoted 5 papers about 1 month ago

RedPajama: an Open Dataset for Training Large Language Models

Paper • 2411.12372 • Published Nov 19, 2024 • 47

FluidML: Fast and Memory Efficient Inference Optimization

Paper • 2411.09242 • Published Nov 14, 2024 • 1

Natural Language Reinforcement Learning

Paper • 2411.14251 • Published Nov 21, 2024 • 27

TÜLU 3: Pushing Frontiers in Open Language Model Post-Training

Paper • 2411.15124 • Published Nov 22, 2024 • 57

Ultra-Sparse Memory Network

Paper • 2411.12364 • Published Nov 19, 2024 • 19

upvoted a collection 2 months ago

SmolLM2

State-of-the-art compact LLMs for on-device applications: 1.7B, 360M, 135M • 15 items • Updated 12 days ago • 197

upvoted a paper 2 months ago

Scalable MatMul-free Language Modeling

Paper • 2406.02528 • Published Jun 4, 2024 • 11

upvoted a collection 3 months ago

LLM

Collection of OpenVINO optimized LLMs • 135 items • Updated 12 days ago • 19

upvoted an article 3 months ago

Fine-tuning LLMs to 1.58bit: extreme quantization made easy

Article • Sep 18, 2024 • 215

upvoted a collection 3 months ago

Llama 3.2

This collection hosts the transformers and original repos of the Llama 3.2 and Llama Guard 3 • 15 items • Updated 29 days ago • 551

upvoted a collection 4 months ago

DataGemma Release

A series of pioneering open models that help ground LLMs in real-world data through Data Commons. • 2 items • Updated 22 days ago • 82

upvoted 2 collections 5 months ago

LLM

Multimodal LLM • 238 items • Updated Sep 26, 2024 • 11

Trained Models 🏋️

They may be small, but they're training like giants! • 8 items • Updated Dec 3, 2024 • 17

upvoted a collection 8 months ago

Minerva LLMs

The first family of LLMs pretrained from scratch on Italian. • 6 items • Updated 28 days ago • 32