Lukas Bug's picture

45 206

Lukas Bug

1ucky1uke

·

AI & ML interests

None yet

Recent Activity

liked a model 19 days ago

google/paligemma2-10b-pt-896

upvoted an article about 1 month ago

🐺🐦‍⬛ LLM Comparison/Test: 25 SOTA LLMs (including QwQ) through 59 MMLU-Pro CS benchmark runs

upvoted a paper about 1 month ago

PaliGemma 2: A Family of Versatile VLMs for Transfer

View all activity

Organizations

None yet

1ucky1uke's activity

upvoted an article about 1 month ago

Article

🐺🐦‍⬛ LLM Comparison/Test: 25 SOTA LLMs (including QwQ) through 59 MMLU-Pro CS benchmark runs

By

•

Dec 4, 2024

• 75

upvoted a paper about 1 month ago

PaliGemma 2: A Family of Versatile VLMs for Transfer

Paper • 2412.03555 • Published Dec 4, 2024 • 121

upvoted a paper 4 months ago

ColPali: Efficient Document Retrieval with Vision Language Models

Paper • 2407.01449 • Published Jun 27, 2024 • 42

upvoted a paper 5 months ago

Transformer Explainer: Interactive Learning of Text-Generative Models

Paper • 2408.04619 • Published Aug 8, 2024 • 156

upvoted an article 6 months ago

Article

Assisted Generation: a new direction toward low-latency text generation

May 11, 2023

• 38

upvoted 2 collections 6 months ago

Llama 3.1 GPTQ, AWQ, and BNB Quants

Optimised Quants for high-throughput deployments! Compatible with Transformers, TGI & VLLM 🤗 • 9 items • Updated Sep 26, 2024 • 56

Llama 3.1

This collection hosts the transformers and original repos of the Llama 3.1, Llama Guard 3 and Prompt Guard models • 11 items • Updated about 1 month ago • 638

upvoted an article 6 months ago

Article

Llama 3.1 - 405B, 70B & 8B with multilinguality and long context

Jul 23, 2024

• 225

upvoted a paper 6 months ago

Spectra: A Comprehensive Study of Ternary, Quantized, and FP16 Language Models

Paper • 2407.12327 • Published Jul 17, 2024 • 77

upvoted a collection 6 months ago

NuminaMath

Datasets and models for training SOTA math LLMs. See our GitHub for training & inference code: https://github.com/project-numina/aimo-progress-prize • 6 items • Updated Jul 21, 2024 • 69

upvoted an article 6 months ago

Article

The Rise of Agentic Data Generation

By

•

Jul 15, 2024

• 80

upvoted a collection 6 months ago

🪐 SmolLM

A series of smol LLMs: 135M, 360M and 1.7B. We release base and Instruct models as well as the training corpus and some WebGPU demos • 12 items • Updated 14 days ago • 206

upvoted 2 articles 6 months ago

Article

SmolLM - blazingly fast and remarkably powerful

Jul 16, 2024

• 295

Article

PaliGemma – Google's Cutting-Edge Open Vision Language Model

May 14, 2024

• 231

upvoted 3 papers 6 months ago

Tree of Thoughts: Deliberate Problem Solving with Large Language Models

Paper • 2305.10601 • Published May 17, 2023 • 11

FunAudioLLM: Voice Understanding and Generation Foundation Models for Natural Interaction Between Humans and LLMs

Paper • 2407.04051 • Published Jul 4, 2024 • 35

AgentInstruct: Toward Generative Teaching with Agentic Flows

Paper • 2407.03502 • Published Jul 3, 2024 • 51

upvoted 3 articles 6 months ago

Article

How NuminaMath Won the 1st AIMO Progress Prize

Jul 11, 2024

• 110

Article

Google Cloud TPUs made available to Hugging Face users

Jul 9, 2024

• 19

Article

BigCodeBench: Benchmarking Large Language Models on Solving Practical and Challenging Programming Tasks

Jun 18, 2024

• 43