Quentin Tardif's picture

Quentin Tardif

ntnq

·

AI & ML interests

None yet

Recent Activity

liked a Space 2 days ago

Qwen/Qwen2.5-Coder-Artifacts

upvoted a paper 4 days ago

HumanEval Pro and MBPP Pro: Evaluating Large Language Models on Self-invoking Code Generation

liked a model 4 days ago

bigcode/starpii

View all activity

Organizations

ntnq's activity

upvoted 2 papers 4 days ago

HumanEval Pro and MBPP Pro: Evaluating Large Language Models on Self-invoking Code Generation

Paper • 2412.21199 • Published 4 days ago • 9

SelfCodeAlign: Self-Alignment for Code Generation

Paper • 2410.24198 • Published Oct 31, 2024 • 23

upvoted a paper 15 days ago

Qwen2.5 Technical Report

Paper • 2412.15115 • Published 15 days ago • 334

upvoted a paper 18 days ago

Scaling LLM Test-Time Compute Optimally can be More Effective than Scaling Model Parameters

Paper • 2408.03314 • Published Aug 6, 2024 • 53

upvoted a paper 20 days ago

Phi-4 Technical Report

Paper • 2412.08905 • Published 23 days ago • 95

upvoted a paper 24 days ago

Evaluating and Aligning CodeLLMs on Human Preference

Paper • 2412.05210 • Published 29 days ago • 47

upvoted a paper about 1 month ago

TÜLU 3: Pushing Frontiers in Open Language Model Post-Training

Paper • 2411.15124 • Published Nov 22, 2024 • 57

upvoted a paper 2 months ago

CORAL: Benchmarking Multi-turn Conversational Retrieval-Augmentation Generation

Paper • 2410.23090 • Published Oct 30, 2024 • 54

upvoted a collection 2 months ago

C4AI Aya Expanse

Aya Expanse is an open-weight research release of a model with highly advanced multilingual capabilities. • 3 items • Updated 18 days ago • 30

upvoted 2 papers 3 months ago

StructRAG: Boosting Knowledge Intensive Reasoning of LLMs via Inference-time Hybrid Information Structurization

Paper • 2410.08815 • Published Oct 11, 2024 • 44

Baichuan-Omni Technical Report

Paper • 2410.08565 • Published Oct 11, 2024 • 85

upvoted a collection 3 months ago

Salamandra 🦎

16 items • Updated 15 days ago • 39

upvoted 2 papers 3 months ago

Law of the Weakest Link: Cross Capabilities of Large Language Models

Paper • 2409.19951 • Published Sep 30, 2024 • 54

Molmo and PixMo: Open Weights and Open Data for State-of-the-Art Multimodal Models

Paper • 2409.17146 • Published Sep 25, 2024 • 104

upvoted 2 collections 3 months ago

Llama 3.2

This collection hosts the transformers and original repos of the Llama 3.2 and Llama Guard 3 • 15 items • Updated 29 days ago • 551

LLM Reasoning Papers

Papers to improve reasoning capabilities of LLMs • 17 items • Updated 12 days ago • 93

upvoted a paper 3 months ago

EuroLLM: Multilingual Language Models for Europe

Paper • 2409.16235 • Published Sep 24, 2024 • 25

upvoted a collection 4 months ago

OLMoE

Artifacts for open mixture-of-experts language models. • 13 items • Updated Nov 27, 2024 • 29

upvoted a paper 4 months ago

OLMoE: Open Mixture-of-Experts Language Models

Paper • 2409.02060 • Published Sep 3, 2024 • 77

upvoted an article 4 months ago

Article

Synthetic dataset generation techniques: generating custom sentence similarity data

By

•

May 23, 2024

• 16