Marwa El Kamil's picture

Marwa El Kamil

maghwa

·

AI & ML interests

None yet

Recent Activity

upvoted a collection 2 days ago

PaliGemma FT Models

liked a dataset 4 days ago

bigcode/the-stack-smol

liked a Space 4 days ago

opencompass/open_vlm_leaderboard

View all activity

Organizations

maghwa's activity

upvoted a collection 2 days ago

PaliGemma FT Models

108 items • Updated 22 days ago • 31

upvoted a collection 5 days ago

Preference Datasets for DPO

This collection contains a list of curated preference datasets for DPO fine-tuning for intent alignment of LLMs • 7 items • Updated 24 days ago • 38

upvoted an article 26 days ago

Article

Finding Moroccan Arabic (Darija) in Fineweb 2

By

•

26 days ago

• 20

upvoted an article 3 months ago

Article

The Open Medical-LLM Leaderboard: Benchmarking Large Language Models in Healthcare

Apr 19, 2024

• 126

upvoted a collection 4 months ago

Arabic Aya DPO Datasets

Our synthetic DPO datasets for Arabic Aya. • 5 items • Updated Jun 4, 2024 • 4

upvoted a paper 6 months ago

101 Billion Arabic Words Dataset

Paper • 2405.01590 • Published Apr 29, 2024 • 5

upvoted an article 6 months ago

Article

Tokenization Is A Dead Weight (Tokun Part 1)

By

•

Jun 27, 2024

• 16

upvoted 2 papers 7 months ago

Tokenization Falling Short: The Curse of Tokenization

Paper • 2406.11687 • Published Jun 17, 2024 • 15

CroissantLLM: A Truly Bilingual French-English Language Model

Paper • 2402.00786 • Published Feb 1, 2024 • 25

upvoted an article 7 months ago

Article

🥐CroissantLLM: A Truly Bilingual French-English Language Model

By

•

Feb 5, 2024

• 11

upvoted a collection 7 months ago

FrenchBench Evaluation datasets

These datasets are used to evaluate models on French performance using: https://github.com/EleutherAI/lm-evaluation-harness (from CroissantLLM paper) • 11 items • Updated Jun 7, 2024 • 5

upvoted an article 7 months ago

Article

Introducing the Open Arabic LLM Leaderboard

May 14, 2024

• 77

upvoted a paper 7 months ago

Judging LLM-as-a-judge with MT-Bench and Chatbot Arena

Paper • 2306.05685 • Published Jun 9, 2023 • 32

upvoted an article 7 months ago

Article

Let's talk about LLM evaluation

By

•

May 23, 2024

• 143

upvoted an article 8 months ago

Article

Text2SQL using Hugging Face Dataset Viewer API and Motherduck DuckDB-NSQL-7B

Apr 4, 2024

• 25

upvoted a paper 9 months ago

Dynamic Typography: Bringing Words to Life

Paper • 2404.11614 • Published Apr 17, 2024 • 44

upvoted 2 papers 11 months ago

BloombergGPT: A Large Language Model for Finance

Paper • 2303.17564 • Published Mar 30, 2023 • 21

Mistral 7B

Paper • 2310.06825 • Published Oct 10, 2023 • 47