Esmaeiliyan PRO

Mohammadreza

https://t.me/AI_360

AI & ML interests

VLMs and LLMs

Recent Activity

liked a Space 3 days ago

PartAI/pteb-leaderboard

upvoted a paper 15 days ago

Smarter, Better, Faster, Longer: A Modern Bidirectional Encoder for Fast, Memory Efficient, and Long Context Finetuning and Inference

liked a Space 16 days ago

HuggingFaceH4/blogpost-scaling-test-time-compute

Mohammadreza's activity

upvoted a paper 15 days ago

Smarter, Better, Faster, Longer: A Modern Bidirectional Encoder for Fast, Memory Efficient, and Long Context Finetuning and Inference

Paper • 2412.13663 • Published 17 days ago • 116

upvoted an article about 1 month ago

Article

Zero to Hero with the TRL learning link bomb 💣

Nov 25, 2024

• 4

upvoted a collection about 1 month ago

The Big Benchmarks Collection

Gathering benchmark spaces on the hub (beyond the Open LLM Leaderboard) • 13 items • Updated Nov 18, 2024 • 183

upvoted a collection 3 months ago

LLM Reasoning Papers

Papers to improve reasoning capabilities of LLMs • 17 items • Updated 12 days ago • 93

upvoted 2 papers 3 months ago

Only-IF:Revealing the Decisive Effect of Instruction Diversity on Generalization

Paper • 2410.04717 • Published Oct 7, 2024 • 18

Emu3: Next-Token Prediction is All You Need

Paper • 2409.18869 • Published Sep 27, 2024 • 94

upvoted a paper 4 months ago

Spectrum: Targeted Training on Signal to Noise Ratio

Paper • 2406.06623 • Published Jun 7, 2024 • 12

upvoted a collection 5 months ago

Persian Models

This is the largest collection of Persian models available on Huggingface • 652 items • Updated about 17 hours ago • 4

upvoted 2 papers 5 months ago

xGen-MM (BLIP-3): A Family of Open Large Multimodal Models

Paper • 2408.08872 • Published Aug 16, 2024 • 98

Improving Text Embeddings for Smaller Language Models Using Contrastive Fine-tuning

Paper • 2408.00690 • Published Aug 1, 2024 • 23

upvoted an article 5 months ago

Article

Deploy hundreds of open source models on one GPU using LoRAX

Jul 18, 2024

• 3

upvoted a collection 5 months ago

Llama 3.1

This collection hosts the transformers and original repos of the Llama 3.1, Llama Guard 3 and Prompt Guard models • 11 items • Updated 29 days ago • 637

upvoted an article 5 months ago

Article

Llama 3.1 - 405B, 70B & 8B with multilinguality and long context

Jul 23, 2024

• 225

upvoted a collection 6 months ago

Product Catalog Generator

Product Catalog Generator for Persian products which is hosted by Basalam • 7 items • Updated Sep 7, 2024 • 8

upvoted a paper 6 months ago

Qwen2 Technical Report

Paper • 2407.10671 • Published Jul 15, 2024 • 160

upvoted 2 articles 6 months ago

Article

The Rise of Agentic Data Generation

Jul 15, 2024

• 79

Article

Preference Optimization for Vision Language Models

Jul 10, 2024

• 54

upvoted a paper 7 months ago

PersianMind: A Cross-Lingual Persian-English Large Language Model

Paper • 2401.06466 • Published Jan 12, 2024 • 3

upvoted a collection 7 months ago

LLaVa-NeXT

LLaVa-NeXT (also known as LLaVa-1.6) improves upon the 1.5 series by incorporating higher image resolutions and more reasoning/OCR datasets. • 8 items • Updated Jul 19, 2024 • 27

upvoted an article 7 months ago

Article

Design choices for Vision Language Models in 2024

Apr 16, 2024

• 25