Max Glushko

Pelmeshek

AI & ML interests

None yet

Recent Activity

reacted to Jaward's post with 👀 about 1 month ago

nanoBLT: Simplified lightweight implementation of a character-level Byte Latent Transformer model (under 500 lines of code). The model is 2x4x2 (n_layers_encoder, n_layers_latent, n_layers_decoder) layer deep trained on ~1M bytes of tiny Shakespeare with a patch size of 4. Code: https://github.com/Jaykef/ai-algorithms/blob/main/byte_latent_transformer.ipynb

liked a model 4 months ago

meta-llama/Llama-3.2-1B-Instruct

updated a model 6 months ago

Pelmeshek/sd-class-butterflies-64

View all activity

Organizations

None yet

Pelmeshek's activity

reacted to Jaward's post with 👀 about 1 month ago

Post

3018

nanoBLT: Simplified lightweight implementation of a character-level Byte Latent Transformer model (under 500 lines of code). The model is 2x4x2 (n_layers_encoder, n_layers_latent, n_layers_decoder) layer deep trained on ~1M bytes of tiny Shakespeare with a patch size of 4.

Code: https://github.com/Jaykef/ai-algorithms/blob/main/byte_latent_transformer.ipynb

liked a model 4 months ago

meta-llama/Llama-3.2-1B-Instruct

Text Generation • Updated Oct 24, 2024 • 1.35M • • 734

updated 2 models 6 months ago

Pelmeshek/sd-class-butterflies-64

Updated Jul 21, 2024

Pelmeshek/sd-class-butterflies-32

Unconditional Image Generation • Updated Jul 21, 2024

upvoted 2 collections 8 months ago

The Big Benchmarks Collection

Collection

Gathering benchmark spaces on the hub (beyond the Open LLM Leaderboard) • 13 items • Updated Nov 18, 2024 • 189

Nemotron 4 340B

Collection

Nemotron-4: open models for Synthetic Data Generation (SDG). Includes Base, Instruct, and Reward models. • 4 items • Updated 15 days ago • 161

liked a model 9 months ago

apple/OpenELM

Updated May 2, 2024 • 1.43k

liked a model 10 months ago

hpcai-tech/grok-1

Text Generation • Updated Mar 28, 2024 • 2.46k • 73

liked 2 Spaces 11 months ago

Paused

135

🌟

StarChat2 Demo

Running on CPU Upgrade

12.4k

🏆

Open LLM Leaderboard

Track, rank and evaluate open LLMs and chatbots

updated 2 datasets 11 months ago

Pelmeshek/enbeds_for_int20h

Viewer • Updated Mar 2, 2024 • 8.47k • 6

Pelmeshek/twitter_liberal

Viewer • Updated Feb 23, 2024 • 1.84k • 45

reacted to clefourrier's post with ❤️ 12 months ago

Post

🔥 New LLM leaderboard on the hub: an Enterprise Scenarios Leaderboard!

This work evaluates LLMs on several real world use cases (Finance documents, Legal confidentiality, Customer support, ...), which makes it grounded, and interesting for companies! 🏢
Bonus: the test set is private, so it's hard to game 🔥
PatronusAI/enterprise_scenarios_leaderboard

Side note: I discovered through this benchmark that you could evaluate "Engagingness" of an LLM, which could also be interesting for our LLM fine-tuning community out there.

Read more about their different tasks and metrics in the intro blog: https://huggingface.co/blog/leaderboards-on-the-hub-patronus

Congrats to @sunitha98 who led the leaderboard implementation, and to @rebeccaqian and @anandnk24 , all at Patronus AI !

2 replies

reacted to julien-c's post with 👍 12 months ago

Post

📣 NEW on HF

the Dataset Viewer is now available on *private datasets* too

You need to be a PRO or a Enterprise Hub user. 🔥

Great work from our Datasets team 🥰: @lhoestq @severo @polinaeterna @asoria @albertvillanova and the whole team 🥰

1 reply

reacted to Locutusque's post with 👍 12 months ago

Post

Introducing the "UltraTextbooks" dataset 🚀📚
Check it out here: Locutusque/UltraTextbooks
📘 A comprehensive collection of high-quality synthetic and human-written textbooks
👨‍🎓 Spanning various subjects and programming languages
🔧 Designed for advanced NLP tasks like language modeling, educational QA, text summarization, and content generation for edu purposes
🚀 Future expansions planned with additional data sources to enhance the corpus
👇 Data composition highlights 👇
- Blend of synthetic and human-written material
- Includes topics from general edu to specialized areas
- Structured with field "text"
🧩 Data collection from various Hugging Face datasets, guided by a diverse and comprehensive curation rationale
🚧 Limitations may exist, so report any issues you encounter

2 replies

liked 2 models over 1 year ago

google/pegasus-xsum

Summarization • Updated Jan 24, 2023 • 116k • 184

Aniemore/rubert-large-emotion-russian-cedr-m7

Text Classification • Updated Apr 7, 2023 • 2.03k • 3

liked a dataset about 2 years ago

IlyaGusev/gazeta

Updated Feb 12, 2023 • 727 • 25