Max Glushko's picture
2 8

Max Glushko

Pelmeshek
ยท

AI & ML interests

None yet

Recent Activity

Organizations

None yet

Pelmeshek's activity

reacted to Jaward's post with ๐Ÿ‘€ about 1 month ago
view post
Post
3018
nanoBLT: Simplified lightweight implementation of a character-level Byte Latent Transformer model (under 500 lines of code). The model is 2x4x2 (n_layers_encoder, n_layers_latent, n_layers_decoder) layer deep trained on ~1M bytes of tiny Shakespeare with a patch size of 4.

Code: https://github.com/Jaykef/ai-algorithms/blob/main/byte_latent_transformer.ipynb
reacted to clefourrier's post with โค๏ธ 12 months ago
view post
Post
๐Ÿ”ฅ New LLM leaderboard on the hub: an Enterprise Scenarios Leaderboard!

This work evaluates LLMs on several real world use cases (Finance documents, Legal confidentiality, Customer support, ...), which makes it grounded, and interesting for companies! ๐Ÿข
Bonus: the test set is private, so it's hard to game ๐Ÿ”ฅ
PatronusAI/enterprise_scenarios_leaderboard

Side note: I discovered through this benchmark that you could evaluate "Engagingness" of an LLM, which could also be interesting for our LLM fine-tuning community out there.

Read more about their different tasks and metrics in the intro blog: https://huggingface.co/blog/leaderboards-on-the-hub-patronus

Congrats to @sunitha98 who led the leaderboard implementation, and to @rebeccaqian and @anandnk24 , all at Patronus AI !
  • 2 replies
ยท
reacted to julien-c's post with ๐Ÿ‘ 12 months ago
view post
Post
๐Ÿ“ฃ NEW on HF

the Dataset Viewer is now available on *private datasets* too

You need to be a PRO or a Enterprise Hub user. ๐Ÿ”ฅ

Great work from our Datasets team ๐Ÿฅฐ: @lhoestq @severo @polinaeterna @asoria @albertvillanova and the whole team ๐Ÿฅฐ
  • 1 reply
ยท
reacted to Locutusque's post with ๐Ÿ‘ 12 months ago
view post
Post
Introducing the "UltraTextbooks" dataset ๐Ÿš€๐Ÿ“š
Check it out here: Locutusque/UltraTextbooks
๐Ÿ“˜ A comprehensive collection of high-quality synthetic and human-written textbooks
๐Ÿ‘จโ€๐ŸŽ“ Spanning various subjects and programming languages
๐Ÿ”ง Designed for advanced NLP tasks like language modeling, educational QA, text summarization, and content generation for edu purposes
๐Ÿš€ Future expansions planned with additional data sources to enhance the corpus
๐Ÿ‘‡ Data composition highlights ๐Ÿ‘‡
- Blend of synthetic and human-written material
- Includes topics from general edu to specialized areas
- Structured with field "text"
๐Ÿงฉ Data collection from various Hugging Face datasets, guided by a diverse and comprehensive curation rationale
๐Ÿšง Limitations may exist, so report any issues you encounter
  • 2 replies
ยท