belyakoff PRO

https://cval.ai

fangorntb

AI & ML interests

NLP/NLU

Recent Activity

new activity about 1 month ago

belyakoff/xlam-ru-tool-calling:Librarian Bot: Add language metadata for dataset

new activity about 1 month ago

belyakoff/xlam-ru-tool-calling:[bot] Conversion to Parquet

updated a dataset about 1 month ago

belyakoff/xlam-ru-tool-calling

belyakoff's activity

upvoted an article 6 months ago

Article

Llama 3.1 - 405B, 70B & 8B with multilinguality and long context

Jul 23, 2024

• 225

upvoted 3 papers 7 months ago

Transformers are SSMs: Generalized Models and Efficient Algorithms Through Structured State Space Duality

Paper • 2405.21060 • Published May 31, 2024 • 64

Perplexed by Perplexity: Perplexity-Based Data Pruning With Small Reference Models

Paper • 2405.20541 • Published May 30, 2024 • 22

Similarity is Not All You Need: Endowing Retrieval Augmented Generation with Multi Layered Thoughts

Paper • 2405.19893 • Published May 30, 2024 • 31

upvoted 5 papers 8 months ago

Jina Embeddings 2: 8192-Token General-Purpose Text Embeddings for Long Documents

Paper • 2310.19923 • Published Oct 30, 2023 • 14

upvoted 3 articles 8 months ago

Article

🕳️ Attention Sinks in LLMs for endless fluency

Oct 9, 2023

• 7

Article

makeMoE: Implement a Sparse Mixture of Experts Language Model from Scratch

May 7, 2024

• 42

Article

Inference for PROs

Sep 22, 2023

• 52

upvoted a paper 10 months ago

Soaring from 4K to 400K: Extending LLM's Context with Activation Beacon

Paper • 2401.03462 • Published Jan 7, 2024 • 27