4 11 115

Kawsar Ahmed

ka05ar

kawsar-pie

AI & ML interests

NLP

Recent Activity

updated a model 11 days ago

ka05ar/Tamil_llama_test_updated1904099

updated a model 11 days ago

ka05ar/AI_TAMIL_google_gemma_test_updated99

updated a model 11 days ago

ka05ar/AI_TAMIL_sarvam_test_updated99

View all activity

Organizations

None yet

ka05ar's activity

upvoted a collection 24 days ago

ModernBERT

Collection

Bringing BERT into modernity via both architecture changes and scaling • 3 items • Updated Dec 19, 2024 • 125

upvoted an article 3 months ago

Article

How NuminaMath Won the 1st AIMO Progress Prize

Jul 11, 2024

• 110

upvoted an article 8 months ago

Article

Comparing the Performance of LLMs: A Deep Dive into Roberta, Llama 2, and Mistral for Disaster Tweets Analysis with Lora

Nov 7, 2023

• 8

upvoted a collection 9 months ago

Gemma release

Collection

Groups the Gemma models released by the Google team. • 40 items • Updated Dec 13, 2024 • 329

upvoted 2 papers 9 months ago

Llemma: An Open Language Model For Mathematics

Paper • 2310.10631 • Published Oct 16, 2023 • 51

The Unreasonable Ineffectiveness of the Deeper Layers

Paper • 2403.17887 • Published Mar 26, 2024 • 79

upvoted a paper 10 months ago

Chronos: Learning the Language of Time Series

Paper • 2403.07815 • Published Mar 12, 2024 • 47

upvoted a paper 11 months ago

GaLore: Memory-Efficient LLM Training by Gradient Low-Rank Projection

Paper • 2403.03507 • Published Mar 6, 2024 • 184

upvoted a paper 12 months ago

DeepSeekMath: Pushing the Limits of Mathematical Reasoning in Open Language Models

Paper • 2402.03300 • Published Feb 5, 2024 • 78

upvoted 2 papers about 1 year ago

Routing to the Expert: Efficient Reward-guided Ensemble of Large Language Models

Paper • 2311.08692 • Published Nov 15, 2023 • 12

SelfEval: Leveraging the discriminative nature of generative models for evaluation

Paper • 2311.10708 • Published Nov 17, 2023 • 14