Best of Both Worlds: Advantages of Hybrid Graph Sequence Models • Paper • arXiv:2411.15671 • Published Nov 23, 2024 • 7 upvotes
All Languages Matter: Evaluating LMMs on Culturally Diverse 100 Languages • Paper • arXiv:2411.16508 • Published Nov 25, 2024 • 8 upvotes
Knowledge Transfer Across Modalities with Natural Language Supervision • Paper • arXiv:2411.15611 • Published Nov 23, 2024 • 15 upvotes
O1 Replication Journey -- Part 2: Surpassing O1-preview through Simple Distillation, Big Progress or Bitter Lesson? • Paper • arXiv:2411.16489 • Published Nov 25, 2024 • 42 upvotes
From Generation to Judgment: Opportunities and Challenges of LLM-as-a-judge • Paper • arXiv:2411.16594 • Published Nov 25, 2024 • 37 upvotes
Unleashing Reasoning Capability of LLMs via Scalable Question Synthesis from Scratch • Paper • arXiv:2410.18693 • Published Oct 24, 2024 • 40 upvotes
Can Knowledge Editing Really Correct Hallucinations? • Paper • arXiv:2410.16251 • Published Oct 21, 2024 • 54 upvotes
Breaking the Memory Barrier: Near Infinite Batch Size Scaling for Contrastive Loss • Paper • arXiv:2410.17243 • Published Oct 22, 2024 • 89 upvotes
On Memorization of Large Language Models in Logical Reasoning • Paper • arXiv:2410.23123 • Published Oct 30, 2024 • 18 upvotes
A Large Recurrent Action Model: xLSTM enables Fast Inference for Robotics Tasks • Paper • arXiv:2410.22391 • Published Oct 29, 2024 • 22 upvotes
Llama 3.1 Collection • This collection hosts the transformers and original repos of the Llama 3.1, Llama Guard 3, and Prompt Guard models • 11 items • Updated Dec 6, 2024 • 639 upvotes
Llama 3.2 Collection • This collection hosts the transformers and original repos of the Llama 3.2 and Llama Guard 3 models • 15 items • Updated Dec 6, 2024 • 561 upvotes