-
SPaR: Self-Play with Tree-Search Refinement to Improve Instruction-Following in Large Language Models
Paper • 2412.11605 • Published • 16 -
Byte Latent Transformer: Patches Scale Better Than Tokens
Paper • 2412.09871 • Published • 80 -
Fourier Position Embedding: Enhancing Attention's Periodic Extension for Length Generalization
Paper • 2412.17739 • Published • 37 -
SKETCH: Structured Knowledge Enhanced Text Comprehension for Holistic Retrieval
Paper • 2412.15443 • Published • 8
Felix Tuma
floom
·
AI & ML interests
NLP
Recent Activity
liked
a model
3 days ago
nomic-ai/modernbert-embed-base
updated
a collection
4 days ago
ShowAndTell
updated
a collection
4 days ago
ShowAndTell
Organizations
None yet
Collections
28
-
Beyond Examples: High-level Automated Reasoning Paradigm in In-Context Learning via MCTS
Paper • 2411.18478 • Published • 32 -
o1-Coder: an o1 Replication for Coding
Paper • 2412.00154 • Published • 41 -
A Simple and Provable Scaling Law for the Test-Time Compute of Large Language Models
Paper • 2411.19477 • Published • 5 -
Reverse Thinking Makes LLMs Stronger Reasoners
Paper • 2411.19865 • Published • 19
models
None public yet
datasets
None public yet