Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
alexngai
's Collections
Retrieval/RAG
Self-Critique
Automated Research
Test-Time Compute/Optimal Scaling
General LLM
Automated SWE
Code LLMs
Multi-Agent
Self-Improving Agents
Automated ML
Codegen Benchmarks
Self-Critique
updated
2 days ago
Upvote
-
Enabling Scalable Oversight via Self-Evolving Critic
Paper
•
2501.05727
•
Published
8 days ago
•
64
Upvote
-
Share collection
View history
Collection guide
Browse collections