Yedson54
's Collections
Reasoning
updated
Chain-of-Knowledge: Integrating Knowledge Reasoning into Large Language
Models by Learning from Knowledge Graphs
Paper
•
2407.00653
•
Published
•
11
Step-DPO: Step-wise Preference Optimization for Long-chain Reasoning of
LLMs
Paper
•
2406.18629
•
Published
•
41
Whiteboard-of-Thought: Thinking Step-by-Step Across Modalities
Paper
•
2406.14562
•
Published
•
27
Buffer of Thoughts: Thought-Augmented Reasoning with Large Language
Models
Paper
•
2406.04271
•
Published
•
28
Iterative Reasoning Preference Optimization
Paper
•
2404.19733
•
Published
•
47
FlowMind: Automatic Workflow Generation with LLMs
Paper
•
2404.13050
•
Published
•
33
Cognitive Map for Language Models: Optimal Planning via Verbally
Representing the World Model
Paper
•
2406.15275
•
Published
•
11
Learn Beyond The Answer: Training Language Models with Reflection for
Mathematical Reasoning
Paper
•
2406.12050
•
Published
•
19
Improve Mathematical Reasoning in Language Models by Automated Process
Supervision
Paper
•
2406.06592
•
Published
•
26
Transformers meet Neural Algorithmic Reasoners
Paper
•
2406.09308
•
Published
•
43
Test of Time: A Benchmark for Evaluating LLMs on Temporal Reasoning
Paper
•
2406.09170
•
Published
•
24
To Compress or Not to Compress- Self-Supervised Learning and Information
Theory: A Review
Paper
•
2304.09355
•
Published
•
5
Towards Building Specialized Generalist AI with System 1 and System 2
Fusion
Paper
•
2407.08642
•
Published
•
9
Teaching Large Language Models to Reason with Reinforcement Learning
Paper
•
2403.04642
•
Published
•
46
Chain-of-Thought Reasoning Without Prompting
Paper
•
2402.10200
•
Published
•
104
Self-Discover: Large Language Models Self-Compose Reasoning Structures
Paper
•
2402.03620
•
Published
•
114
Skywork-Math: Data Scaling Laws for Mathematical Reasoning in Large
Language Models -- The Story Goes On
Paper
•
2407.08348
•
Published
•
51
Case2Code: Learning Inductive Reasoning with Synthetic Data
Paper
•
2407.12504
•
Published
•
7
Internal Consistency and Self-Feedback in Large Language Models: A
Survey
Paper
•
2407.14507
•
Published
•
46
CoD, Towards an Interpretable Medical Agent using Chain of Diagnosis
Paper
•
2407.13301
•
Published
•
56
Self-Training with Direct Preference Optimization Improves
Chain-of-Thought Reasoning
Paper
•
2407.18248
•
Published
•
32
Mutual Reasoning Makes Smaller LLMs Stronger Problem-Solvers
Paper
•
2408.06195
•
Published
•
63
On the Diagram of Thought
Paper
•
2409.10038
•
Published
•
12
Not All LLM Reasoners Are Created Equal
Paper
•
2410.01748
•
Published
•
28