Collections
Discover the best community collections!
Collections including paper arxiv:2403.10131
-
PRDP: Proximal Reward Difference Prediction for Large-Scale Reward Finetuning of Diffusion Models
Paper ā¢ 2402.08714 ā¢ Published ā¢ 11 -
Data Engineering for Scaling Language Models to 128K Context
Paper ā¢ 2402.10171 ā¢ Published ā¢ 23 -
RLVF: Learning from Verbal Feedback without Overgeneralization
Paper ā¢ 2402.10893 ā¢ Published ā¢ 10 -
Coercing LLMs to do and reveal (almost) anything
Paper ā¢ 2402.14020 ā¢ Published ā¢ 12
-
Self-Rewarding Language Models
Paper ā¢ 2401.10020 ā¢ Published ā¢ 145 -
Direct Preference Optimization: Your Language Model is Secretly a Reward Model
Paper ā¢ 2305.18290 ā¢ Published ā¢ 50 -
OLMo: Accelerating the Science of Language Models
Paper ā¢ 2402.00838 ā¢ Published ā¢ 82 -
OpenMoE: An Early Effort on Open Mixture-of-Experts Language Models
Paper ā¢ 2402.01739 ā¢ Published ā¢ 26
-
Self-Rewarding Language Models
Paper ā¢ 2401.10020 ā¢ Published ā¢ 145 -
ReFT: Reasoning with Reinforced Fine-Tuning
Paper ā¢ 2401.08967 ā¢ Published ā¢ 29 -
Tuning Language Models by Proxy
Paper ā¢ 2401.08565 ā¢ Published ā¢ 21 -
TrustLLM: Trustworthiness in Large Language Models
Paper ā¢ 2401.05561 ā¢ Published ā¢ 66
-
Multilingual Instruction Tuning With Just a Pinch of Multilinguality
Paper ā¢ 2401.01854 ā¢ Published ā¢ 10 -
LLaMA Beyond English: An Empirical Study on Language Capability Transfer
Paper ā¢ 2401.01055 ā¢ Published ā¢ 54 -
LLM Maybe LongLM: Self-Extend LLM Context Window Without Tuning
Paper ā¢ 2401.01325 ā¢ Published ā¢ 27 -
Improving Text Embeddings with Large Language Models
Paper ā¢ 2401.00368 ā¢ Published ā¢ 79
-
Self-RAG: Learning to Retrieve, Generate, and Critique through Self-Reflection
Paper ā¢ 2310.11511 ā¢ Published ā¢ 75 -
REST: Retrieval-Based Speculative Decoding
Paper ā¢ 2311.08252 ā¢ Published -
Active Retrieval Augmented Generation
Paper ā¢ 2305.06983 ā¢ Published ā¢ 3 -
Retrieval-Augmented Generation for Large Language Models: A Survey
Paper ā¢ 2312.10997 ā¢ Published ā¢ 10
-
Is ChatGPT Good at Search? Investigating Large Language Models as Re-Ranking Agent
Paper ā¢ 2304.09542 ā¢ Published ā¢ 4 -
Dense X Retrieval: What Retrieval Granularity Should We Use?
Paper ā¢ 2312.06648 ā¢ Published ā¢ 1 -
Context Tuning for Retrieval Augmented Generation
Paper ā¢ 2312.05708 ā¢ Published ā¢ 17 -
Rank-without-GPT: Building GPT-Independent Listwise Rerankers on Open-Source Large Language Models
Paper ā¢ 2312.02969 ā¢ Published ā¢ 12
-
VideoSwap: Customized Video Subject Swapping with Interactive Semantic Point Correspondence
Paper ā¢ 2312.02087 ā¢ Published ā¢ 20 -
FaceStudio: Put Your Face Everywhere in Seconds
Paper ā¢ 2312.02663 ā¢ Published ā¢ 30 -
Orthogonal Adaptation for Modular Customization of Diffusion Models
Paper ā¢ 2312.02432 ā¢ Published ā¢ 12 -
ReconFusion: 3D Reconstruction with Diffusion Priors
Paper ā¢ 2312.02981 ā¢ Published ā¢ 8
-
A survey on Kornia: an Open Source Differentiable Computer Vision Library for PyTorch
Paper ā¢ 2009.10521 ā¢ Published ā¢ 1 -
Kornia: an Open Source Differentiable Computer Vision Library for PyTorch
Paper ā¢ 1910.02190 ā¢ Published ā¢ 1 -
Learning Symmetrization for Equivariance with Orbit Distance Minimization
Paper ā¢ 2311.07143 ā¢ Published ā¢ 1 -
GS-SLAM: Dense Visual SLAM with 3D Gaussian Splatting
Paper ā¢ 2311.11700 ā¢ Published ā¢ 4
-
Understanding LLMs: A Comprehensive Overview from Training to Inference
Paper ā¢ 2401.02038 ā¢ Published ā¢ 62 -
Learning To Teach Large Language Models Logical Reasoning
Paper ā¢ 2310.09158 ā¢ Published ā¢ 1 -
ChipNeMo: Domain-Adapted LLMs for Chip Design
Paper ā¢ 2311.00176 ā¢ Published ā¢ 8 -
WizardMath: Empowering Mathematical Reasoning for Large Language Models via Reinforced Evol-Instruct
Paper ā¢ 2308.09583 ā¢ Published ā¢ 7