-
LongLoRA: Efficient Fine-tuning of Long-Context Large Language Models
Paper • 2309.12307 • Published • 88 -
LMDX: Language Model-based Document Information Extraction and Localization
Paper • 2309.10952 • Published • 65 -
Table-GPT: Table-tuned GPT for Diverse Table Tasks
Paper • 2310.09263 • Published • 39 -
BitNet: Scaling 1-bit Transformers for Large Language Models
Paper • 2310.11453 • Published • 96
Collections
Discover the best community collections!
Collections including paper arxiv:2309.00071
-
Teach LLMs to Personalize -- An Approach inspired by Writing Education
Paper • 2308.07968 • Published • 26 -
YaRN: Efficient Context Window Extension of Large Language Models
Paper • 2309.00071 • Published • 67 -
Enable Language Models to Implicitly Learn Self-Improvement From Data
Paper • 2310.00898 • Published • 23
-
TheBirdLegacy/FreeLoaderLM
Text Generation • Updated -
CofeAI/FLM-101B
Text Generation • Updated • 22 • 92 -
FLM-101B: An Open LLM and How to Train It with $100K Budget
Paper • 2309.03852 • Published • 44 -
Composable Function-preserving Expansions for Transformer Architectures
Paper • 2308.06103 • Published • 20