taghizadeh
's Collections
llms
updated
LLaMA Beyond English: An Empirical Study on Language Capability Transfer
Paper
•
2401.01055
•
Published
•
54
Self-Play Fine-Tuning Converts Weak Language Models to Strong Language
Models
Paper
•
2401.01335
•
Published
•
64
DocLLM: A layout-aware generative language model for multimodal document
understanding
Paper
•
2401.00908
•
Published
•
181
Multilingual Instruction Tuning With Just a Pinch of Multilinguality
Paper
•
2401.01854
•
Published
•
11
Understanding LLMs: A Comprehensive Overview from Training to Inference
Paper
•
2401.02038
•
Published
•
63
TinyLlama: An Open-Source Small Language Model
Paper
•
2401.02385
•
Published
•
91
LLaMA Pro: Progressive LLaMA with Block Expansion
Paper
•
2401.02415
•
Published
•
54
Lightning Attention-2: A Free Lunch for Handling Unlimited Sequence
Lengths in Large Language Models
Paper
•
2401.04658
•
Published
•
27
The Impact of Reasoning Step Length on Large Language Models
Paper
•
2401.04925
•
Published
•
16
MaLA-500: Massive Language Adaptation of Large Language Models
Paper
•
2401.13303
•
Published
•
12
mPLUG-DocOwl 1.5: Unified Structure Learning for OCR-free Document
Understanding
Paper
•
2403.12895
•
Published
•
32
Localizing Paragraph Memorization in Language Models
Paper
•
2403.19851
•
Published
•
14
Transformer-Lite: High-efficiency Deployment of Large Language Models on
Mobile Phone GPUs
Paper
•
2403.20041
•
Published
•
35
LLoCO: Learning Long Contexts Offline
Paper
•
2404.07979
•
Published
•
21
Reka Core, Flash, and Edge: A Series of Powerful Multimodal Language
Models
Paper
•
2404.12387
•
Published
•
39
OpenELM: An Efficient Language Model Family with Open-source Training
and Inference Framework
Paper
•
2404.14619
•
Published
•
127