Ahalder's Collections
NLP LLM
updated
DistilBERT, a distilled version of BERT: smaller, faster, cheaper and lighter
Paper • 1910.01108 • Published • 14
distilbert/distilbert-base-uncased-finetuned-sst-2-english
Text Classification • Updated • 6.52M • 659
FP6-LLM: Efficiently Serving Large Language Models Through FP6-Centric Algorithm-System Co-Design
Paper • 2401.14112 • Published • 18
GPT-4V(ision) is a Human-Aligned Evaluator for Text-to-3D Generation
Paper • 2401.04092 • Published • 21
TheBloke/Orca2myth7.2-GGUF
Updated • 194 • 9
MobiLlama: Towards Accurate and Lightweight Fully Transparent GPT
Paper • 2402.16840 • Published • 23
GPTVQ: The Blessing of Dimensionality for LLM Quantization
Paper • 2402.15319 • Published • 19
🐨 KOALA
Beyond A*: Better Planning with Transformers via Search Dynamics Bootstrapping
Paper • 2402.14083 • Published • 47
jinaai/reader-lm-0.5b
Text Generation • Updated • 507 • 129
google/datagemma-rag-27b-it
Text Generation • Updated • 8.85k • 176
kyutai/mimi
Feature Extraction • Updated • 5.93M • 95
Promptriever: Instruction-Trained Retrievers Can Be Prompted Like Language Models
Paper • 2409.11136 • Published • 22
upstage/solar-pro-preview-instruct
Text Generation • Updated • 14.6k • 438
🌖 Recommend Similar Papers
Scaling Smart: Accelerating Large Language Model Pre-training with Small Model Initialization
Paper • 2409.12903 • Published • 22
MonoFormer: One Transformer for Both Diffusion and Autoregression
Paper • 2409.16280 • Published • 18