usr2015
's Collections
DocLLM: A layout-aware generative language model for multimodal document
understanding
Paper
•
2401.00908
•
Published
•
181
Learning Vision from Models Rivals Learning Vision from Data
Paper
•
2312.17742
•
Published
•
15
PanGu-π: Enhancing Language Model Architectures via Nonlinearity
Compensation
Paper
•
2312.17276
•
Published
•
15
Infinite-LLM: Efficient LLM Service for Long Context with DistAttention
and Distributed KVCache
Paper
•
2401.02669
•
Published
•
14
DeepSeek LLM: Scaling Open-Source Language Models with Longtermism
Paper
•
2401.02954
•
Published
•
41
Blending Is All You Need: Cheaper, Better Alternative to
Trillion-Parameters LLM
Paper
•
2401.02994
•
Published
•
49
CRUXEval: A Benchmark for Code Reasoning, Understanding and Execution
Paper
•
2401.03065
•
Published
•
11
Let's Go Shopping (LGS) -- Web-Scale Image-Text Dataset for Visual
Concept Understanding
Paper
•
2401.04575
•
Published
•
14
Bootstrapping LLM-based Task-Oriented Dialogue Agents via Self-Talk
Paper
•
2401.05033
•
Published
•
16
Efficient LLM inference solution on Intel GPU
Paper
•
2401.05391
•
Published
•
9
Towards Conversational Diagnostic AI
Paper
•
2401.05654
•
Published
•
16
TrustLLM: Trustworthiness in Large Language Models
Paper
•
2401.05561
•
Published
•
66
DocGraphLM: Documental Graph Language Model for Information Extraction
Paper
•
2401.02823
•
Published
•
35
ChatQA: Building GPT-4 Level Conversational QA Models
Paper
•
2401.10225
•
Published
•
34
Spotting LLMs With Binoculars: Zero-Shot Detection of Machine-Generated
Text
Paper
•
2401.12070
•
Published
•
43
VideoPrism: A Foundational Visual Encoder for Video Understanding
Paper
•
2402.13217
•
Published
•
23
OmniPred: Language Models as Universal Regressors
Paper
•
2402.14547
•
Published
•
12
API-BLEND: A Comprehensive Corpora for Training and Benchmarking API
LLMs
Paper
•
2402.15491
•
Published
•
13
DeepSeek-VL: Towards Real-World Vision-Language Understanding
Paper
•
2403.05525
•
Published
•
40
LLM Agent Operating System
Paper
•
2403.16971
•
Published
•
65
FlowMind: Automatic Workflow Generation with LLMs
Paper
•
2404.13050
•
Published
•
33