view article Article In-browser LLM app in pure Python: Gemini Nano + Gradio-Lite By whitphx β’ Jul 12, 2024 β’ 10
Planetarium: A Rigorous Benchmark for Translating Text to Structured Planning Languages Paper β’ 2407.03321 β’ Published Jul 3, 2024 β’ 16
AriGraph: Learning Knowledge Graph World Models with Episodic Memory for LLM Agents Paper β’ 2407.04363 β’ Published Jul 5, 2024 β’ 28
Tailor3D: Customized 3D Assets Editing and Generation with Dual-Side Images Paper β’ 2407.06191 β’ Published Jul 8, 2024 β’ 12
LLaVA++ (LLaMA-3 and Phi-3-Mini) Collection Extending Visual Capabilities of LLaVA with LLaMA-3 and Phi-3 β’ 11 items β’ Updated Jun 11, 2024 β’ 23
Awesome Document AI Collection A collection of open-source document AI π π π β’ 27 items β’ Updated Mar 11, 2024 β’ 77
SemantiCodec: An Ultra Low Bitrate Semantic Audio Codec for General Sound Paper β’ 2405.00233 β’ Published Apr 30, 2024 β’ 16
RAPTOR: Recursive Abstractive Processing for Tree-Organized Retrieval Paper β’ 2401.18059 β’ Published Jan 31, 2024 β’ 37
AutoCrawler: A Progressive Understanding Web Agent for Web Crawler Generation Paper β’ 2404.12753 β’ Published Apr 19, 2024 β’ 42