1 27 120

Jie

JJ-TMT

AI & ML interests

None yet

Recent Activity

liked a model about 6 hours ago

deepseek-ai/DeepSeek-R1

liked a model about 20 hours ago

jinaai/ReaderLM-v2

authored a paper 1 day ago

CityBench: Evaluating the Capabilities of Large Language Model as World Model

View all activity

Organizations

None yet

JJ-TMT's activity

upvoted a paper 3 days ago

Towards Large Reasoning Models: A Survey of Reinforced Reasoning with Large Language Models

Paper • 2501.09686 • Published 5 days ago • 30

upvoted a paper 6 days ago

MiniMax-01: Scaling Foundation Models with Lightning Attention

Paper • 2501.08313 • Published 7 days ago • 263

upvoted 2 papers 28 days ago

Smarter, Better, Faster, Longer: A Modern Bidirectional Encoder for Fast, Memory Efficient, and Long Context Finetuning and Inference

Paper • 2412.13663 • Published Dec 18, 2024 • 125

Qwen2.5 Technical Report

Paper • 2412.15115 • Published Dec 19, 2024 • 340

upvoted a paper about 1 month ago

LLaVA-o1: Let Vision Language Models Reason Step-by-Step

Paper • 2411.10440 • Published Nov 15, 2024 • 113

upvoted a collection about 2 months ago

PaliGemma 2 Release

Collection

Vision-Language Models available in multiple 3B, 10B and 28B variants. • 23 items • Updated Dec 13, 2024 • 130

upvoted 8 papers 3 months ago

StructRAG: Boosting Knowledge Intensive Reasoning of LLMs via Inference-time Hybrid Information Structurization

Paper • 2410.08815 • Published Oct 11, 2024 • 44

Does Spatial Cognition Emerge in Frontier Models?

Paper • 2410.06468 • Published Oct 9, 2024 • 2

Aria: An Open Multimodal Native Mixture-of-Experts Model

Paper • 2410.05993 • Published Oct 8, 2024 • 108

Benchmarking Agentic Workflow Generation

Paper • 2410.07869 • Published Oct 10, 2024 • 25

Baichuan Alignment Technical Report

Paper • 2410.14940 • Published Oct 19, 2024 • 50

upvoted a paper 4 months ago

NVLM: Open Frontier-Class Multimodal LLMs

Paper • 2409.11402 • Published Sep 17, 2024 • 73

upvoted a collection 4 months ago

Qwen2.5

Collection

Qwen2.5 language models, including pretrained and instruction-tuned models of 7 sizes, including 0.5B, 1.5B, 3B, 7B, 14B, 32B, and 72B. • 45 items • Updated Nov 28, 2024 • 468

upvoted a paper 4 months ago

Towards a Unified View of Preference Learning for Large Language Models: A Survey

Paper • 2409.02795 • Published Sep 4, 2024 • 72

upvoted 2 papers 5 months ago

BaichuanSEED: Sharing the Potential of ExtensivE Data Collection and Deduplication by Introducing a Competitive Large Language Model Baseline

Paper • 2408.15079 • Published Aug 27, 2024 • 53

Show-o: One Single Transformer to Unify Multimodal Understanding and Generation

Paper • 2408.12528 • Published Aug 22, 2024 • 51

upvoted a collection 5 months ago

🪐 SmolLM

Collection

A series of smol LLMs: 135M, 360M and 1.7B. We release base and Instruct models as well as the training corpus and some WebGPU demos • 12 items • Updated 30 days ago • 209