Long(Tony) Lian's picture

Long(Tony) Lian

longlian

·

https://tonylian.com/

TonyLianLong

AI & ML interests

None yet

Recent Activity

upvoted a paper 4 days ago

Training Software Engineering Agents and Verifiers with SWE-Gym

upvoted a paper 13 days ago

Deliberation in Latent Space via Differentiable Cache Augmentation

liked a model 13 days ago

llava-hf/llava-1.5-7b-hf

View all activity

Organizations

longlian's activity

upvoted a paper 4 days ago

Training Software Engineering Agents and Verifiers with SWE-Gym

Paper • 2412.21139 • Published 7 days ago • 16

upvoted a paper 13 days ago

Deliberation in Latent Space via Differentiable Cache Augmentation

Paper • 2412.17747 • Published 14 days ago • 28

upvoted 7 papers 3 months ago

Movie Gen: A Cast of Media Foundation Models

Paper • 2410.13720 • Published Oct 17, 2024 • 90

Rectified Diffusion: Straightness Is Not Your Need in Rectified Flow

Paper • 2410.07303 • Published Oct 9, 2024 • 18

Aria: An Open Multimodal Native Mixture-of-Experts Model

Paper • 2410.05993 • Published Oct 8, 2024 • 108

MonST3R: A Simple Approach for Estimating Geometry in the Presence of Motion

Paper • 2410.03825 • Published Oct 4, 2024 • 19

AuroraCap: Efficient, Performant Video Detailed Captioning and a New Benchmark

Paper • 2410.03051 • Published Oct 4, 2024 • 5

Video Instruction Tuning With Synthetic Data

Paper • 2410.02713 • Published Oct 3, 2024 • 38

Contrastive Localized Language-Image Pre-Training

Paper • 2410.02746 • Published Oct 3, 2024 • 33

upvoted 3 papers 4 months ago

Training Language Models to Self-Correct via Reinforcement Learning

Paper • 2409.12917 • Published Sep 19, 2024 • 136

Language Models Learn to Mislead Humans via RLHF

Paper • 2409.12822 • Published Sep 19, 2024 • 10

In-Context Imitation Learning via Next-Token Prediction

Paper • 2408.15980 • Published Aug 28, 2024 • 9

upvoted 4 papers 6 months ago

OpenDevin: An Open Platform for AI Software Developers as Generalist Agents

Paper • 2407.16741 • Published Jul 23, 2024 • 69

VILA^2: VILA Augmented VILA

Paper • 2407.17453 • Published Jul 24, 2024 • 40

Shape of Motion: 4D Reconstruction from a Single Video

Paper • 2407.13764 • Published Jul 18, 2024 • 19

Cambrian-1: A Fully Open, Vision-Centric Exploration of Multimodal LLMs

Paper • 2406.16860 • Published Jun 24, 2024 • 59

upvoted a paper 10 months ago

Panda-70M: Captioning 70M Videos with Multiple Cross-Modality Teachers

Paper • 2402.19479 • Published Feb 29, 2024 • 32

upvoted a paper 11 months ago

LLM-grounded Video Diffusion Models

Paper • 2309.17444 • Published Sep 29, 2023 • 2

upvoted 2 papers 12 months ago

Rethinking Patch Dependence for Masked Autoencoders

Paper • 2401.14391 • Published Jan 25, 2024 • 23

Towards A Better Metric for Text-to-Video Generation

Paper • 2401.07781 • Published Jan 15, 2024 • 14