SequenceModel

community

SequenceModel's activity

Jiayi-Pan

authored 5 papers 19 days ago

World-to-Words: Grounded Open Vocabulary Acquisition through Fast Mapping in Vision-Language Models

Paper • 2306.08685 • Published Jun 14, 2023 • 1

Inversion-Free Image Editing with Natural Language

Paper • 2312.04965 • Published Dec 7, 2023 • 2

ArCHer: Training Language Model Agents via Hierarchical Multi-Turn RL

Paper • 2402.19446 • Published Feb 29, 2024

DANLI: Deliberative Agent for Following Natural Language Instructions

Paper • 2210.12485 • Published Oct 22, 2022

Fine-Tuning Large Vision-Language Models as Decision-Making Agents via Reinforcement Learning

Paper • 2405.10292 • Published May 16, 2024 • 1

mlfu7

authored a paper 5 months ago

In-Context Imitation Learning via Next-Token Prediction

Paper • 2408.15980 • Published Aug 28, 2024 • 10

Jiayi-Pan

authored a paper 6 months ago

OpenDevin: An Open Platform for AI Software Developers as Generalist Agents

Paper • 2407.16741 • Published Jul 23, 2024 • 70

Jiayi-Pan

authored a paper 7 months ago

DigiRL: Training In-The-Wild Device-Control Agents with Autonomous Reinforcement Learning

Paper • 2406.11896 • Published Jun 14, 2024 • 20

longlian

authored a paper 12 months ago

Rethinking Patch Dependence for Masked Autoencoders

Paper • 2401.14391 • Published Jan 25, 2024 • 24

mlfu7

authored a paper 12 months ago

Rethinking Patch Dependence for Masked Autoencoders

Paper • 2401.14391 • Published Jan 25, 2024 • 24

Jiayi-Pan

authored a paper about 1 year ago

Grounding Visual Illusions in Language: Do Vision-Language Models Perceive Illusions Like Humans?

Paper • 2311.00047 • Published Oct 31, 2023 • 9

longlian

authored a paper over 1 year ago

LLM-grounded Video Diffusion Models

Paper • 2309.17444 • Published Sep 29, 2023 • 2

tsbpp

authored a paper over 1 year ago

Emergence of Segmentation with Minimalistic White-Box Transformers

Paper • 2308.16271 • Published Aug 30, 2023 • 13

longlian

authored a paper over 1 year ago

Bootstrapping Objectness from Videos by Relaxed Common Fate and Visual Grouping

Paper • 2304.08025 • Published Apr 17, 2023 • 2

mlfu7

authored a paper over 1 year ago

Robot Learning with Sensorimotor Pre-training

Paper • 2306.10007 • Published Jun 16, 2023 • 13

longlian

authored a paper over 1 year ago

LLM-grounded Diffusion: Enhancing Prompt Understanding of Text-to-Image Diffusion Models with Large Language Models

Paper • 2305.13655 • Published May 23, 2023 • 7