thuml/ivideogpt-oxe-64-act-free-medium


	---
	license: mit
	tags:
	- world-model
	- open-x-embodiment
	- robotic-manipulation
	- video-generation
	- video-prediction
	- gpt
	---

	# iVideoGPT (Pre-trained on Open X-Embodiment, 64x64 resolution, action-free, medium size)

	Pre-trained model introduced in the paper [iVideoGPT: Interactive VideoGPTs are Scalable World Models](https://arxiv.org/abs/2405.15223).

	See https://github.com/thuml/iVideoGPT for examples of how to use this model.
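
Before running those examples, the checkpoint files may need to be available locally. The following is a minimal sketch, not taken from the iVideoGPT repository, that fetches this model repo with `snapshot_download` from the `huggingface_hub` package. The helper name `download_checkpoint` and the constant `REPO_ID` are hypothetical, and the sketch assumes `huggingface_hub` is installed. Loading the weights into a model is left to the code in the GitHub repository.

```python
from pathlib import Path
from typing import Optional

# Repository id as shown on this model card's page.
REPO_ID = "thuml/ivideogpt-oxe-64-act-free-medium"


def download_checkpoint(local_dir: Optional[str] = None) -> Path:
    """Download every file in the model repository and return the local folder.

    Hypothetical helper for illustration; it is not part of iVideoGPT.
    """
    # Imported lazily so this module can be imported even when
    # huggingface_hub is not installed.
    from huggingface_hub import snapshot_download

    # Assumption: a plain snapshot of the repo is what the examples need.
    # With local_dir=None the files go to the default Hugging Face cache.
    return Path(snapshot_download(repo_id=REPO_ID, local_dir=local_dir))
```

Calling `download_checkpoint()` requires network access and returns the directory that holds the downloaded files. That path can then be passed to whichever loading routine the GitHub examples use.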

	## Citation

	```
	@inproceedings{wu2024ivideogpt,
	title={iVideoGPT: Interactive VideoGPTs are Scalable World Models},
	author={Jialong Wu and Shaofeng Yin and Ningya Feng and Xu He and Dong Li and Jianye Hao and Mingsheng Long},
	booktitle={Advances in Neural Information Processing Systems},
	year={2024}
	}
	```