	---
	license: other
	license_name: sv3d-nc-community
	license_link: LICENSE
	datasets:
	- allenai/objaverse
	pipeline_tag: image-to-3d
	extra_gated_prompt: >-
	  By clicking here, you accept the License agreement, and will use the Software
	  Products and Derivative Works for non-commercial or research purposes only. A
	  commercial license is required to self-host the Software Products for
	  commercial purposes. [Please learn more about our self-hosted Membership
	  options here](https://stability.ai/membership).

	  By clicking below, you agree to sharing with Stability AI the information
	  contained within this form and that Stability AI can contact you for the
	  purposes of marketing our products and services.
	extra_gated_fields:
	  I agree: checkbox
	  Yes, I consent to receiving Stability AI marketing communications: checkbox
	---
	# Stable Video 3D
	![](sv3doutputs.gif)
	Stable Video 3D (SV3D) is a generative model based on [Stable Video Diffusion](https://huggingface.co/stabilityai/stable-video-diffusion-img2vid-xt) that takes in a still image of an object as a conditioning frame, and generates an orbital video of that object.

	Please note: For commercial use, please refer to https://stability.ai/membership.

	## Model Details

	This model was finetuned from SVD Image-to-Video and trained to generate 21 frames at a resolution of 576x576, given a context frame of the same size. Please check our [tech report](https://stability.ai/s/SV3D_report.pdf) and [video summary](https://youtu.be/Zqw4-1LcfWg) for details.

	We release two variants of the model:
	1. SV3D_u: This variant generates orbital videos based on single image inputs without camera conditioning.
	2. SV3D_p: Extending the capability of SV3D_u, this variant accommodates both single images and orbital views, allowing for the creation of 3D video along specified camera paths.
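As a rough illustration of what a "specified camera path" for SV3D_p could look like, the sketch below builds one elevation and one azimuth per frame for a 21-frame orbit. The helper name `orbit_camera_path`, the evenly spaced azimuths, and the sinusoidal elevation profile are assumptions made for this example; they are not taken from the model's training or sampling code.

```python
import math

NUM_FRAMES = 21  # SV3D generates 21 frames per orbit (see Model Details)


def orbit_camera_path(base_elevation_deg=10.0, elevation_swing_deg=0.0,
                      num_frames=NUM_FRAMES):
    """Return (elevations, azimuths) in degrees, one value per frame.

    Hypothetical helper: azimuths are evenly spaced and end at 360 degrees;
    elevations follow one sine cycle around the base elevation.
    """
    # Evenly spaced azimuths in (0, 360], sorted in increasing order.
    azimuths = [360.0 * (i + 1) / num_frames for i in range(num_frames)]

    # A swing of 0 gives a static orbit at a constant elevation.
    elevations = [
        base_elevation_deg + elevation_swing_deg * math.sin(math.radians(az))
        for az in azimuths
    ]

    # Elevations outside [-90, 90] degrees do not describe a valid camera pose.
    if any(not -90.0 <= e <= 90.0 for e in elevations):
        raise ValueError("elevations must stay within [-90, 90] degrees")
    return elevations, azimuths


if __name__ == "__main__":
    elevations, azimuths = orbit_camera_path(10.0, elevation_swing_deg=5.0)
    for frame, (e, a) in enumerate(zip(elevations, azimuths)):
        print(f"frame {frame:2d}: elevation {e:6.2f}, azimuth {a:6.2f}")
```

With the default swing of 0, every frame shares the same elevation, which corresponds to a plain turntable-style orbit; a non-zero swing produces a path that rises and falls once per revolution.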


	### Model Description

	* Developed by: [Stability AI](https://stability.ai/)
	* Model type: Generative image-to-video model
	* License: [StabilityAI Non-Commercial Research Community License](https://huggingface.co/stabilityai/sv3d/raw/main/LICENSE).
	* Commercial License: to use this model commercially, please refer to https://stability.ai/membership


	### Model Sources

	* Repository: https://github.com/Stability-AI/generative-models
	* Tech report: https://stability.ai/s/SV3D_report.pdf
	* Video summary: https://youtu.be/Zqw4-1LcfWg
	* Project page: https://sv3d.github.io
	* arXiv page: https://arxiv.org/abs/2403.12008

	### Training Dataset

	We use renders from the [Objaverse](https://objaverse.allenai.org/objaverse-1.0) dataset, utilizing our enhanced rendering method that more closely replicates the distribution of images found in the real world, significantly improving our model’s ability to generalize. We selected a carefully curated subset of the Objaverse dataset for the training data, which is available under the CC-BY license.


	## Usage

	For usage instructions, please refer to our [generative models GitHub repository](https://github.com/Stability-AI/generative-models).
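For orientation only, the sketch below assembles a command line for the sampling script in that repository. The script path `scripts/sampling/simple_video_sample.py` and the flags `--input_path`, `--version`, and `--elevations_deg` are given as best recalled from the repository README and should be treated as assumptions; please verify them against the repository before use. The helper `build_sv3d_command` and the image path are hypothetical.

```python
import shlex
import sys

# Assumed location of the sampling script inside the generative-models repo.
SCRIPT = "scripts/sampling/simple_video_sample.py"
VARIANTS = ("sv3d_u", "sv3d_p")  # the two released variants


def build_sv3d_command(image_path, version="sv3d_u", elevation_deg=None):
    """Build an argument list for the (assumed) SV3D sampling script.

    Hypothetical helper: it only constructs the command, it does not run it.
    """
    if version not in VARIANTS:
        raise ValueError(f"unknown variant: {version!r}")
    # SV3D_u has no camera conditioning, so an elevation only makes sense
    # for the camera-conditioned SV3D_p variant.
    if elevation_deg is not None and version != "sv3d_p":
        raise ValueError("elevation_deg requires version='sv3d_p'")

    cmd = [sys.executable, SCRIPT,
           "--input_path", image_path,
           "--version", version]
    if elevation_deg is not None:
        cmd += ["--elevations_deg", str(float(elevation_deg))]
    return cmd


if __name__ == "__main__":
    # Print the command instead of executing it, so it can be inspected first.
    cmd = build_sv3d_command("assets/my_object.png", "sv3d_p", 10.0)
    print(shlex.join(cmd))
```

To actually run the command, the resulting list could be passed to `subprocess.run(cmd, check=True)` from a checkout of the repository with the model weights in place.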


	### Out-of-Scope Use

	The model was not trained to produce factual or true representations of people or events,
	and therefore using the model to generate such content is out of scope for its abilities.
	The model should not be used in any way that violates Stability AI's [Acceptable Use Policy](https://stability.ai/use-policy).