Step-Video-T2V
Open-Source, 204-Frame Video Generation from Text
Featured
6 Votes







Description
Step-Video-T2V is the open-source text-to-video model series from StepFun. Up to 204-frame generation, high compression Video-VAE, and video-based DPO for enhanced quality. Achieves SOTA on Step-Video-T2V-Eval.