- Text-to-video generation with synchronized audio (dialogue, sound effects, music)
- Image-to-video animation and first-last-frame transitions
- Multi-reference input: up to 9 images, 3 videos, and 3 audio files per generation
- Beat-sync: generate visuals timed to uploaded music tracks
- Video extension and prompt-based video editing
- Multiple underlying models (Seedance, Sora, Veo, Kling), selectable from one interface
- 1080p output in multiple aspect ratios, downloadable as MP4
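
The multi-reference limits listed above (up to 9 images, 3 videos, and 3 audio files per generation) can be checked on the client side before anything is uploaded. The following is a minimal sketch under that assumption only: `ReferenceSet` and `check_reference_limits` are hypothetical names made up for illustration and are not part of any documented API for this product.

```python
from dataclasses import dataclass, field

# Per-generation limits taken from the feature list above.
MAX_IMAGES = 9
MAX_VIDEOS = 3
MAX_AUDIO = 3


@dataclass
class ReferenceSet:
    """Hypothetical container for the reference files attached to one generation."""
    images: list[str] = field(default_factory=list)
    videos: list[str] = field(default_factory=list)
    audio: list[str] = field(default_factory=list)


def check_reference_limits(refs: ReferenceSet) -> list[str]:
    """Return human-readable limit violations; an empty list means the set is valid."""
    limits = {
        "images": (len(refs.images), MAX_IMAGES),
        "videos": (len(refs.videos), MAX_VIDEOS),
        "audio": (len(refs.audio), MAX_AUDIO),
    }
    return [
        f"{kind}: {count} supplied, limit is {limit}"
        for kind, (count, limit) in limits.items()
        if count > limit
    ]


if __name__ == "__main__":
    # Example file names are placeholders.
    refs = ReferenceSet(images=["ref.png"] * 10, audio=["track.mp3"])
    for problem in check_reference_limits(refs):
        print(problem)
```

Returning a list of violations rather than raising on the first one lets a caller report every exceeded limit at once.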