- Text-to-video generation with multi‑shot storytelling
- Image-to-video transformation with subject consistency
- Dual Branch generation producing synchronized audio and video
- Phoneme-level lip sync in 8+ languages
- Natural motion synthesis for realistic and stable movement
- Support for multiple aspect ratios and up to 2K resolution
- Versatile style control (photorealistic, anime, stop motion)