- 4K multi-shot video generation
- Native audio synthesis and lip-sync
- Thinking Mode image generation
- First/Last Frame control
- 9-grid and multi-reference input
- Text-to-image and image-to-image workflows
- Character consistency across scenes
- Transparent PNG export