- Quad-modal input: text, images, video clips, audio tracks
- Native audio-video synchronization with precise lip-sync
- Native 2K resolution output
- Auto storyboarding and shot composition
- Smart camera system with pans, zooms, and tracking shots
- Multi-shot narrative consistency across scenes
- Enterprise API and integrations
- Credit-based pricing with multiple subscription tiers