- Multi-agent environment scaffolding
- Modular agent and policy definitions
- Customizable reward-sharing mechanisms
- Built-in RL algorithms (DQN, PPO, A3C)
- Scenario templating and dynamic configs
- Training loop management and callbacks
- Performance logging and visualization
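
As a rough illustration of what a reward-sharing mechanism can look like, the sketch below blends each agent's own reward with the team mean. It is not this project's API: the function name `share_rewards`, the `alpha` parameter, and the dict-of-rewards layout are all assumptions made for the example.

```python
from typing import Dict


def share_rewards(rewards: Dict[str, float], alpha: float = 0.5) -> Dict[str, float]:
    """Blend each agent's own reward with the team mean (hypothetical helper).

    alpha = 0.0 keeps rewards fully individual.
    alpha = 1.0 gives every agent the team mean.
    """
    if not 0.0 <= alpha <= 1.0:
        raise ValueError("alpha must be in [0, 1]")
    if not rewards:
        return {}
    team_mean = sum(rewards.values()) / len(rewards)
    # Convex combination of the individual reward and the team mean,
    # so the total reward across agents is unchanged.
    return {
        agent: (1.0 - alpha) * reward + alpha * team_mean
        for agent, reward in rewards.items()
    }


if __name__ == "__main__":
    raw = {"agent_0": 1.0, "agent_1": 3.0}
    print(share_rewards(raw, alpha=0.5))
```

Because the blend is a convex combination, the summed team reward stays the same for any `alpha`; only its distribution across agents changes. That makes `alpha` a single knob between fully competitive and fully cooperative credit assignment.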