Pits and Orbs is a lightweight Python-based multi-agent grid-world environment designed for reinforcement learning research and education. It simulates turn-based gameplay where agents navigate a grid, avoid deadly pits, gather orbs for rewards, and interact competitively or cooperatively. With customizable grid sizes and reward configurations, it provides a flexible testbed for developing and benchmarking RL algorithms.
Pits and Orbs is a lightweight Python-based multi-agent grid-world environment designed for reinforcement learning research and education. It simulates turn-based gameplay where agents navigate a grid, avoid deadly pits, gather orbs for rewards, and interact competitively or cooperatively. With customizable grid sizes and reward configurations, it provides a flexible testbed for developing and benchmarking RL algorithms.
Pits and Orbs is an open-source reinforcement learning environment implemented in Python, offering a turn-based multi-agent grid-world where agents pursue objectives and face environmental hazards. Each agent must navigate a customizable grid, avoid randomly placed pits that penalize or terminate episodes, and collect orbs for positive rewards. The environment supports both competitive and cooperative modes, enabling researchers to explore varied learning scenarios. Its simple API integrates seamlessly with popular RL libraries like Stable Baselines or RLlib. Key features include adjustable grid dimensions, dynamic pit and orb distributions, configurable reward structures, and optional logging for training analysis.
Who will use Pits and Orbs?
Reinforcement Learning researchers
AI educators
Game AI developers
Students and hobbyists in AI
How to use the Pits and Orbs?
Step1: Clone the GitHub repository or install via pip
Step2: Import the PitsAndOrbs environment in your Python script
Step3: Configure grid dimensions, pit and orb settings
Step4: Wrap the environment with an RL interface (e.g., OpenAI Gym)
Step5: Train and evaluate your agent with chosen learning algorithm
Step6: Analyze performance metrics and refine parameters