MAPF_G2RL is an open-source research framework that combines graph theory with deep reinforcement learning to solve the multi-agent path finding (MAPF) problem. It encodes graph nodes and edges as vector representations, defines spatial and collision-aware reward functions, and supports RL algorithms such as DQN, PPO, and A2C. The framework automates scenario creation by generating random graphs or importing real-world maps, then runs training loops that optimize the policies of multiple agents simultaneously. After training, agents are evaluated in simulated environments on path optimality, makespan, and success rate. Its modular design lets researchers extend core components, integrate new multi-agent reinforcement learning (MARL) techniques, and benchmark against classical solvers.
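To make the idea of a collision-aware reward and the evaluation metrics concrete, here is a minimal sketch in plain Python. It is not taken from MAPF_G2RL: the function names (`step_rewards`, `makespan`, `success_rate`) and the reward constants are hypothetical, chosen only for illustration. It assumes agents move synchronously on a graph, and it penalizes two standard MAPF conflict types: two agents entering the same node, and two agents swapping positions along an edge.

```python
from typing import Hashable, List, Sequence

Node = Hashable


def step_rewards(
    prev: Sequence[Node],
    nxt: Sequence[Node],
    goals: Sequence[Node],
    step_cost: float = -0.1,          # hypothetical constant
    collision_penalty: float = -1.0,  # hypothetical constant
    goal_reward: float = 1.0,         # hypothetical constant
) -> List[float]:
    """Per-agent rewards for one synchronous move on a graph.

    prev[i] and nxt[i] are agent i's node before and after the move.
    """
    n = len(nxt)
    rewards = [step_cost] * n

    # Reward each agent that is on its goal node after the move.
    for i in range(n):
        if nxt[i] == goals[i]:
            rewards[i] += goal_reward

    # Penalize both agents involved in any pairwise conflict.
    for i in range(n):
        for j in range(i + 1, n):
            vertex_conflict = nxt[i] == nxt[j]
            edge_conflict = (
                prev[i] != prev[j]
                and nxt[i] == prev[j]
                and nxt[j] == prev[i]
            )
            if vertex_conflict or edge_conflict:
                rewards[i] += collision_penalty
                rewards[j] += collision_penalty
    return rewards


def makespan(paths: Sequence[Sequence[Node]]) -> int:
    """Number of moves until the last agent finishes its path."""
    return max(len(p) for p in paths) - 1


def success_rate(episode_solved: Sequence[bool]) -> float:
    """Fraction of evaluation episodes in which all agents reached their goals."""
    return sum(episode_solved) / len(episode_solved)
```

In this sketch, an agent that reaches its goal while swapping places with another agent receives both the goal reward and the collision penalty; whether a collision should instead end the episode is a design choice the source does not specify.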
MADDPG-Keras is a framework for multi-agent reinforcement learning research that implements the MADDPG (multi-agent deep deterministic policy gradient) algorithm in Keras. It supports continuous action spaces, multiple agents, and standard OpenAI Gym environments. Researchers and developers can configure neural network architectures, training hyperparameters, and reward functions, then launch experiments with built-in logging and model checkpointing for multi-agent policy learning and benchmarking.
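Two ingredients of the MADDPG algorithm can be illustrated without any deep learning library: the centralized critic, which during training scores the joint observations and actions of all agents, and the soft (Polyak) update that slowly moves target-network weights toward the learned weights. The sketch below is not MADDPG-Keras code. The function names `centralized_critic_input` and `soft_update` and the default `tau` are hypothetical, and weights are modeled as flat lists of floats rather than Keras tensors.

```python
from typing import List, Sequence


def centralized_critic_input(
    observations: Sequence[Sequence[float]],
    actions: Sequence[Sequence[float]],
) -> List[float]:
    """Build the joint input vector for a centralized critic.

    observations[i] and actions[i] belong to agent i. The result lists
    every agent's observation first, then every agent's action.
    """
    if len(observations) != len(actions):
        raise ValueError("need one observation and one action per agent")
    joint: List[float] = []
    for obs in observations:
        joint.extend(obs)
    for act in actions:
        joint.extend(act)
    return joint


def soft_update(
    target: Sequence[float],
    source: Sequence[float],
    tau: float = 0.01,  # hypothetical default
) -> List[float]:
    """Return target weights moved a fraction tau toward the source weights."""
    return [(1.0 - tau) * t + tau * s for t, s in zip(target, source)]


# Two agents, each with a 2-dimensional observation and a 1-dimensional action.
joint_input = centralized_critic_input(
    [[1.0, 2.0], [3.0, 4.0]],
    [[0.5], [-0.5]],
)
new_target = soft_update([0.0, 1.0], [1.0, 1.0], tau=0.5)
```

At execution time each agent's actor still acts on its own observation only; the joint input is needed only while training the critic.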