Comprehensive 自定義獎勵函數 Tools in One Place

Sponsored by Flowith - Flowith is a canvas-based agentic workspace which offers free 🍌Nano Banana Pro and other effective models...



Flowith - Flowith is a canvas-based agentic workspace which offers free 🍌Nano Banana Pro and other effective models...





AI News

自定義獎勵函數

MARFT
MARFT is an open-source multi-agent RL fine-tuning toolkit for collaborative AI workflows and language model optimization.

0


0
Visit AI
What is MARFT?
MARFT is a Python-based LLMs, enabling reproducible experiments and rapid prototyping of collaborative AI systems.
MARFT Core Features
Multi-Agent Surveillance
Open-source Python environment for training AI agents to cooperatively surveil and detect intruders in grid-based scenarios.

0


0
Visit AI
What is Multi-Agent Surveillance?
Multi-Agent Surveillance offers a flexible simulation framework where multiple AI agents act as predators or evaders in a discrete grid world. Users can configure environment parameters such as grid dimensions, number of agents, detection radii, and reward structures. The repository includes Python classes for agent behavior, scenario generation scripts, built-in visualization via matplotlib, and seamless integration with popular reinforcement learning libraries. This makes it easy to benchmark multi-agent coordination, develop custom surveillance strategies, and conduct reproducible experiments.
Multi-Agent Surveillance Core Features
Multi-Agent DDPG with PyTorch & Unity ML-Agents
Implements decentralized multi-agent DDPG reinforcement learning using PyTorch and Unity ML-Agents for collaborative agent training.

0


0
Visit AI
What is Multi-Agent DDPG with PyTorch & Unity ML-Agents?
This open-source project delivers a complete multi-agent reinforcement learning framework built on PyTorch and Unity ML-Agents. It offers decentralized DDPG algorithms, environment wrappers, and training scripts. Users can configure agent policies, critic networks, replay buffers, and parallel training workers. Logging hooks allow TensorBoard monitoring, while modular code supports custom reward functions and environment parameters. The repository includes sample Unity scenes demonstrating collaborative navigation tasks, making it ideal for extending and benchmarking multi-agent scenarios in simulation.
Multi-Agent DDPG with PyTorch & Unity ML-Agents Core Features
RL Shooter
RL Shooter provides a customizable Doom-based reinforcement learning environment for training AI agents to navigate and shoot targets.

0


0
Visit AI
What is RL Shooter?
RL Shooter is a Python-based framework that integrates ViZDoom with OpenAI Gym APIs to create a flexible reinforcement learning environment for FPS games. Users can define custom scenarios, maps, and reward structures to train agents on navigation, target detection, and shooting tasks. With configurable observation frames, action spaces, and logging facilities, it supports popular deep RL libraries such as Stable Baselines and RLlib, enabling clear performance tracking and reproducibility across experiments.
RL Shooter Core Features
Shepherding
Shepherding is a Python-based RL framework for training AI agents to herd and guide multiple agents in simulations.

0


0
Visit AI
What is Shepherding?
Shepherding is an open-source simulation framework designed for reinforcement learning researchers and developers to study and implement multi-agent herding tasks. It provides a Gym-compatible environment where agents can be trained to perform behaviors such as flanking, collecting, and dispersing target groups across continuous or discrete spaces. The framework includes modular reward shaping functions, environment parameterization, and logging utilities for monitoring training performance. Users can define obstacles, dynamic agent populations, and custom policies using TensorFlow or PyTorch. Visualization scripts generate trajectory plots and video recordings of agent interactions. Shepherding’s modular design allows seamless integration with existing RL libraries, enabling reproducible experiments, benchmarking of novel coordination strategies, and rapid prototyping of AI-driven herding solutions.
Shepherding Core Features
Simple Playgrounds
A lightweight Python library for creating customizable 2D grid environments to train and test reinforcement learning agents.

0


0
Visit AI
What is Simple Playgrounds?
Simple Playgrounds provides a modular platform for building interactive 2D grid environments where agents can navigate mazes, interact with objects, and complete tasks. Users define environment layouts, object behaviors, and reward functions via simple YAML or Python scripts. The integrated Pygame renderer delivers real-time visualization, while a step-based API ensures seamless integration with reinforcement learning libraries like Stable Baselines3. With support for multi-agent setups, collision detection, and customizable physics parameters, Simple Playgrounds streamlines the prototyping, benchmarking, and educational demonstration of AI algorithms.
Simple Playgrounds Core Features



Featured

自定義獎勵函數

MARFT

Multi-Agent Surveillance

Multi-Agent DDPG with PyTorch & Unity ML-Agents

RL Shooter

Shepherding

Simple Playgrounds