DataEnvGym delivers a collection of modular, customizable environments built on the Gym API to facilitate reinforcement learning research in data-driven domains. Researchers and engineers can select from built-in tasks like data cleaning, feature engineering, batch scheduling, and streaming analytics. The framework supports seamless integration with popular RL libraries, standardized benchmarking metrics, and logging tools to track agent performance. Users can extend or combine environments to model complex data pipelines and evaluate algorithms under realistic constraints.