Stack Overflow
1 vote
1 answer
90 views

I tried training a BC algorithm using offline data and enabled the RL module in the algorithm configuration. I ran the code on Google Colab, which only provides 2 CPUs, and encountered the following ...
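A minimal sketch of the kind of resource-constrained BC setup this question describes; the dataset path and env id below are placeholders, and config method names shift between Ray versions, so treat this as an assumption rather than the asker's code:

```python
# Sketch only: placeholder dataset path and env id; method names vary by Ray version.
from ray.rllib.algorithms.bc import BCConfig

config = (
    BCConfig()
    .environment("CartPole-v1")              # placeholder env for the obs/action spaces
    .offline_data(input_="my_offline_data")  # hypothetical path to the recorded data
    .env_runners(num_env_runners=0)          # no extra workers, to fit Colab's 2 CPUs
)
algo = config.build()
```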
3 votes
0 answers
69 views

I am using Ray 2.50.1 to implement a MARL model with PPO. However, I run into the following problem: KeyError: 'advantages' During handling of the above exception, another exception occurred: ...
1 vote
1 answer
138 views

I'm using a slight variant of the RockPaperScissors multi-agent environment from the Ray RLlib documentation as a test environment to verify that a custom RLModule for Centralized Training, ...
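For context, a generic sketch of the two-policy wiring such RockPaperScissors variants use; the env id and policy ids are placeholders, and the policy_mapping_fn signature differs slightly between the old and new API stacks:

```python
from ray.rllib.algorithms.ppo import PPOConfig

config = (
    PPOConfig()
    .environment("my_rps_env")  # hypothetical registered multi-agent env
    .multi_agent(
        policies={"player_0", "player_1"},
        # Map each agent id to the policy of the same name.
        policy_mapping_fn=lambda agent_id, *args, **kwargs: agent_id,
    )
)
```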
0 votes
1 answer
97 views

When I run the following code I get the error 'module' object is not callable: if __name__ == "__main__": env_name = "4x4grid" register_env( env_name, lambda _: ...
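The usual cause of that message is calling a module where a class or function is expected. A minimal sketch of the fix, with `my_envs.GridEnv` standing in for whatever package actually defines the 4x4grid environment:

```python
from ray.tune.registry import register_env  # the function, not a module
import my_envs  # hypothetical package providing the env

if __name__ == "__main__":
    env_name = "4x4grid"
    # Wrong: register_env(env_name, lambda _: my_envs(...))  -> 'module' object is not callable
    # Right: call a constructor *inside* the module:
    register_env(env_name, lambda cfg: my_envs.GridEnv())
```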
2 votes
1 answer
194 views

I'm training a transformer model using RLlib's PPO algorithm, but I encounter a device mismatch error: RuntimeError: Expected all tensors to be on the same device, but found at least two devices, ...
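A sketch of the usual guard against that RuntimeError (not the asker's model): move incoming tensors onto whatever device RLlib placed the module's parameters on before using them.

```python
import torch
import torch.nn as nn

class MyTransformerHead(nn.Module):  # hypothetical stand-in for the real model
    def __init__(self, dim: int = 16):
        super().__init__()
        self.proj = nn.Linear(dim, dim)

    def forward(self, obs: torch.Tensor) -> torch.Tensor:
        device = next(self.parameters()).device  # cuda:0 or cpu, wherever RLlib put us
        return self.proj(obs.to(device))         # inputs now match the parameters
```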
1 vote
2 answers
318 views

Can anyone explain to me why the episode_reward_mean is NOT part of the results dictionary? Is it replaced by a different key in the latest API? I see env_runners/episode_return_mean and env_runners/...
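On the new API stack the top-level episode_reward_mean key is indeed gone; the same statistic lives under the env_runners sub-dict as episode_return_mean. A sketch, assuming `algo` is an already-built Algorithm:

```python
results = algo.train()

# Old API stack (top-level key, absent on the new stack):
old_style = results.get("episode_reward_mean")  # -> None on the new stack

# New API stack: metrics are nested under the "env_runners" sub-dict.
new_style = results["env_runners"]["episode_return_mean"]
```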
1 vote
1 answer
80 views

I trained a PPO model with action space self.action_space = gym.spaces.Box(-1, 1, (1,), data_type) with RLlib. But when I use the trained model to manually call forward_inference, the inference ...
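On the new API stack, forward_inference returns distribution inputs rather than finished actions, so the Box bounds are never applied for you. A sketch under that assumption (names follow current RLlib docs; `rl_module` and `obs` are placeholders):

```python
import numpy as np
import torch

batch = {"obs": torch.tensor([obs], dtype=torch.float32)}  # `obs` from your env
out = rl_module.forward_inference(batch)                   # `rl_module` already restored
dist_cls = rl_module.get_inference_action_dist_cls()
dist = dist_cls.from_logits(out["action_dist_inputs"])
action = dist.to_deterministic().sample()[0].detach().cpu().numpy()
action = np.clip(action, -1.0, 1.0)  # enforce the Box(-1, 1) bounds yourself
```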
1 vote
1 answer
69 views

I am quite new to reinforcement learning and can't fully understand it. I am unable to update the configuration for the batch data using PPO. I am using my custom-defined Gym environment and want to train it ...
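For reference, a minimal sketch of where PPO's batch settings live; the env id and numbers are placeholders, not recommendations:

```python
from ray.rllib.algorithms.ppo import PPOConfig

config = (
    PPOConfig()
    .environment("MyCustomEnv-v0")  # hypothetical registered custom Gym env
    .training(
        train_batch_size=2000,  # samples collected per training iteration
        gamma=0.99,
        lr=5e-5,
    )
)
algo = config.build()
```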
2 votes
0 answers
111 views

I'm creating my own Gym environment to test the freeze-tag problem. I'm trying to use Ray to do MAPPO. I have two problems: 1. My simulation is not rendering. 2. It's creating multiple PyGame windows. I'...
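Multiple windows usually mean multiple env copies: every env-runner worker constructs its own environment. A sketch of keeping a single env on the driver, assuming the newer config API (older versions spell this .rollouts(num_rollout_workers=0)):

```python
# `config` is the asker's (MA)PPO AlgorithmConfig.
config = config.env_runners(num_env_runners=0)  # sample on the driver -> one env, one window
```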
0 votes
1 answer
132 views

I'm trying to create a custom MLP-based policy in Ray RLlib using the code below (Python 3.10, Ray RLlib 2.23): class CustomMLPModel(TorchModelV2, nn.Module): def __init__(self, obs_space, ...
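The documented TorchModelV2 skeleton for that Ray line requires calling BOTH parent initializers; skipping nn.Module.__init__ is a common source of errors here. A sketch with placeholder layer sizes:

```python
import numpy as np
import torch.nn as nn
from ray.rllib.models.torch.torch_modelv2 import TorchModelV2

class CustomMLPModel(TorchModelV2, nn.Module):
    def __init__(self, obs_space, action_space, num_outputs, model_config, name):
        TorchModelV2.__init__(self, obs_space, action_space, num_outputs, model_config, name)
        nn.Module.__init__(self)  # required alongside the TorchModelV2 init
        in_size = int(np.prod(obs_space.shape))
        self.hidden = nn.Sequential(nn.Linear(in_size, 64), nn.ReLU())
        self.logits = nn.Linear(64, num_outputs)
        self.value_branch = nn.Linear(64, 1)
        self._features = None

    def forward(self, input_dict, state, seq_lens):
        self._features = self.hidden(input_dict["obs"].float().flatten(1))
        return self.logits(self._features), state

    def value_function(self):
        return self.value_branch(self._features).squeeze(1)
```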
1 vote
0 answers
62 views

I have this very basic custom parallel multi agent environment written in PettingZoo. import functools import random from copy import copy import numpy as np from gymnasium.spaces import Discrete, ...
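For comparison, a compact sketch of the parallel-API surface (agent names, spaces, and the one-step episode are placeholders, not the asker's env):

```python
import functools
from gymnasium.spaces import Discrete
from pettingzoo import ParallelEnv

class TwoAgentEnv(ParallelEnv):
    metadata = {"name": "two_agent_v0"}

    def __init__(self):
        self.possible_agents = ["a0", "a1"]
        self.agents = []

    def reset(self, seed=None, options=None):
        self.agents = self.possible_agents[:]
        return {a: 0 for a in self.agents}, {a: {} for a in self.agents}

    def step(self, actions):
        obs = {a: 0 for a in self.agents}
        rewards = {a: 1.0 for a in self.agents}
        terminations = {a: True for a in self.agents}  # end after one step, for brevity
        truncations = {a: False for a in self.agents}
        infos = {a: {} for a in self.agents}
        self.agents = []  # parallel API: clear the agent list once everyone is done
        return obs, rewards, terminations, truncations, infos

    @functools.lru_cache(maxsize=None)
    def observation_space(self, agent):
        return Discrete(2)

    @functools.lru_cache(maxsize=None)
    def action_space(self, agent):
        return Discrete(2)
```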
0 votes
0 answers
30 views

I am having problems initializing the LSTM layers for PPO+LSTM in RLlib. The inputs expected are different from what I provide, and I do not understand why. Here is my code: class CustomTorchModel(...
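The shape contract that usually trips this up: RLlib hands forward_rnn() time-batched inputs of shape [B, T, feature_size], and the state tensors must match get_initial_state(). A sketch under the old model API (sizes are placeholders):

```python
import torch
import torch.nn as nn
from ray.rllib.models.torch.recurrent_net import RecurrentNetwork

class MyLSTMModel(RecurrentNetwork, nn.Module):
    def __init__(self, obs_space, action_space, num_outputs, model_config, name):
        nn.Module.__init__(self)
        super().__init__(obs_space, action_space, num_outputs, model_config, name)
        self.cell_size = 64
        self.lstm = nn.LSTM(int(obs_space.shape[0]), self.cell_size, batch_first=True)
        self.logits = nn.Linear(self.cell_size, num_outputs)
        self.value_branch = nn.Linear(self.cell_size, 1)
        self._features = None

    def get_initial_state(self):
        # One [cell_size] tensor each for h and c; RLlib adds the batch dim itself.
        return [torch.zeros(self.cell_size), torch.zeros(self.cell_size)]

    def forward_rnn(self, inputs, state, seq_lens):
        # inputs: [B, T, obs_dim]; state: two [B, cell_size] tensors.
        h, c = state[0].unsqueeze(0), state[1].unsqueeze(0)
        self._features, (h, c) = self.lstm(inputs, (h, c))
        return self.logits(self._features), [h.squeeze(0), c.squeeze(0)]

    def value_function(self):
        return self.value_branch(self._features).reshape(-1)
```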
1 vote
0 answers
49 views

I'm trying to use the AttentionNet GTrXL from RLlib with a dictionary/tuple Gym input. I found this example of complex inputs: Complex input nets. Now I'm not sure how to combine the two properly. I would ...
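One low-effort way to combine them, assuming the old model-config keys: let RLlib's built-in preprocessing flatten the Dict/Tuple observation and wrap the default encoder in GTrXL via use_attention, rather than hand-building the AttentionNet:

```python
# `config` is the asker's AlgorithmConfig; key names per MODEL_DEFAULTS.
config = config.training(
    model={
        "use_attention": True,               # wrap the default net in a GTrXL stack
        "attention_num_transformer_units": 1,
        "attention_dim": 64,                 # placeholder size
    }
)
```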
0 votes
0 answers
326 views

I'm encountering an AttributeError when trying to run a PPO trainer on an OR-GYM environment for inventory management using Ray RLlib and PyTorch in a CPU-only setup. Despite explicitly setting ...
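The error itself is truncated above, so only the CPU-only settings can be sketched; the env id is the or-gym inventory-management one, but treat the whole block as an assumption:

```python
import ray
from ray.rllib.algorithms.ppo import PPOConfig

ray.init(num_cpus=4)  # hypothetical CPU budget
config = (
    PPOConfig()
    .environment("InvManagement-v1")  # or-gym env id (needs a registered env creator)
    .framework("torch")
    .resources(num_gpus=0)            # pin everything to CPU
)
```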
1 vote
0 answers
51 views

I am using Ray RLlib to save and load checkpoints. I am using the same version across Mac and Linux. I want to be able to train on Linux and infer on my Mac, but I am getting the following error: ...
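For reference, the documented round trip; whether a checkpoint restores across OSes also depends on matching Python and Torch versions, not only the Ray version. A sketch, assuming `algo` is a built Algorithm:

```python
from ray.rllib.algorithms.algorithm import Algorithm

checkpoint_dir = algo.save_to_path("my_checkpoint")  # newer API; older Ray: algo.save()
restored = Algorithm.from_checkpoint(checkpoint_dir)
```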
