8 questions
0 votes · 0 answers · 52 views
Unable to reproduce training results in a DummyVecEnv using stable-baselines3
I created a custom Gymnasium environment and trained an agent using Stable-Baselines3 with DummyVecEnv and VecNormalize. The agent performs well during training and consistently reaches the goal. ...
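A common cause is that the VecNormalize statistics from training are not reused (and frozen) at evaluation time. Below is a minimal sketch of that workflow, assuming CartPole as a stand-in for the custom environment and illustrative file names:

import gymnasium as gym
from stable_baselines3 import PPO
from stable_baselines3.common.vec_env import DummyVecEnv, VecNormalize

def make_env():
    return gym.make("CartPole-v1")  # stand-in for the custom environment

# Training: VecNormalize accumulates running obs/reward statistics.
train_env = VecNormalize(DummyVecEnv([make_env]), norm_obs=True, norm_reward=True)
model = PPO("MlpPolicy", train_env)
model.learn(total_timesteps=10_000)
model.save("model.zip")
train_env.save("vecnormalize.pkl")  # persist the normalization statistics

# Evaluation: reload the SAME statistics and freeze them, otherwise the agent
# sees differently scaled observations than it did during training.
eval_env = VecNormalize.load("vecnormalize.pkl", DummyVecEnv([make_env]))
eval_env.training = False     # do not update statistics at test time
eval_env.norm_reward = False  # report raw rewards
model = PPO.load("model.zip", env=eval_env)

obs = eval_env.reset()
for _ in range(1_000):
    action, _ = model.predict(obs, deterministic=True)
    obs, rewards, dones, infos = eval_env.step(action)
    if dones[0]:
        break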
0 votes · 0 answers · 44 views
How to implement model.learn() correctly in self-play (Stable-Baselines3 DQN)
I use DQN from sb3 to train a model. I want to train 2 agents that play against each other alternately. The problem is, as soon as I call model.learn(total_timesteps=N), which is the central method to ...
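One pattern that works with SB3 is to keep two independent DQN models and alternate short learn() calls with reset_num_timesteps=False, so each call continues the previous timestep counter and exploration schedule. A minimal sketch, with a toy environment standing in for the real game:

import gymnasium as gym
import numpy as np
from stable_baselines3 import DQN

class SelfPlayEnv(gym.Env):
    # Toy two-player matching game seen from the learning player's side; the
    # opponent (a frozen SB3 model, or None for a random player) acts inside
    # step(). This is only a stand-in for the real game.
    def __init__(self, opponent=None):
        super().__init__()
        self.opponent = opponent
        self.observation_space = gym.spaces.Box(0.0, 1.0, shape=(4,), dtype=np.float32)
        self.action_space = gym.spaces.Discrete(2)

    def reset(self, seed=None, options=None):
        super().reset(seed=seed)
        self.obs = self.np_random.random(4).astype(np.float32)
        self.t = 0
        return self.obs, {}

    def step(self, action):
        opp_action = (self.action_space.sample() if self.opponent is None
                      else self.opponent.predict(self.obs, deterministic=True)[0])
        reward = 1.0 if action != opp_action else -1.0
        self.obs = self.np_random.random(4).astype(np.float32)
        self.t += 1
        return self.obs, reward, self.t >= 50, False, {}

model_a = DQN("MlpPolicy", SelfPlayEnv(), buffer_size=10_000, learning_starts=500)
model_b = DQN("MlpPolicy", SelfPlayEnv(), buffer_size=10_000, learning_starts=500)

for _ in range(3):
    # Train each model against a frozen copy of the other; reset_num_timesteps=False
    # keeps the timestep counter and exploration schedule running across calls.
    model_a.set_env(SelfPlayEnv(opponent=model_b))
    model_a.learn(total_timesteps=2_000, reset_num_timesteps=False)
    model_b.set_env(SelfPlayEnv(opponent=model_a))
    model_b.learn(total_timesteps=2_000, reset_num_timesteps=False)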
0 votes · 1 answer · 214 views
EvalCallback hangs in stable-baselines3
I'm trying to train an A2C model in stable-baselines3 and the EvalCallback appears to freeze when it is called. I cannot figure out why. Below you will find a script that recreates this problem. ...
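For comparison, here is a minimal EvalCallback setup that does not block, assuming CartPole as a stand-in task. A frequent cause of apparent freezes is an eval environment whose episodes never terminate, since EvalCallback runs n_eval_episodes complete episodes synchronously inside the training loop:

import gymnasium as gym
from stable_baselines3 import A2C
from stable_baselines3.common.callbacks import EvalCallback
from stable_baselines3.common.monitor import Monitor

train_env = gym.make("CartPole-v1")
eval_env = Monitor(gym.make("CartPole-v1"))  # separate instance, no human rendering

eval_callback = EvalCallback(
    eval_env,
    n_eval_episodes=5,
    eval_freq=1_000,      # evaluate every 1000 environment steps
    deterministic=True,
    render=False,
)

model = A2C("MlpPolicy", train_env, verbose=1)
model.learn(total_timesteps=20_000, callback=eval_callback)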
0 votes · 1 answer · 107 views
Stable Baselines3 not generating TensorBoard files for PPO, SAC and TD3
I am comparing A2C, DQN and PPO models. I need TensorBoard graphs to show my teacher. TensorBoard only collects data for the A2C model; when using it for PPO, SAC or TD3 it creates the event ...
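For reference, a minimal sketch that produces separate event files for each algorithm, assuming standard Gym tasks as stand-ins; the key is passing tensorboard_log to every model and a distinct tb_log_name to every learn() call:

import gymnasium as gym
from stable_baselines3 import A2C, PPO, SAC, TD3

log_dir = "./tb_logs/"

runs = [
    (A2C, "CartPole-v1", "a2c"),
    (PPO, "CartPole-v1", "ppo"),
    (SAC, "Pendulum-v1", "sac"),   # SAC and TD3 need a continuous action space
    (TD3, "Pendulum-v1", "td3"),
]

for algo, env_id, name in runs:
    model = algo("MlpPolicy", gym.make(env_id), tensorboard_log=log_dir)
    # tb_log_name separates the runs as tb_logs/a2c_1, tb_logs/ppo_1, ...
    model.learn(total_timesteps=5_000, tb_log_name=name)

# Then inspect with: tensorboard --logdir ./tb_logs/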
1 vote · 0 answers · 59 views
What input should I use when calling predict on an RL model? Should it be scaled or inverse-scaled?
I am using SB3 DQN to train on stock data where my observation is the last 120 candles with 7 features, i.e. open, high, low, close, hour, min, RSI, etc. So the obs shape would be (120, 7) and the output would be discrete with 3 ints: 0, ...
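In general, predict() must receive observations in exactly the form the policy saw during training, so if the training observations were scaled, feed scaled values (never the inverse-transformed prices). A minimal sketch, assuming a MinMaxScaler was used and with random placeholder data standing in for the real candles:

import numpy as np
from sklearn.preprocessing import MinMaxScaler
from stable_baselines3 import DQN

# Placeholder candle history (open, high, low, close, hour, min, rsi, ...).
history = np.random.rand(1_000, 7).astype(np.float32)
scaler = MinMaxScaler().fit(history)        # the SAME scaler used to build training obs

model = DQN.load("dqn_stock_model.zip")     # illustrative path
raw_window = history[-120:]                 # last 120 candles, shape (120, 7)
obs = scaler.transform(raw_window).astype(np.float32)

action, _ = model.predict(obs, deterministic=True)  # discrete action: 0, 1 or 2
# inverse_transform is only for turning scaled values back into readable prices,
# never for the input to predict().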
2 votes · 1 answer · 716 views
Training a Custom Feature Extractor in Stable Baselines3 Starting from Pre-trained Weights?
I am using the following custom feature extractor for my Stable-Baselines3 model:
import torch.nn as nn
from stable_baselines3 import PPO
class Encoder(nn.Module):
def __init__(self, input_dim, ...
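The usual SB3 pattern is to wrap the encoder in a BaseFeaturesExtractor, load the pre-trained weights inside its __init__, and pass it through policy_kwargs. A minimal sketch; the architecture, weight file and CartPole environment are assumptions, since the original Encoder definition is truncated above:

import gymnasium as gym
import torch
import torch.nn as nn
from stable_baselines3 import PPO
from stable_baselines3.common.torch_layers import BaseFeaturesExtractor

class PretrainedEncoderExtractor(BaseFeaturesExtractor):
    def __init__(self, observation_space: gym.spaces.Box, features_dim: int = 64):
        super().__init__(observation_space, features_dim)
        input_dim = int(observation_space.shape[0])
        self.encoder = nn.Sequential(
            nn.Linear(input_dim, 128), nn.ReLU(),
            nn.Linear(128, features_dim), nn.ReLU(),
        )
        # Load weights saved from the separately pre-trained encoder
        # (illustrative path; the state_dict keys must match this module).
        self.encoder.load_state_dict(torch.load("encoder_pretrained.pt", map_location="cpu"))

    def forward(self, observations: torch.Tensor) -> torch.Tensor:
        return self.encoder(observations)

model = PPO(
    "MlpPolicy",
    gym.make("CartPole-v1"),  # stand-in environment
    policy_kwargs=dict(
        features_extractor_class=PretrainedEncoderExtractor,
        features_extractor_kwargs=dict(features_dim=64),
    ),
)
model.learn(total_timesteps=10_000)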
0 votes · 1 answer · 116 views
"requested array would exceed the maximum number of dimensions of 1" issue in gym
Suppose we have the following code:
import gym
from stable_baselines3 import PPO
env = gym.make("CartPole-v1", render_mode="human")
model = PPO("MlpPolicy", env, ...
1 vote · 1 answer · 134 views
Stable-Baselines3 TD3: reset() method "too many values to unpack" error
The environment is Python 3.10 with stable-baselines3 2.3.0, and I'm trying the TD3 algorithm.
I keep getting the same error no matter what I do.
As far as I know, the reset method should return something matching the observation space ...
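For reference, stable-baselines3 2.x expects the Gymnasium API, where reset(seed=..., options=...) returns an (observation, info) tuple and step() returns five values. A minimal sketch with placeholder spaces standing in for the real environment:

import gymnasium as gym
import numpy as np
from stable_baselines3 import TD3

class MyEnv(gym.Env):
    def __init__(self):
        super().__init__()
        self.observation_space = gym.spaces.Box(-1.0, 1.0, shape=(3,), dtype=np.float32)
        self.action_space = gym.spaces.Box(-1.0, 1.0, shape=(1,), dtype=np.float32)  # TD3 needs continuous actions

    def reset(self, seed=None, options=None):
        super().reset(seed=seed)
        self.t = 0
        return np.zeros(3, dtype=np.float32), {}   # two values: observation AND an info dict

    def step(self, action):
        self.t += 1
        obs = np.zeros(3, dtype=np.float32)
        terminated = False
        truncated = self.t >= 200                  # finite episodes keep TD3's rollout loop bounded
        return obs, 0.0, terminated, truncated, {}

model = TD3("MlpPolicy", MyEnv())
model.learn(total_timesteps=1_000)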