Q-learning: What is the correct state for reward calculation

Q learning - rewards

I'm struggling to interpret the pseudocode for the Q learning algorithm:

1  For each s, a initialize table entry Q(a, s) = 0
2  Observe current state s
3  Do forever:
4     Select an action a and execute it
5     Receive immediate reward r
6     Observe the new state s′ ← δ(a, s)
7     Update the table entry for Q(a, s) as follows:
8        Q( a, s ) ← R( s ) + γ * max Q( a′, s′ )
9     s ← s′

Should the rewards be collected from the subsequent state s' or the current state s?

Solution

The rewards should be collected from the subsequent state you enter after executing the action a.

PPO agent is not learning
Pytorch Geometric graph batching not using DataLoader for Reinforcement learning
Keras-rl2 error Compability with Tensorflow
Monte Carlo Method for Blackjack: strange Q-values table
Performance issue with gradient-bandit agent
Probability 0 in Importance Sampling
Complex state in re-inforcement learning
OpenAi-Gym Discrete Space with negative values
How to seed `gymnasium` environment resets when using `stable_baselines3`?
Using Tensorflow Huber loss in Keras
Training a Custom Feature Extractor in Stable Baselines3 Starting from Pre-trained Weights?
AttributeError: module '_Box2D' has no attribute 'RAND_LIMIT_swigconstant'
How do I log observations after reset in Stable_Baselines3?
tf.function converts variable to tensor automatically
How can I change this to use a q table for reinforcement learning
Getting a very simple stablebaselines3 example to work
Are neural networks really abandonware?
Difference between TensorFlow model fit and train_on_batch
stmemory and ltmemory in "How to build your own AlphaZero AI using Python and Keras"
Reinforcement Learning Gymnasium ValueError
SB3 - AttributeError: 'DummyVecEnv' object has no attribute 'get_action_meanings'
Learning agent in custom gymnasium enviroment with stable_baseline3 make change this envirment
Why is my REINFORCE algorithm not learning?
AST extraction of parameters from multiple formats RL scripts
RuntimeError: Trying to backward through the graph a second time (or directly access saved tensors after they have already been freed)
PyTorch lightning RuntimeError: CUDA error: initialization error. CPU works tho
OpenAI-Gym Mojoco Walker2d-v4 model global cordinates error
Too many / Not enough values in OpenAI Gym Mario Model for Reinforcement Learning
Can't install torchrl into Google Colab after torch 2.2.1
How to solve deepcopy error of a pruned model in pytorch