machine-learning reinforcement-learning q-learning

How many states could I work with on my ordinary home computer when using Q-learning?

How many states could I work with on my ordinary home computer when I want to implement a reinforcement learning algorithm such as Q-Learning? 1 thousand, 1 million, more?

Solution

It is highly unadvisable to run a lot of states. The reason is really simple - when there are a lot of states in the memory, by the time the GPU finds the state and its corresponding action, the game already changes to another state.

So the solution is to use something a bit more advanced than naive Q-learning. See Deep Q-learning and other popular variants of RL like A3C. They help to avoid this issue

ALS (Alternating Least Square) algorithm in multiple rankings for a user
How does one set the pad token correctly (not to eos) during fine-tuning to avoid model not predicting EOS?
java.lang.AssertionError: "Does not support data type INT32" in Android Studio
How to create image of confusion matrix in Python
Cross-validation with nb method
The “Forward/Backward Passage Size” is too large for the pytorch model (Yolov3)
How many images(minimum) should be there in each classes for training YOLO?
Why do neural networks work so well?
Will larger batch size make computation time less in machine learning?
Why KL divergence is negative in Pytorch?
Creating a voice identification system using machine learning
Forward pass with all samples
fit method in sklearn
Calibrating Probabilities in lightgbm or XGBoost
Implementation of F1-score, IOU and Dice Score
Is it ok to have the training history very similar to the validation history?
How to understand Shapley value for binary classification problem?
Stochastic Gradient Descent for Logistic Regression always returns a cost of Inf and weight vector never gets any closer
Data format for Libsvm SVR training in Matlab
Why shouldn't we use multiple activation functions in the same layer?
How to implement a butterworth filter
weka java api stringtovector exception
Predict training data in sklearn
Text classification with weka
How can i apply feature reduction methods in Weka?
'super' object has no attribute '__sklearn_tags__'
lightgbm.cv: cvbooster.best_iteration always returns -1
Wrong detection from yolov5 model
torchrl: Using SyncDataCollector with a custom pytorch dqn
OCR Preprocessing for Oman License Plates - Issues with Alphabet Recognition