I am trying to fine the shortest route between two nodes using reinforcement learning. I am not sure what environment to use. I have found this particular environment and am not sure if I am going in the right direction. Can anybody please help. Can anybody please suggest a few python OpenAI gym environments I can use.
I am trying to do the same things too. But currently, none of open-sourced OpenAI gym environments is applicable. So I suggest you create your own environment (it is not too hard). Here is what I found:
check_env
in stablebaseline3 for this environment will warn you the state
/ observation_space
for step()
and reset()
are not work well.