Learning Curve in Q-learning

My question is I wrote the Q-learning algorithm in c++ with epsilon greedy policy now I have to plot the learning curve for the Q-values. What exactly I should have to plot because I have an 11x5 Q matrix, so should I take one Q value and plot its learning or should I have to take the whole matrix for a learning curve, could you guide me with it. Thank you

Solution

Learning curves in RL are typically plots of returns over time, not Q-losses or anything like this. So you should run your environment, compute the total reward (aka return) and plot it at a corresponding time.

How to trigger implicit pointer conversion inline in a C macro?
How do I create a Virus signature?
snprintf and sprintf explanation
Code for implementing queue with ring buffer
c - what is the most efficient way to copying a string?
passing argument 2 makes pointer from integer without a cast
How to declare a 2d Array without knowing the dimensions?
what is the easiest way to read and process serial data for windows 32-bit systems?
Where is SYSCALL() implemented in Linux?
Does s conversion specifier specify direction / order in which characters are written?
I'm getting an incredible error in the for loop in c language. For loop works in 1,2 and 3 but stops suddenly
Address of local variable gives invalid address
ESP-IDF linker not reporting duplicate functions so generating unsafe binary
typedef for constant pointer to constant data function array
Transfer pointer from dll (c/c++) to python
Find out which thread/process is consuming CPU
Chocolate Feast Program
Embedding python in multithreaded C application
C and C++ : Partial initialization of automatic structure
How to access global variable via JNA interface?
Error while loading shared libraries: libpq.so.5
Request for heeds when using '_' <underscore> as identifier in C (and interop with other languages)
What is the result of #if MACRO and MACRO is defined without value?
Mac OS Catalina sbrk is deprecated
Why do I need to put the millis() function inside the main loop to trigger the ISR?
undefined reference to `WinMain@16' collect2.exe: error: ld returned 1 exit status
String brokes after splitted in C
What's the meaning of the %m formatting specifier?
Trouble Returning to Previous input in loop
snprintf vs. strcpy (etc.) in C