r docker shiny computer-science shinyproxy

ShinyProxy Docker - Max Threads (max concurrent users)

I am creating an enterprise wide form entry solution in R Shiny that will be aimed at approximately 400 users.

My question is (in a nutshell): Since R is a single threaded process, do I need 200 cores (assuming two threads per core) to support a maximum of 400 concurrent users? How do I go about specifying infrastructure requirements (current is 32GB ram, 4 CPU)?

As mentioned there are no complex R models etc in the back end this is basically an interface to the database with minimal data crunching, so the application is pretty lightweight my main concern is just the number of concurrent users. This is an on premise solution as well.

I am using a containerized version of ShinyProxy with a Docker Network Bridge to communicate with the application image.

Any comments would be appreciated.

Solution

If you want to be absolutely certain that a given user will never experience a delay due to the R process being used by someone else, then yes, you would need 400 threads for 400 concurrent users.

In practice, and if (as you say) the app doesn't have many computationally intensive functions, then you could get away with significantly fewer cores. If it takes, say, 200 ms for a user to compute some function, then you'll need a handful of users on the same thread running such a function at the same time before anyone notices a significant (e.g., 1 s) delay. More likely you can get away with dozens of concurrent connections per thread before the user experience is negatively affected if there is no heavy computation involved.

Having said all this, my understanding of ShinyProxy is that it will spin up a new instance for every new connection, so the number of concurrent users may be limited by the number of cores. I'm not entirely clear on this point. However, if that's the case, and assuming a model of several hundred connections but little processing requirements, I think a better approach might be a handful of Shiny Server instances behind a load balancer. The servers would not need to be particularly powerful (you're only using one thread anyway). Each server can handle >100 connections (I don't know if there is an upper limit).

To summarise, in my view, ShinyProxy is better when you have fewer connections running more computationally intensive apps, while Shiny Server (even the free version) is better when you have more connections to a computationally trivial app.

In terms of specifying infrastructure requirements, a good place to start would be shinyloadtest.