I reconfigured Spark infrastructure in K8s (as part of MLRun/Iguazio platform) and after that, I got a lot of issues in level of services:
Failed
)Failed dependencies
)and also general error/message:
Some services have not been successfully deployed. Check the services status as shown below.
I changed only amount of RAM (1-30 GB RAM), vCPU (1-14) and Replicas (3).
Did you get the similar issue and how to avoid the situation?
It was human mistake, the solution was easy and the key problem was in Spark service configuration (I configured extremely small vCPU values and it generated timeouts for Spark service):
millicpu
(not cpu)cpu
and restart of Spark service, everything was fine.