Search code examples
azure-batch

MaxInternalRetryCountReached - The specified task has reached maximum internal retry count


Would you please explain when this task scheduling error will happen?

MaxInternalRetryCountReached - The specified task has reached maximum internal retry count MSDN link here.

The background is that I see this error sometimes, for jobs with tasks whose RetryCount = 1. When this error is seen, the retry count is 1 (which means it was retried). However the task returned Exit Code of 0 and it appeared to have done its job successfully.

Thanks!


Solution

  • Internal retries for tasks can happen if there are transient issues when a task is scheduled to a compute node (i.e., an Azure Storage error occurs) or if the node cannot schedule the task on the node due to some issue (i.e., preparing the task's directory failed).

    The internal retry count is a separate counter from that of the task's normal retry count which is triggered from retries occurring normally, e.g., a retry as a result of a non-successful exit code.