Search code examples
hadoopapache-pig

Why are mapreduce attempts killed due to "Container preempted by scheduler"?


I just noticed the fact that many Pig jobs on Hadoop are killed due to the following reason: Container preempted by scheduler

Could someone explain me what causes this, and if I should (and am able to) do something about this?

Thanks!


Solution

  • If you have the fair scheduler and a number of different queue's enabled, then higher priority applications can terminate your jobs (in a preemptive fashion).

    Hortonworks have a pretty good explanation with more details

    https://docs.hortonworks.com/HDPDocuments/HDP2/HDP-2.3.2/bk_yarn_resource_mgt/content/preemption.html

    Should you do anything about it? Depends if your application is within its SLA's and performing within expectations. General good practice would be to review your job priority and the queue it's assigned to.