apache-zookeeper mesos mesosphere

Mesos cluster does not recover when the physical hosts restart


I'm using Mesosphere on 3 hosts running Ubuntu 14.04, as follows:

  • one with the Mesos master
  • two with Mesos slaves

Everything works fine, but after restarting all the physical hosts, all scheduled jobs were lost. Is this normal? I expected ZooKeeper to store the current jobs, so that when the system has to be restarted, all jobs would be rescheduled once the master boots.

Update: I'm running Marathon and the Mesos master on the same node, and I start Marathon with the --zk flag.
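For readers unfamiliar with the flag, here is a minimal sketch of what the ZooKeeper URLs passed to Marathon usually look like. It is an illustration only: the host address, the `/mesos` and `/marathon` znode paths, the `zk_url` helper, and the `marathon` executable name are assumptions, not values taken from the setup described above.

```python
def zk_url(hosts, path, port=2181):
    """Build a ZooKeeper URL such as zk://h1:2181,h2:2181/mesos."""
    if not hosts:
        raise ValueError("at least one ZooKeeper host is required")
    endpoints = ",".join(f"{host}:{port}" for host in hosts)
    return f"zk://{endpoints}/{path.strip('/')}"


# Hypothetical ZooKeeper host (documentation address range); replace with your own.
zk_hosts = ["192.0.2.1"]

# Assumed znode paths: Mesos keeps its leader information under one path,
# and Marathon keeps its own state (apps, tasks) under a separate one.
mesos_zk = zk_url(zk_hosts, "mesos")
marathon_zk = zk_url(zk_hosts, "marathon")

# --master tells Marathon where to find the Mesos master,
# --zk tells it where to persist its own state.
marathon_cmd = ["marathon", "--master", mesos_zk, "--zk", marathon_zk]
print(" ".join(marathon_cmd))
```

If Marathon's state path points at a ZooKeeper instance that keeps its data across reboots, Marathon should be able to reload its app definitions and relaunch them once the master is back.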


Solution

  • I solved the problem by following these installation instructions: How To Configure a Production-Ready Mesosphere Cluster on Ubuntu 14.04