I'm in this situation in which I got two masters and four slaves in mesos. All of them are running fine. But when I'm trying to access marathon I'm getting the 'Could not determine the current leader' error. I got marathon in both masters (117 and 115). This is basically what I'm running to get marathon up:
java -jar ./bin/../target/marathon-assembly-0.11.0-SNAPSHOT.jar --master 172.16.50.117:5050 --zk zk://172.16.50.115:2181,172.16.50.117:2181/marathon
Could anyone shed some light over this?
First, I would double-check that you're able to talk to Zookeeper from the Marathon hosts.
Next, there are a few related points to be aware of:
--quorum
flag appropriately based on that number. See the details of how to set the --quorum
flag (and why it's important) in the operational guide on the Apache Mesos website here: http://mesos.apache.org/documentation/latest/operational-guide--zk
flag on the Mesos masters to --zk=zk://172.16.50.117:2181,172.16.50.115:2181/mesos
(add a third ZK instance, see the first point above). The same value should be used for the --master
flags in both the Mesos agents and Marathon, instead of specifying a single master.