Hi, I have a test environment with 3 masters (m1, m2, m3) and 2 slaves (s1, s2).
I want to achieve HA for Mesos and Marathon with the help of ZooKeeper.
OS: RHEL 7.2
Type: VirtualBox VMs
I installed Mesos, Marathon and ZooKeeper in offline mode, i.e.:
Mesos: downloaded the Mesos binaries and extracted the RPM packages.
Marathon and ZooKeeper: downloaded the tar.gz files and extracted the binaries.
3 masters: m1, m2, m3
2 slaves: s1, s2
I started ZooKeeper first on the masters (m1, m2, m3), and one of them was chosen as leader, e.g. m1 -> leader, m2 -> follower, m3 -> follower.
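For reference, as I understand it a three-server ZooKeeper ensemble needs a majority of two servers to stay available, so it should keep working with one master VM powered off. A minimal sketch of that arithmetic (the function names are only illustrative and do not come from any Mesos or ZooKeeper tool):

```python
def majority(ensemble_size: int) -> int:
    """Smallest number of servers that forms a strict majority."""
    return ensemble_size // 2 + 1


def tolerated_failures(ensemble_size: int) -> int:
    """How many servers can be down while a majority is still up."""
    return ensemble_size - majority(ensemble_size)


if __name__ == "__main__":
    size = 3  # m1, m2, m3
    print("majority needed:", majority(size))
    print("failures tolerated:", tolerated_failures(size))
```

With m1 powered off, two of the three servers remain, which still meets the majority of two.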
Starting Mesos masters and slaves:
I ran the mesos binary with options on the leading master (m1) first, and then started Mesos on the followers (m2, m3).
Starting Marathon:
I started Marathon on the leading machine (m1) and then on the remaining machines (m2 and m3).
The cluster state was then:
m1 -> leading Mesos master, leading Marathon
m2 -> non-leading Mesos master, non-leading Marathon
m3 -> non-leading Mesos master, non-leading Marathon
plus the slaves s1 and s2.
I created some sample applications (t1, t2) via Marathon on m1 and they ran successfully.
When I powered off the m1 VM, m2 became the leading Mesos master and m3 became the leading Marathon instance. The cluster state was then:
m1 -> powered off (unavailable)
m2 -> leading Mesos master, non-leading Marathon
m3 -> non-leading Mesos master, leading Marathon
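To double-check which machine each service reports as leader after the failover, something like the sketch below could be used. It assumes the Mesos master state endpoint (`/master/state`, whose `leader` field looks like `master@<ip>:<port>`) and Marathon's `/v2/leader` endpoint. The ports 5050 and 8080 are the usual defaults, not values taken from my setup, and the IP address and sample responses are made up for illustration.

```python
import json
from urllib.request import urlopen


def fetch_json(url: str, timeout: float = 5.0) -> dict:
    """GET a URL and decode the JSON body (not called in the demo below)."""
    with urlopen(url, timeout=timeout) as resp:
        return json.load(resp)


def mesos_leader_host(state: dict) -> str:
    """Host part of the Mesos 'leader' field, e.g. 'master@10.0.0.2:5050'."""
    pid = state.get("leader", "")
    address = pid.split("@", 1)[-1]      # drop the 'master@' prefix
    return address.rsplit(":", 1)[0]     # drop the ':port' suffix


def marathon_leader_host(body: dict) -> str:
    """Host part of Marathon's /v2/leader reply, e.g. {'leader': 'm3:8080'}."""
    return body.get("leader", "").rsplit(":", 1)[0]


if __name__ == "__main__":
    # Hypothetical responses. Against a live cluster these would come from
    #   fetch_json("http://m2:5050/master/state")
    #   fetch_json("http://m3:8080/v2/leader")
    sample_mesos_state = {"leader": "master@192.168.56.102:5050"}
    sample_marathon_body = {"leader": "m3:8080"}
    print("Mesos leader host:   ", mesos_leader_host(sample_mesos_state))
    print("Marathon leader host:", marathon_leader_host(sample_marathon_body))
```

Mesos may report an IP address while Marathon reports the hostname it was started with, so the two values may need to be mapped to machines by hand.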
I then tried to create and run a sample app (t3) via Marathon, but its status stayed at "Waiting" forever, i.e. it never started, while the previous tasks t1 and t2 kept running.
Is it expected behaviour for the Mesos leader to be on one machine and the Marathon leader on another? If yes, why can't I run a new task from Marathon, and how can I get it to run?
Can this happen, i.e. can the leaders for Mesos and Marathon be chosen from different machines?
Is my configuration correct?
I repeated this test five times. In 2 to 3 of the runs the leaders ended up on different machines as described above; in the other runs Mesos and Marathon chose the same machine as leader.