Uploaded image for project: 'Marathon'
  1. Marathon
  2. MARATHON-2311

Workaround required for unreachable resident tasks

    Details

    • Sprint:
      Marathon Sprint 7-2017
    • Build artifact:
      Marathon-v1.7.174

      Description

      Presently, on agent reboot, the agent will come up with a new ID. This presents a problem because it means that Mesos will never be able to report that a LOST task as gone, and the instance will forever be in the Unreachable state, even though it's associated reservation is reoffered. We expect this to be fixed by https://issues.apache.org/jira/browse/MESOS-6223, which is slated to be released with Mesos 1.3.0.

      Until then, we plan to monitor the offer stream, watch for reservations, and then check associated instance state. If the instance state is unreachable, then we will emit a terminal TASK_GONE mesos update so that the task can transition appropriately.

        Attachments

          Issue Links

            Activity

              People

              • Assignee:
                tharper Tim Harper
                Reporter:
                tharper Tim Harper
                Team:
                Orchestration Team
                Watchers:
                Johannes Unterstein, Ondrej Smola
              • Watchers:
                2 Start watching this issue

                Dates

                • Created:
                  Updated:
                  Resolved: