Uploaded image for project: 'DC/OS'
  1. DC/OS
  2. DCOS_OSS-4048

Delayed Status for Services in UI

    Details

    • Type: Epic
    • Status: Open
    • Priority: High
    • Resolution: Unresolved
    • Affects Version/s: None
    • Fix Version/s: None
    • Component/s: dcos-ui
    • Labels:
    • Epic Name:
      Delayed Status for Services
    • Epic Status:
      To Do
    • Total Story Points:
      15
    • Remaining Story Points:
      12
    • Progress (%):
      20
    • Progress Bar:
      20%

      Description

      While using the DC/OS UI to debug some services that wouldn't start for days, I saw that their state is shown as Recovering or Deploying.

      The UI said something like:
      DC/OS has been waiting for resources and is unable to complete this deployment for 2 days. See recent resource offers.
       
      This led me to think that Marathon was not getting offers, but after looking at the Marathon UI, I saw that the problem was not the lack of matching offers, but that the status of the applications was Delayed.

      The reason why the services were delayed was a transient infrastructure error. I tried to reset the delay using the DC/OS UI, but I didn't find a way of doing it, so I had to go back to the Marathon UI to reset the delay.

       

      How to reproduce

      You can deploy this application, it should go in the delayed state pretty quickly. To see if it worked, please take a look at the marathon ui. If it didn't work deploy more and make sure this has not enough resources.

      {
      "id": "/delayed",
      "backoffFactor": 10,
      "backoffSeconds": 1,
      "cmd": "exit 1",
      "container": {
      "type": "MESOS",
      "volumes": []
      },
      "cpus": 0.1,
      "disk": 0,
      "instances": 10,
      "maxLaunchDelaySeconds": 3600,
      "mem": 10,
      "gpus": 0,
      "networks": [
      {
      "mode": "host"
      }
      ],
      "portDefinitions": [],
      "requirePorts": false,
      "upgradeStrategy": {
      "maximumOverCapacity": 1,
      "minimumHealthCapacity": 1
      },
      "killSelection": "YOUNGEST_FIRST",
      "unreachableStrategy": {
      "inactiveAfterSeconds": 0,
      "expungeAfterSeconds": 0
      },
      "healthChecks": [],
      "fetch": [],
      "constraints": []
      }
      

        Attachments

          Issue Links

            Activity

              People

              • Assignee:
                danielschmidt Daniel Schmidt
                Reporter:
                danielschmidt Daniel Schmidt
                Team:
                Frontend Team
                Watchers:
                Daniel Schmidt, Matthias Eichstedt
              • Watchers:
                2 Start watching this issue

                Dates

                • Created:
                  Updated: