Uploaded image for project: 'DC/OS'
  1. DC/OS
  2. DCOS_OSS-4181

[1.13] Telegraf does not retrieve executor metadata from Mesos

    Details

    • Sprint:
      Observability Team Sprint 32, Observability Team Sprint 33, Observability Team Sprint 34
    • Story Points:
      5

      Description

      Sep 25 20:31:50 ip-10-0-6-64.us-west-2.compute.internal start_telegraf.sh[5867]: 2018-09-25T20:32:00Z I! Metadata for container "dc53b42f-9b3d-433b-b334-9aa1e4da2466" was not found in cache
      Sep 25 20:32:00 ip-10-0-6-64.us-west-2.compute.internal start_telegraf.sh[5867]: 2018-09-25T20:32:10Z I! Metadata for container "dc53b42f-9b3d-433b-b334-9aa1e4da2466" was not found in cache

      These logs are repeated because they belong to an executor. Executor containers should be cached, not just tasks.

      This bug means noise in the logs, missing data in the metrics output, and repeated hits on Mesos.

      One option to resolve this is using GET_EXECUTORS and then GET_TASKS (GET_FRAMEWORKS) rather than polling usingĀ  GET_STATE which doesn't contain everything we need.

        Attachments

          Issue Links

            Activity

              People

              • Assignee:
                gracedo Grace Do
                Reporter:
                philip Philip Norman
                Team:
                Observability Team
                Watchers:
                Branden Rolston, Grace Do, Lisa Gunn, Mergebot, Philip Norman
              • Watchers:
                5 Start watching this issue

                Dates

                • Created:
                  Updated:
                  Resolved: