DC/OS / DCOS_OSS-3625

arp cache neighbor table overflow

    Details

    • Type: Bug
    • Status: Resolved
    • Priority: Medium
    • Resolution: Cannot Reproduce
    • Affects Version/s: DC/OS 1.11.0, DC/OS 1.11.1, DC/OS 1.11.2
    • Fix Version/s: None
    • Labels: None

      Description

      Hey,

      I have a DC/OS cluster with around 20 agents and around 50 running tasks.

      The problem I am facing is that components on the agents and the master go into unhealthy states.

      After some digging, I found the following error in dmesg: `arp cache neighbor table overflow`. I increased the following:

      net.ipv4.neigh.default.gc_thresh2

      net.ipv4.neigh.default.gc_thresh3

       

      My question is how the limit is reached with only 50 tasks running. Also, can DC/OS optimize gc_thresh for its use case?
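
For anyone trying to reproduce this, a rough way to see how close a node is to the limits is to compare the number of IPv4 neighbor entries with the configured thresholds. The sketch below is only an illustration, not part of DC/OS: it assumes the standard Linux procfs paths `/proc/net/arp` and `/proc/sys/net/ipv4/neigh/default/`, and the helper names (`count_arp_entries`, `read_thresholds`, `describe_usage`, `main`) are hypothetical. Note that `/proc/net/arp` only lists entries visible in the current network namespace, so the count may understate what the kernel is tracking.

```python
from pathlib import Path

# Assumed standard Linux procfs locations; adjust if your system differs.
ARP_TABLE = Path("/proc/net/arp")
THRESH_DIR = Path("/proc/sys/net/ipv4/neigh/default")
THRESH_NAMES = ("gc_thresh1", "gc_thresh2", "gc_thresh3")


def count_arp_entries(arp_text: str) -> int:
    """Count entries in /proc/net/arp-style text (the first line is a header)."""
    lines = [line for line in arp_text.splitlines() if line.strip()]
    return max(len(lines) - 1, 0)


def read_thresholds(thresh_dir: Path = THRESH_DIR) -> dict:
    """Read gc_thresh1..gc_thresh3 as integers from the given directory."""
    return {
        name: int((thresh_dir / name).read_text().strip())
        for name in THRESH_NAMES
    }


def describe_usage(entries: int, thresholds: dict) -> str:
    """Say which threshold, if any, the entry count has reached."""
    if entries >= thresholds["gc_thresh3"]:
        return "at or above gc_thresh3 (hard limit)"
    if entries > thresholds["gc_thresh2"]:
        return "above gc_thresh2 (soft limit)"
    if entries >= thresholds["gc_thresh1"]:
        return "at or above gc_thresh1"
    return "below gc_thresh1"


def main() -> None:
    """Print the current entry count next to the configured thresholds."""
    entries = count_arp_entries(ARP_TABLE.read_text())
    thresholds = read_thresholds()
    print(f"{entries} IPv4 neighbor entries; thresholds: {thresholds}")
    print(describe_usage(entries, thresholds))
```

Calling `main()` on an agent while the tasks are running should show whether the entry count is actually approaching `gc_thresh3`, which may help narrow down where the entries come from.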

            People

            • Assignee: Deepak Goel (dgoel)
            • Reporter: tahaalibra
            • Team: Networking Team
            • Watchers: Deepak Goel, tahaalibra
