zoukankan      html  css  js  c++  java
  • 【原创】大叔经验分享(101)marathon反复重启

    marathon反复重启

    # systemctl status marathon
    ● marathon.service - Marathon
       Loaded: loaded (/usr/lib/systemd/system/marathon.service; enabled; vendor preset: disabled)
       Active: activating (auto-restart) (Result: exit-code) since Fri 2020-01-10 19:57:20 CST; 284ms ago
      Process: 9919 ExecStart=/usr/share/marathon/bin/marathon (code=exited, status=103)
      Process: 9916 ExecStartPre=/bin/chmod 755 /run/marathon (code=exited, status=0/SUCCESS)
      Process: 9913 ExecStartPre=/bin/chown marathon:marathon /run/marathon (code=exited, status=0/SUCCESS)
      Process: 9910 ExecStartPre=/bin/mkdir -p /run/marathon (code=exited, status=0/SUCCESS)
     Main PID: 9919 (code=exited, status=103)

    通过journalctl查看日志如下:

    Jan 10 19:20:53 test-002 marathon[26905]: [2020-01-10 19:20:53,773] ERROR abdicateLeadership was called (mesosphere.marathon.core.election.ElectionServiceImpl:scala-execut
    Jan 10 19:20:53 test-002 marathon[26905]: mesosphere.marathon.Exception: abdicateLeadership
    Jan 10 19:20:53 test-002 marathon[26905]: at mesosphere.marathon.core.election.ElectionServiceImpl.abdicateLeadership(ElectionService.scala:154)
    Jan 10 19:20:53 test-002 marathon[26905]: at mesosphere.marathon.MarathonSchedulerService.$anonfun$startLeadership$7(MarathonSchedulerService.scala:254)
    Jan 10 19:20:53 test-002 marathon[26905]: at mesosphere.marathon.MarathonSchedulerService.$anonfun$startLeadership$7$adapted(MarathonSchedulerService.scala:253)
    Jan 10 19:20:53 test-002 marathon[26905]: at scala.Option.foreach(Option.scala:257)
    Jan 10 19:20:53 test-002 marathon[26905]: at mesosphere.marathon.MarathonSchedulerService.$anonfun$startLeadership$6(MarathonSchedulerService.scala:253)
    Jan 10 19:20:53 test-002 marathon[26905]: at mesosphere.marathon.MarathonSchedulerService.$anonfun$startLeadership$6$adapted(MarathonSchedulerService.scala:240)
    Jan 10 19:20:53 test-002 marathon[26905]: at scala.concurrent.impl.CallbackRunnable.run(Promise.scala:60)
    Jan 10 19:20:53 test-002 marathon[26905]: at java.util.concurrent.ForkJoinTask$RunnableExecuteAction.exec(ForkJoinTask.java:1402)
    Jan 10 19:20:53 test-002 marathon[26905]: at java.util.concurrent.ForkJoinTask.doExec(ForkJoinTask.java:289)
    Jan 10 19:20:53 test-002 marathon[26905]: at java.util.concurrent.ForkJoinPool$WorkQueue.runTask(ForkJoinPool.java:1056)
    Jan 10 19:20:53 test-002 marathon[26905]: at java.util.concurrent.ForkJoinPool.runWorker(ForkJoinPool.java:1692)
    Jan 10 19:20:53 test-002 marathon[26905]: at java.util.concurrent.ForkJoinWorkerThread.run(ForkJoinWorkerThread.java:157)

    检查mesos发现mesos异常,最终定位原因为zookeeper节点之间数据不同步,mesos在zookeeper上维护master leader信息

    [zk: 192.168.0.1:2181(CONNECTED) 1] ls /mesos

    [json.info_0000000114, log_replicas, json.info_0000000115]

    这个数据不同步导致mesos找不到master,进而导致marathon异常,将有问题的zookeeper节点stop,清空数据目录后重启,数据一致后问题解决;

  • 相关阅读:
    MyBatis执行sql的整个流程
    Ftp传输:向linux服务器上传文件时“550 Permission denied.”错误问题解决
    SpringBoot框架:两个方法同时调用时父方法使内部方法的DataSource注解失效的解决办法
    SpringBoot框架:通过AOP和自定义注解完成druid连接池的动态数据源切换(三)
    SpringBoot框架:配置文件application.properties和application.yml的区别
    SpringBoot框架:'url' attribute is not specified and no embedded datasource could be configured问题处理
    bash脚本打印字符串一个空格的内容
    gethostbyname的线程安全
    算法工程师的职业规划
    理解Deep Link & URI Schemes & Universal Link & App Link
  • 原文地址:https://www.cnblogs.com/barneywill/p/12180304.html
Copyright © 2011-2022 走看看