Managing your application cluster with ZooKeeper in Python | -猪之哀伤的Blog-
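Type: Technical - Posted at: 2011/09/09 12:43
Introduction:
ZooKeeper is a distributed coordination service that began as a subproject of Apache Hadoop. It is mainly used to solve data-management problems that distributed applications commonly run into, such as unified naming, state synchronization, cluster management, and management of distributed application configuration.
For a more detailed introduction, see this article.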
Installing zkpython:
Python has a package called zkpython, which is built on top of ZooKeeper's C client, so the ZooKeeper C client has to be installed first. The steps are as follows:

    # First download ZooKeeper
    wget http://labs.renren.com/apache-mirror//zookeeper/zookeeper-3.3.3/zookeeper-3.3.3.tar.gz
    tar xzvf zookeeper-3.3.3.tar.gz
    cd zookeeper-3.3.3/src/c/
    ./configure
    make
    make install

    # Then download zkpython
    wget http://pypi.python.org/packages/source/z/zkpython/zkpython-0.4.tar.gz#md5=3de220615aaddf57f1462b78d32477f9
    tar xzvf zkpython-0.4.tar.gz
    cd zkpython-0.4
    python setup.py install

That completes the zkpython installation.
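A quick way to confirm that the installation worked is to try importing the `zookeeper` module that zkpython provides. The following is a minimal sketch; `zkpython_available` is a hypothetical helper name, not part of zkpython. If the import fails even though the build succeeded, one possible cause (an assumption about your system, not something the steps above guarantee) is that the dynamic linker cannot find the C client library that `make install` placed on disk.

```python
def zkpython_available():
    """Return True if the `zookeeper` extension module can be imported.

    Hypothetical helper for illustration; it only checks importability,
    not whether a ZooKeeper server is reachable.
    """
    try:
        import zookeeper  # noqa: F401 (provided by zkpython)
    except ImportError:
        return False
    return True


if __name__ == "__main__":
    print("zkpython importable: %s" % zkpython_available())
```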
A simple demo:
Next, let's write a simple demo. (The zkclient.py used in the demo: https://github.com/piglei/zkpython_example/blob/master/zkclient.py)

    # coding: utf-8

    import logging
    from os.path import basename, join

    from zkclient import ZKClient, zookeeper, watchmethod

    logging.basicConfig(
        level=logging.DEBUG,
        format="[%(asctime)s] %(levelname)-8s %(message)s"
    )

    log = logging


    class GJZookeeper(object):

        ZK_HOST = "localhost:2181"
        ROOT = "/app"
        WORKERS_PATH = join(ROOT, "workers")
        MASTERS_NUM = 1
        TIMEOUT = 10000

        def __init__(self, verbose=True):
            self.VERBOSE = verbose
            self.masters = []
            self.is_master = False
            self.path = None

            self.zk = ZKClient(self.ZK_HOST, timeout=self.TIMEOUT)
            self.say("login ok!")
            # init
            self.__init_zk()
            # register
            self.register()

        def __init_zk(self):
            """
            create the zookeeper node if not exist
            """
            nodes = (self.ROOT, self.WORKERS_PATH)
            for node in nodes:
                if not self.zk.exists(node):
                    try:
                        self.zk.create(node, "")
                    except zookeeper.NodeExistsException:
                        # another worker created the node between exists() and create()
                        pass

        @property
        def is_slave(self):
            return not self.is_master

        def register(self):
            """
            register a node for this worker
            """
            self.path = self.zk.create(self.WORKERS_PATH + "/worker", "1",
                                       flags=zookeeper.EPHEMERAL | zookeeper.SEQUENCE)
            # keep only the node name, e.g. "worker0000000022"
            self.path = basename(self.path)
            self.say("register ok! I'm %s" % self.path)
            # check who is the master
            self.get_master()

        def get_master(self):
            """
            get children, and check who is the smallest child
            """
            @watchmethod
            def watcher(event):
                self.say("child changed, try to get master again.")
                # calling get_master() again also sets a new watcher
                self.get_master()

            children = self.zk.get_children(self.WORKERS_PATH, watcher)
            children.sort()
            self.say("%s's children: %s" % (self.WORKERS_PATH, children))

            # check if I'm master
            self.masters = children[:self.MASTERS_NUM]
            if self.path in self.masters:
                self.is_master = True
                self.say("I've become master!")
            else:
                self.say("%s is masters, I'm slave" % self.masters)

        def say(self, msg):
            """
            print messages to screen
            """
            if self.VERBOSE:
                if self.path:
                    log.info("[ %s(%s) ] %s" % (self.path, "master" if self.is_master else "slave", msg))
                else:
                    log.info(msg)


    def main():
        gj_zookeeper = GJZookeeper()

    if __name__ == "__main__":
        main()
        # keep the process (and its ZooKeeper session) alive
        import time
        time.sleep(1000)

What this simple demo does is create an ephemeral, sequential child node under the /app/workers node in ZooKeeper ( flags=zookeeper.EPHEMERAL | zookeeper.SEQUENCE ). After each create completes, the worker checks whether its own node is among the MASTERS_NUM smallest children (1 in this example, i.e. a single master). If it is, it runs as master; otherwise, it runs as slave.
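The election rule itself can be looked at in isolation, without a ZooKeeper connection. The following is a minimal sketch and not part of the original demo: `sequence_of` and `elect` are hypothetical helper names, and the child names are assumed to end in the 10-digit zero-padded sequence number seen in names such as `worker0000000022`.

```python
def sequence_of(child):
    """Return the numeric sequence suffix of a name like 'worker0000000022'.

    Assumes a 10-digit zero-padded suffix (an assumption of this sketch).
    """
    return int(child[-10:])


def elect(children, my_path, masters_num=1):
    """Return (masters, is_master) for the worker named `my_path`.

    The masters are the `masters_num` children with the smallest
    sequence numbers, mirroring the check done in get_master().
    """
    ordered = sorted(children, key=sequence_of)
    masters = ordered[:masters_num]
    return masters, my_path in masters


if __name__ == "__main__":
    children = ["worker0000000023", "worker0000000022"]
    print(elect(children, "worker0000000022"))
    print(elect(children, "worker0000000023"))
```

Because the suffix is zero-padded, a plain `children.sort()` as in the demo gives the same order; sorting on the parsed number just makes the intent explicit.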
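That way, when the master dies, its connection to ZooKeeper is broken as well. Once the specified TIMEOUT has elapsed, the child node the master created under workers is deleted, which triggers the watcher that the slave set earlier. The slave then checks again whether it is now the master and, if so, completes the switchover.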
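The failover sequence can be traced with a small in-memory simulation. This is only a sketch under stated assumptions: `FakeEnsemble` and `Worker` are hypothetical stand-ins written for illustration, not zkpython or ZooKeeper APIs, and session expiry is triggered by hand rather than by a real timeout. The one behavior it borrows from ZooKeeper is that a watch fires once and must be set again, which is why the demo's watcher calls `get_master()` again.

```python
class FakeEnsemble(object):
    """In-memory stand-in for the children of /app/workers (hypothetical)."""

    def __init__(self):
        self.next_seq = 0
        self.children = []
        self.watchers = []

    def create_ephemeral_sequential(self, prefix):
        # Append a zero-padded, ever-increasing sequence number to the prefix.
        name = "%s%010d" % (prefix, self.next_seq)
        self.next_seq += 1
        self.children.append(name)
        self._fire()
        return name

    def expire_session(self, name):
        # Simulate the session timeout: the ephemeral node disappears.
        self.children.remove(name)
        self._fire()

    def get_children(self, watcher=None):
        if watcher is not None:
            self.watchers.append(watcher)
        return list(self.children)

    def _fire(self):
        # Watches are one-shot: clear them first so callbacks can re-register.
        pending, self.watchers = self.watchers, []
        for watcher in pending:
            watcher()


class Worker(object):
    def __init__(self, ensemble):
        self.ensemble = ensemble
        self.is_master = False
        self.path = ensemble.create_ephemeral_sequential("worker")
        self.check_master()

    def check_master(self):
        # Re-reading the children also sets a fresh watch. In this sketch the
        # callback of an expired worker still runs; it simply finds it is not master.
        children = sorted(self.ensemble.get_children(self.check_master))
        self.is_master = bool(children) and children[0] == self.path


if __name__ == "__main__":
    ensemble = FakeEnsemble()
    first, second = Worker(ensemble), Worker(ensemble)
    print(first.path, first.is_master, second.path, second.is_master)
    ensemble.expire_session(first.path)  # the master "dies"
    print(second.path, second.is_master)
```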
Demo output:

    # First instance
    Connected in 20 ms, handle is 0
    [2011-09-09 12:40:43,702] INFO login ok!
    Node /app/workers/worker created in 4 ms
    [2011-09-09 12:40:43,708] INFO [ worker0000000022(slave) ] register ok! I'm worker0000000022
    [2011-09-09 12:40:43,709] INFO [ worker0000000022(slave) ] /app/workers's children: ['worker0000000022']
    [2011-09-09 12:40:43,709] INFO [ worker0000000022(master) ] I've become master!

    # Now start a second instance
    Connected in 64 ms, handle is 0
    [2011-09-09 12:43:08,334] INFO login ok!
    Node /app/workers/worker created in 11 ms
    [2011-09-09 12:43:08,346] INFO [ worker0000000023(slave) ] register ok! I'm worker0000000023
    [2011-09-09 12:43:08,347] INFO [ worker0000000023(slave) ] /app/workers's children: ['worker0000000022', 'worker0000000023']
    [2011-09-09 12:43:08,347] INFO [ worker0000000023(slave) ] ['worker0000000022'] is masters, I'm slave

    # Kill the master; this is what happens in the second instance
    [2011-09-09 12:44:06,016] INFO [ worker0000000023(slave) ] child changed, try to get master again.
    [2011-09-09 12:44:06,017] INFO [ worker0000000023(slave) ] /app/workers's children: ['worker0000000023']
    [2011-09-09 12:44:06,017] INFO [ worker0000000023(master) ] I've become master!