zoukankan      html  css  js  c++  java
  • Scalable System Design Patterns[转]

    Scalable System Design Patterns

     
    Looking back after 2.5 years since my previous post on scalable system design techniques, I've observed an emergence of a set of commonly used design patterns. Here is my attempt to capture and share them.

    Load Balancer

    In this model, there is a dispatcher that determines which worker instance will handle the request based on different policies. The application should best be "stateless" so any worker instance can handle the request.

    This pattern is deployed in almost every medium to large web site setup.



    Scatter and Gather

    In this model, the dispatcher multicast the request to all workers of the pool. Each worker will compute a local result and send it back to the dispatcher, who will consolidate them into a single response and then send back to the client.

    This pattern is used in Search engines like Yahoo, Google to handle user's keyword search request ... etc.



    Result Cache

    In this model, the dispatcher will first lookup if the request has been made before and try to find the previous result to return, in order to save the actual execution.

    This pattern is commonly used in large enterprise application. Memcached is a very commonly deployed cache server.



    Shared Space

    This model also known as "Blackboard"; all workers monitors information from the shared space and contributes partial knowledge back to the blackboard. The information is continuously enriched until a solution is reached.

    This pattern is used in JavaSpace and also commercial product GigaSpace.



    Pipe and Filter

    This model is also known as "Data Flow Programming"; all workers connected by pipes where data is flow across.

    This pattern is a very common EAI pattern.



    Map Reduce

    The model is targeting batch jobs where disk I/O is the major bottleneck. It use a distributed file system so that disk I/O can be done in parallel.

    This pattern is used in many of Google's internal application, as well as implemented in open source Hadoop parallel processing framework. I also find this pattern can be used in many many application design scenarios.



    Bulk Synchronous Parellel

    This model is based on lock-step execution across all workers, coordinated by a master. Each worker repeat the following steps until the exit condition is reached, when there is no more active workers.
    1. Each worker read data from input queue
    2. Each worker perform local processing based on the read data
    3. Each worker push local result along its direct connection
    This pattern has been used in Google's Pregel graph processing model as well as the Apache Hama project.



    Execution Orchestrator

    This model is based on an intelligent scheduler / orchestrator to schedule ready-to-run tasks (based on a dependency graph) across a clusters of dumb workers.

    This pattern is used in Microsoft's Dryad project



    Although I tried to cover the whole set of commonly used design pattern for building large scale system, I am sure I have missed some other important ones. Please drop me a comment and feedback.

    Also, there is a whole set of scalability patterns around data tier that I haven't covered here. This include some very basic patterns underlying NOSQL. And it worths to take a deep look at some leading implementations.
     
     
     
  • 相关阅读:
    CSS选择符-----关系选择符
    CSS选择符-----元素选择符
    jQuery效果--show([speed,[easing],[fn]])和hide([speed,[easing],[fn]])
    大型网站架构系列:电商网站架构案例
    大型网站架构系列:负载均衡详解(上)
    JBOSS集群和安装
    webwork或Struts配置网站根路径的默认页面办法
    SQL Server 删除重复记录,只保留一条记录
    删除JBOSS eap4.3下的jmx-console、web-console、ws-console、status服务
    SLF4J versions 1.4.0 and later requires log4j 1.2.12 or later 终极解决
  • 原文地址:https://www.cnblogs.com/huangfox/p/2843103.html
Copyright © 2011-2022 走看看