zoukankan      html  css  js  c++  java
  • 2020.2.13

    主要概念

    Term

    Meaning

    Application

    User program built on Spark. Consists of a driver program and executors on
    the cluster.

    Application jar

    A jar containing the user's Spark application. In some cases users will want to create an "uber jar" containing their application along with its dependencies. The user's jar should never include Hadoop or Spark libraries, however, these will
    be added at runtime.

    Driver program

    The process running the main() function of the application and creating the SparkContext

    Cluster manager

    An external service for acquiring resources on the cluster (e.g. standalone manager, Mesos, YARN)

    Deploy mode

    Distinguishes where the driver process runs. In "cluster" mode, the framework launches the driver inside of the cluster. In "client" mode, the submitter launches the driver outside of the cluster.

    Worker node

    Any node that can run application code in the cluster

    Executor

    A process launched for an application on a worker node, that runs tasks and keeps data in memory or disk storage across them. Each application has its own executors.

    Task

    A unit of work that will be sent to one executor

    Job

    A parallel computation consisting of multiple tasks that gets spawned in response to a Spark action (e.g. savecollect);
    you'll see this term used in the driver's logs.

    Stage

    Each job gets divided into smaller sets of tasks called stages that depend on each other (similar to the map
    and reduce stages in MapReduce); you'll see this term used in the driver's logs.

    源文档 <http://spark.apache.org/docs/latest/cluster-overview.html>

  • 相关阅读:
    数值拓展与函数拓展
    正则小记
    Struts2(三)配置详解
    Struts2(二)工作原理
    Struts2(一)基本配置
    Eclipse新建动态web工程项目出现红叉解决方案
    Spring系列之AOP
    Java动态代理的实现机制
    Spring系列之装配Bean
    Spring系列之基本配置
  • 原文地址:https://www.cnblogs.com/yishaui/p/12305639.html
Copyright © 2011-2022 走看看