zoukankan      html  css  js  c++  java
  • 2020.2.13

    主要概念

    Term

    Meaning

    Application

    User program built on Spark. Consists of a driver program and executors on
    the cluster.

    Application jar

    A jar containing the user's Spark application. In some cases users will want to create an "uber jar" containing their application along with its dependencies. The user's jar should never include Hadoop or Spark libraries, however, these will
    be added at runtime.

    Driver program

    The process running the main() function of the application and creating the SparkContext

    Cluster manager

    An external service for acquiring resources on the cluster (e.g. standalone manager, Mesos, YARN)

    Deploy mode

    Distinguishes where the driver process runs. In "cluster" mode, the framework launches the driver inside of the cluster. In "client" mode, the submitter launches the driver outside of the cluster.

    Worker node

    Any node that can run application code in the cluster

    Executor

    A process launched for an application on a worker node, that runs tasks and keeps data in memory or disk storage across them. Each application has its own executors.

    Task

    A unit of work that will be sent to one executor

    Job

    A parallel computation consisting of multiple tasks that gets spawned in response to a Spark action (e.g. savecollect);
    you'll see this term used in the driver's logs.

    Stage

    Each job gets divided into smaller sets of tasks called stages that depend on each other (similar to the map
    and reduce stages in MapReduce); you'll see this term used in the driver's logs.

    源文档 <http://spark.apache.org/docs/latest/cluster-overview.html>

  • 相关阅读:
    mysql单机多实例配置
    mysql中FILE权限
    手把手教你用 Git(转)
    单例模式
    Java中的枚举Enum
    常见排序算法的亲手实现(代码与注释)
    Java中的位运算
    重载
    Java中的boxing和unboxing(转)
    Java的transient关键字(转)
  • 原文地址:https://www.cnblogs.com/yishaui/p/12305639.html
Copyright © 2011-2022 走看看