zoukankan      html  css  js  c++  java
  • 2020.2.13

    主要概念

    Term

    Meaning

    Application

    User program built on Spark. Consists of a driver program and executors on
    the cluster.

    Application jar

    A jar containing the user's Spark application. In some cases users will want to create an "uber jar" containing their application along with its dependencies. The user's jar should never include Hadoop or Spark libraries, however, these will
    be added at runtime.

    Driver program

    The process running the main() function of the application and creating the SparkContext

    Cluster manager

    An external service for acquiring resources on the cluster (e.g. standalone manager, Mesos, YARN)

    Deploy mode

    Distinguishes where the driver process runs. In "cluster" mode, the framework launches the driver inside of the cluster. In "client" mode, the submitter launches the driver outside of the cluster.

    Worker node

    Any node that can run application code in the cluster

    Executor

    A process launched for an application on a worker node, that runs tasks and keeps data in memory or disk storage across them. Each application has its own executors.

    Task

    A unit of work that will be sent to one executor

    Job

    A parallel computation consisting of multiple tasks that gets spawned in response to a Spark action (e.g. savecollect);
    you'll see this term used in the driver's logs.

    Stage

    Each job gets divided into smaller sets of tasks called stages that depend on each other (similar to the map
    and reduce stages in MapReduce); you'll see this term used in the driver's logs.

    源文档 <http://spark.apache.org/docs/latest/cluster-overview.html>

  • 相关阅读:
    python 递归计算阶乘
    python引用
    python3 函数参数
    名片管理系统V0.0.2(函数实现)
    python 之socket语法及相关
    常见模块(一)
    常见模块(二)
    Python之迭代器、生成器、装饰器和递归
    python 之自定义函数
    python 之SET和collections
  • 原文地址:https://www.cnblogs.com/yishaui/p/12305639.html
Copyright © 2011-2022 走看看