zoukankan      html  css  js  c++  java
  • 2020.2.13

    主要概念

    Term

    Meaning

    Application

    User program built on Spark. Consists of a driver program and executors on
    the cluster.

    Application jar

    A jar containing the user's Spark application. In some cases users will want to create an "uber jar" containing their application along with its dependencies. The user's jar should never include Hadoop or Spark libraries, however, these will
    be added at runtime.

    Driver program

    The process running the main() function of the application and creating the SparkContext

    Cluster manager

    An external service for acquiring resources on the cluster (e.g. standalone manager, Mesos, YARN)

    Deploy mode

    Distinguishes where the driver process runs. In "cluster" mode, the framework launches the driver inside of the cluster. In "client" mode, the submitter launches the driver outside of the cluster.

    Worker node

    Any node that can run application code in the cluster

    Executor

    A process launched for an application on a worker node, that runs tasks and keeps data in memory or disk storage across them. Each application has its own executors.

    Task

    A unit of work that will be sent to one executor

    Job

    A parallel computation consisting of multiple tasks that gets spawned in response to a Spark action (e.g. savecollect);
    you'll see this term used in the driver's logs.

    Stage

    Each job gets divided into smaller sets of tasks called stages that depend on each other (similar to the map
    and reduce stages in MapReduce); you'll see this term used in the driver's logs.

    源文档 <http://spark.apache.org/docs/latest/cluster-overview.html>

  • 相关阅读:
    SDN第二次上机作业
    SDN第二次作业
    SDN第一次上机作业
    个人作业-软件产品案例分析
    pandas之Series
    pandas简答介绍
    python爬取英文名
    python爬取动态网页
    python画树
    Python存储数据的方式
  • 原文地址:https://www.cnblogs.com/yishaui/p/12305639.html
Copyright © 2011-2022 走看看