zoukankan      html  css  js  c++  java
  • Spark2 Dataset统计指标:mean均值,variance方差,stddev标准差,corr(Pearson相关系数),skewness偏度,kurtosis峰度

    val df4=spark.sql("SELECT mean(age),variance(age),stddev(age),corr(age,yearsmarried),skewness(age),kurtosis(age) FROM Affairs")
    
    df4.show
    +--------+------------------+------------------+-----------------------+-----------------+--------------------+
    |avg(age)|     var_samp(age)|  stddev_samp(age)|corr(age, yearsmarried)|    skewness(age)|       kurtosis(age)|
    +--------+------------------+------------------+-----------------------+-----------------+--------------------+
    |    34.0|173.33333333333334|13.165611772087667|     0.7456766124552038|0.965388004190285|-0.43417159763313595|
    +--------+------------------+------------------+-----------------------+-----------------+--------------------+
    
  • 相关阅读:
    创建FLASK,同步docker
    FLASK Buleprint
    restful api
    Angular JS
    线程日志
    将项目部署到linux下的docker容器中
    安装和卸载docker
    学习目录总编
    Ansible
    装饰器
  • 原文地址:https://www.cnblogs.com/wwxbi/p/6102545.html
Copyright © 2011-2022 走看看