zoukankan      html  css  js  c++  java
  • Python for Data Science

    Chapter 5 - Outlier Analysis

    Segment 9 - Multivariate analysis for outlier detection

    import pandas as pd
    
    import matplotlib.pyplot as plt
    from pylab import rcParams
    import seaborn as sb
    
    %matplotlib inline
    rcParams['figure.figsize'] = 5, 4
    sb.set_style('whitegrid')
    

    Visually inspecting boxplots

    df = pd.read_csv(filepath_or_buffer='~/Data/iris.data.csv', header=None, sep=',')
    
    df.columns=['Sepal Length','Sepal Width','Petal Length','Petal Width', 'Species']
    
    data = df.iloc[:,0:4].values
    target = df.iloc[:,4].values
    
    df[:5]
    
    sb.boxplot(x='Species', y = 'Sepal Length', data=df, palette='hls')
    
    <matplotlib.axes._subplots.AxesSubplot at 0x7f10bca12e10>
    

    png

    Looking at the scatterplot matrix

    sb.pairplot(df, hue='Species', palette='hls')
    
    <seaborn.axisgrid.PairGrid at 0x7f10bc332ef0>
    

    png

  • 相关阅读:
    PAT 1018. 锤子剪刀布
    PAT 1017. A除以B
    PAT 1016. 部分A+B
    PAT 1015. 德才论
    PAT 1014. 福尔摩斯的约会
    PAT 1013. 数素数
    PAT 1012. 数字分类
    PAT 1011. A+B和C
    292. Nim Game
    412. Fizz Buzz
  • 原文地址:https://www.cnblogs.com/keepmoving1113/p/14286435.html
Copyright © 2011-2022 走看看