zoukankan      html  css  js  c++  java
  • Python for Data Science

    Chapter 5 - Basic Math and Statistics

    Segment 7 - Transforming dataset distributions

    import numpy as np
    import pandas as pd
    import scipy
    
    import matplotlib.pyplot as plt
    from matplotlib import rcParams
    import seaborn as sb
    
    import sklearn
    from sklearn import preprocessing
    from sklearn.preprocessing import scale
    
    %matplotlib inline
    rcParams['figure.figsize'] = 5, 4
    sb.set_style('whitegrid')
    

    Normalizing and transforming features with MinMaxScalar() and fit_transform()

    address = '~/Data/mtcars.csv'
    
    cars = pd.read_csv(address)
    cars.columns = ['car_names','mpg','cyl','disp', 'hp', 'drat', 'wt', 'qsec', 'vs', 'am', 'gear', 'carb']
    
    mpg = cars.mpg
    plt.plot(mpg)
    
    [<matplotlib.lines.Line2D at 0x7f8460556b70>]
    

    png

    cars[['mpg']].describe()
    
    mpg
    count 32.000000
    mean 20.090625
    std 6.026948
    min 10.400000
    25% 15.425000
    50% 19.200000
    75% 22.800000
    max 33.900000
    mpg_matrix = mpg.values.reshape(-1,1)
    
    scaled = preprocessing.MinMaxScaler()
    
    scaled_mpg = scaled.fit_transform(mpg_matrix)
    plt.plot(scaled_mpg)
    
    [<matplotlib.lines.Line2D at 0x7f845fe54828>]
    

    png

    scaled = preprocessing.MinMaxScaler(feature_range=(0,10))
    
    scaled_mpg = scaled.fit_transform(mpg_matrix)
    plt.plot(scaled_mpg)
    
    [<matplotlib.lines.Line2D at 0x7f845fdb8550>]
    

    png

    Using scale() to scale your features

    standardized_mpg = scale(mpg, axis=0, with_mean=False, with_std=False)
    plt.plot(standardized_mpg)
    
    [<matplotlib.lines.Line2D at 0x7f845fd91be0>]
    

    png

    standardized_mpg = scale(mpg)
    plt.plot(standardized_mpg)
    
    [<matplotlib.lines.Line2D at 0x7f845fcf6470>]
    

    png

  • 相关阅读:
    有关javamelody的配置
    jsp页面路径问题
    mavne问题解决---Dynamic Web Module 2.3 or newer
    centos配置数据源和java环境配置
    iBatis之Iterator的使用
    [转]Android MediaPlayer状态机
    [转]Android Activity的加载模式和onActivityResult方法之间的冲突
    ubuntu1304无法启动桌面系统的问题和解决
    java使用AES256解密
    [转]理解Linux的处理器负载均值
  • 原文地址:https://www.cnblogs.com/keepmoving1113/p/14285856.html
Copyright © 2011-2022 走看看