zoukankan      html  css  js  c++  java
  • 机器学习—Logistic Regression

    一、一般模型

    import numpy as np
    import pandas as pd
    import matplotlib.pyplot as plt
    from sklearn.linear_model import LogisticRegression
    from sklearn.model_selection import train_test_split
    from sklearn.preprocessing import StandardScaler
    from sklearn import metrics
    from sklearn.datasets import load_iris
    %matplotlib inline
    #载入数据
    iris = load_iris()
    x = iris.data
    y = iris.target
    x_train,x_test,y_train,y_test = train_test_split(x,y,train_size=0.7,random_state=0)
    #数据标准化
    sc = StandardScaler()
    x_train_std = sc.fit_transform(x_train)
    x_test_std = sc.transform(x_test)
    #建立模型
    lr = LogisticRegression()
    lr.fit(x_train_std,y_train)
    y_pred = lr.predict(x_test_std)
    #检验模型
    accuracy_score = metrics.accuracy_score(y_test,y_pred)   #错误率,也就是np.average(y_test==y_pred)
    accuracy_score

    结果是:0.82222222222222219

    二、加入正则项:

    from sklearn.linear_model import RidgeClassifierCV
    alpha = np.logspace(-3,2,10)
    ridge_model = RidgeClassifierCV(alphas=alpha,cv=5)
    ridge_model.fit(x_train_std,y_train)
    ridge_model.alpha_
    y_pred_ridge = ridge_model.predict(x_test_std)
    accuracy_score = metrics.accuracy_score(y_test,y_pred_ridge)
    accuracy_score

    结果是:0.77777777777777779

  • 相关阅读:
    MFC 将文件拖进对话框获得文件信息
    微软历史最高市值是多少?
    ZooKeeper的学习与应用
    OutputCache祥解
    本人的微博转移
    java list三种遍历方法性能比較
    VS2010旗舰版安装图解
    SSL协议具体解释
    freemarker字符串拼接
    [java web 入门](一)MyEclipse & HelloWorld 记录
  • 原文地址:https://www.cnblogs.com/jiegege/p/7629992.html
Copyright © 2011-2022 走看看