Logistic Regression

Logistic regression model:

    want: 0 <= h_θ(x) <= 1

    h_θ(x) = g(θ^T x)          // θ includes the intercept θ₀; the matching feature x₀ is fixed at 1

    g(z) = 1 / (1 + e^(-z))    // sigmoid function

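In code, g(z) is:

import numpy as np

def sigmoid(inX):
    # Logistic function; works on scalars and NumPy arrays.
    return 1.0 / (1 + np.exp(-inX))

By the properties of the sigmoid function (binary classification):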
    when θ^T x >= 0, h_θ(x) >= 0.5, predict y = 1;

    when θ^T x < 0, h_θ(x) < 0.5, predict y = 0.

P.S. The threshold (0.5 here) can be changed, depending on how much confidence is required.
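
The prediction rule and the adjustable threshold can be sketched as follows. This is a minimal illustration, not code from the original notes: `predict`, its `threshold` parameter, and the values of `theta` and `X` are assumptions made up for the example.

```python
import numpy as np

def sigmoid(z):
    """Logistic function g(z) = 1 / (1 + e^-z)."""
    return 1.0 / (1.0 + np.exp(-z))

def predict(X, theta, threshold=0.5):
    """Return 0/1 predictions; X is (m, n) with a leading column of ones (x_0 = 1)."""
    prob = sigmoid(X @ theta)               # h_theta(x) for every row of X
    return (prob >= threshold).astype(int)  # y = 1 where h_theta(x) >= threshold

# Hypothetical parameters and inputs, chosen only for illustration.
theta = np.array([-1.0, 2.0])   # theta_0 = -1, theta_1 = 2
X = np.array([[1.0, 0.0],       # theta^T x = -1, so h < 0.5
              [1.0, 0.5],       # theta^T x =  0, so h = 0.5
              [1.0, 2.0]])      # theta^T x =  3, so h is about 0.95

print(predict(X, theta))                  # default threshold 0.5
print(predict(X, theta, threshold=0.99))  # stricter threshold: fewer positives
```

Raising the threshold trades recall for confidence: the third sample is predicted positive at 0.5 but not at 0.99.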

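Decision boundary:

    θ^T x = 0, which corresponds to the threshold 0.5 above. The boundary is not necessarily a straight line; its shape depends on the chosen features, for example squared terms such as x1^2 or the product x1*x2.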
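
To see how the choice of features shapes the boundary, here is a small sketch with quadratic features. The helper `map_features` and the particular `theta` are hypothetical, picked so that θ^T x = 0 is the unit circle.

```python
import numpy as np

def map_features(x1, x2):
    """Map (x1, x2) to the feature vector [1, x1, x2, x1^2, x2^2, x1*x2]."""
    x1 = np.asarray(x1, dtype=float)
    x2 = np.asarray(x2, dtype=float)
    return np.column_stack([np.ones_like(x1), x1, x2, x1**2, x2**2, x1 * x2])

# Hypothetical theta: theta^T x = -1 + x1^2 + x2^2, so the decision
# boundary theta^T x = 0 is the circle x1^2 + x2^2 = 1, not a line.
theta = np.array([-1.0, 0.0, 0.0, 1.0, 1.0, 0.0])

points_x1 = [0.0, 0.5, 2.0]
points_x2 = [0.0, 0.5, 0.0]
z = map_features(points_x1, points_x2) @ theta

print(z)                     # negative inside the circle, positive outside
print((z >= 0).astype(int))  # predicted class for each point
```

The model is still linear in θ; only the features are nonlinear in x1 and x2.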

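Cost function:

    J(θ) = (1/m) Σ_{i=1..m} [ -y^(i) log(h_θ(x^(i))) - (1 - y^(i)) log(1 - h_θ(x^(i))) ] + (λ/(2m)) Σ_{j=1..n} θ_j^2

    // m is the number of samples and the sum can be vectorized; the last term is the regularization term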
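
A vectorized version of this cost might look like the sketch below. The function name `cost`, the parameter name `lam` (for λ), the clipping constant and the toy data are assumptions for illustration; θ₀ is left out of the penalty because the regularization sum starts at j = 1.

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def cost(theta, X, y, lam=0.0):
    """Regularized logistic cost J(theta); theta[0] is not penalized."""
    m = X.shape[0]
    h = sigmoid(X @ theta)
    # Assumption: clip h away from 0 and 1 so log() never sees exactly 0.
    h = np.clip(h, 1e-12, 1 - 1e-12)
    data_term = np.mean(-y * np.log(h) - (1 - y) * np.log(1 - h))
    reg_term = lam / (2 * m) * np.sum(theta[1:] ** 2)
    return data_term + reg_term

# Toy data, made up for the example (first column is x_0 = 1).
X = np.array([[1.0, 0.5], [1.0, -1.5], [1.0, 2.0]])
y = np.array([1.0, 0.0, 1.0])

print(cost(np.zeros(2), X, y))                     # equals log(2) at theta = 0
print(cost(np.array([0.0, 1.0]), X, y, lam=1.0))   # includes the penalty term
```

At θ = 0 every prediction is 0.5, so the cost is log 2 regardless of the data, which makes a handy sanity check.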

Goal:

    min_θ J(θ)

    using gradient descent

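    Repeat:

        θ_j := θ_j - α( (1/m) Σ_{i=1..m} (h_θ(x^(i)) - y^(i)) x_j^(i) + (λ/m) θ_j )        (for j >= 1; for j = 0 the (λ/m) θ_j term is dropped, since θ₀ is not regularized in J(θ))

        Note: all θ_j must be updated "simultaneously", because the gradient uses h_θ(x^(i)), which itself depends on every θ_j.

        It is like standing at one point on a hillside and taking the partial derivative in every direction at once; if the updates were not simultaneous, the position would already have moved before the remaining derivatives were computed.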
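
For reference, the gradient used in the update follows from differentiating J(θ) term by term. The derivation below is a standard sketch using the sigmoid identity g'(z) = g(z)(1 - g(z)); the 1/m factor comes from the 1/m in J(θ), and the regularization term applies for j ≥ 1 only.

```latex
\begin{aligned}
g'(z) &= g(z)\,\bigl(1 - g(z)\bigr), \qquad h_\theta(x) = g(\theta^{T}x) \\
\frac{\partial}{\partial\theta_j}\Bigl[-y\log h_\theta(x) - (1-y)\log\bigl(1-h_\theta(x)\bigr)\Bigr]
  &= \bigl(h_\theta(x) - y\bigr)\,x_j \\
\frac{\partial J(\theta)}{\partial\theta_j}
  &= \frac{1}{m}\sum_{i=1}^{m}\bigl(h_\theta(x^{(i)}) - y^{(i)}\bigr)\,x^{(i)}_j
     + \frac{\lambda}{m}\,\theta_j \qquad (j \ge 1)
\end{aligned}
```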
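import numpy as np

def gradDescent(dataMatIn, classLabels):
    # Batch gradient descent without regularization; uses sigmoid() defined above.
    dataMatrix = np.asarray(dataMatIn, dtype=float)                 # shape (m, n)
    labelMat = np.asarray(classLabels, dtype=float).reshape(-1, 1)  # shape (m, 1)
    m, n = dataMatrix.shape
    alpha = 0.001      # learning rate
    maxCycles = 500    # number of iterations
    weights = np.ones((n, 1))
    for k in range(maxCycles):
        h = sigmoid(dataMatrix @ weights)   # predictions for all m samples
        error = h - labelMat
        # The whole weight vector is updated at once (simultaneous update);
        # the 1/m factor is absorbed into alpha here.
        weights = weights - alpha * dataMatrix.T @ error
    return weights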
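
gradDescent above leaves out the regularization term. A variant that includes the (λ/m)θ_j penalty could be sketched like this; the function name `grad_descent_reg`, its default hyperparameters and the toy data are assumptions for illustration, not part of the original notes.

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def grad_descent_reg(X, y, alpha=0.1, lam=1.0, max_cycles=500):
    """Batch gradient descent with L2 regularization (theta_0 not penalized)."""
    X = np.asarray(X, dtype=float)
    y = np.asarray(y, dtype=float)
    m, n = X.shape
    theta = np.zeros(n)
    for _ in range(max_cycles):
        h = sigmoid(X @ theta)           # uses the OLD theta for every component
        grad = X.T @ (h - y) / m         # (1/m) * sum((h - y) * x_j) for all j
        penalty = (lam / m) * theta
        penalty[0] = 0.0                 # the intercept is not regularized
        # Simultaneous update: the new theta is built from the old one in one step.
        theta = theta - alpha * (grad + penalty)
    return theta

# Toy, linearly separable data made up for the example (first column is x_0 = 1).
X = np.array([[1.0, -2.0], [1.0, -1.0], [1.0, 1.0], [1.0, 2.0]])
y = np.array([0, 0, 1, 1])

theta_no_reg = grad_descent_reg(X, y, lam=0.0)
theta_strong_reg = grad_descent_reg(X, y, lam=10.0)
print(theta_no_reg, theta_strong_reg)   # larger lambda shrinks theta_1 toward 0
```

Because the gradient is computed from the old θ before any component changes, the vectorized assignment gives the simultaneous update for free.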
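Optimization:

    overfitting:  reduce the number of features, use a bigger dataset, increase the regularization parameter λ

    underfitting: add features, reduce the regularization parameter λ

    How to choose the learning rate α?

        Run gradient descent with several candidate values and plot the cost function against the number of iterations; keep an α for which J(θ) decreases steadily.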
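
One way to carry out this comparison is to record J(θ) at every iteration for a few candidate learning rates. The sketch below uses synthetic data and hypothetical helper names (`cost`, `cost_history`); the candidate values of α are arbitrary.

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def cost(theta, X, y):
    """Unregularized logistic cost, with h clipped to keep log() finite."""
    h = np.clip(sigmoid(X @ theta), 1e-12, 1 - 1e-12)
    return np.mean(-y * np.log(h) - (1 - y) * np.log(1 - h))

def cost_history(X, y, alpha, max_cycles=200):
    """Run batch gradient descent and return J(theta) before each iteration."""
    theta = np.zeros(X.shape[1])
    history = []
    for _ in range(max_cycles):
        history.append(cost(theta, X, y))
        grad = X.T @ (sigmoid(X @ theta) - y) / len(y)
        theta = theta - alpha * grad
    return history

# Synthetic 1-D data (an assumption for the example): noisy threshold at x = 0.
rng = np.random.default_rng(0)
x = rng.normal(size=100)
y = (x + 0.5 * rng.normal(size=100) > 0).astype(float)
X = np.column_stack([np.ones(100), x])

for alpha in (0.001, 0.1, 1.0):
    hist = cost_history(X, y, alpha)
    print(f"alpha={alpha}: J at iteration 0 / 50 / 199 = "
          f"{hist[0]:.4f} / {hist[50]:.4f} / {hist[-1]:.4f}")
```

Each `hist` list can be passed to a plotting library to draw the curve of cost against iteration count. A very small α shows up as a curve that barely moves, while an α that is too large for the data can make the curve oscillate or grow.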

　　　　

Original article: https://www.cnblogs.com/hao11/p/11668854.html