  • [ML] 2. Introduction to neural networks

    Training an algorithm involves four ingredients:

    • Data
    • Model
    • Objective function: We feed the data into the model and get an output. The objective function scores that output; the resulting value is called the 'loss', and we want to minimize it.
    • Optimization algorithm: For example, for the linear model y = wx + b, we adjust 'w' and 'b' so that the loss is minimized.

    Repeat the process...

    Three types of machine learning:

    Supervised: the algorithm is given feedback (the correct outputs)

    • Classification: outputs are categories: cats or dogs
    • Regression: output would be numbers.

    Unsupervised: No feedback; the algorithm finds patterns in the data on its own

    Reinforcement: Train the algorithm to work in an environment based on the rewards it receives. (Just like training your dog)

    Linear Model:

    f(x) = x * w + b

    x: input

    w: coefficient / weight

    b: intercept / bias
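
    As a minimal sketch of the formula above, the single-input model is one line of code. The function name and the numeric values are illustrative assumptions, not part of the original notes.

```python
def linear_model(x, w, b):
    """Return f(x) = x * w + b for a single input x."""
    return x * w + b


# Hypothetical values chosen only for illustration.
w = 2.0  # weight (coefficient)
b = 1.0  # bias (intercept)

y = linear_model(3.0, w, b)
print(y)  # 3.0 * 2.0 + 1.0 = 7.0
```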

    Linear Model: Multi inputs:

    x, w are both vectors: 

    x: 1 * 2

    w: 2 * 1

    f(x): 1 * 1

    Notice that the linear model doesn't change; it is still:

    f(x) = x * w + b
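
    The shapes above can be checked with a short NumPy sketch. NumPy and the specific numbers are assumptions made for illustration; the notes do not name a library at this point.

```python
import numpy as np

x = np.array([[3.0, 4.0]])    # shape (1, 2): one observation with two inputs
w = np.array([[2.0], [0.5]])  # shape (2, 1): one weight per input
b = 1.0                       # a single bias for the single output

y = x @ w + b                 # the same formula, with * read as a matrix product
print(y.shape)                # (1, 1)
print(y)                      # 3*2 + 4*0.5 + 1 = 9
```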

    Linear Model: multi inputs and multi outputs:

    For 'W', the first dimension always matches the number of inputs (the columns of X); the second dimension always matches the number of outputs (the columns of Y).

    If there are K inputs and M outputs, the number of weights is K * M

    The number of biases is equal to the number of outputs: M

    In terms of shapes, for N observations: Y (N * M) = X (N * K) * W (K * M) + b (1 * M)

    Each model is determined by its weights and biases.
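
    A rough NumPy sketch of the multi-input, multi-output case follows. The sizes N, K and M and the random values are arbitrary assumptions; the point is only to show that the weight count is K * M, the bias count is M, and the bias row is added to every observation.

```python
import numpy as np

N, K, M = 4, 3, 2                 # hypothetical: 4 observations, 3 inputs, 2 outputs
rng = np.random.default_rng(0)    # seeded so the sketch is repeatable

X = rng.normal(size=(N, K))       # inputs
W = rng.normal(size=(K, M))       # K * M weights
b = rng.normal(size=(1, M))       # M biases, one per output

Y = X @ W + b                     # broadcasting adds the bias row to each of the N rows
print(Y.shape)                    # (4, 2), i.e. N * M
print(W.size, b.size)             # 6 2, i.e. K * M weights and M biases
```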

    Objective function:

    It is the measure used to evaluate how well the model's outputs match the desired correct values.

    • Loss function: the lower the loss, the higher the level of accuracy (supervised learning)
    • Reward function: the higher the reward, the higher the level of accuracy (reinforcement learning)

    Loss functions for Supervised learning:

    • Regression: L2-NORM

    • Classification: CROSS-ENTROPY

    As with any loss, a lower cross-entropy is better.
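
    Both losses can be sketched in a few lines of plain Python. This assumes the common definitions: the L2-norm loss as the sum of squared differences between outputs and targets, and cross-entropy as the negative sum of target times log-probability. The function names and sample values are illustrative only.

```python
import math


def l2_norm_loss(outputs, targets):
    """Sum of squared differences between outputs and targets (regression)."""
    return sum((y - t) ** 2 for y, t in zip(outputs, targets))


def cross_entropy_loss(probs, targets):
    """-sum(t * ln(p)) over the classes; probs are predicted class probabilities."""
    return -sum(t * math.log(p) for p, t in zip(probs, targets) if t > 0)


# Regression: two outputs compared with two targets.
print(l2_norm_loss([2.5, 0.0], [3.0, -1.0]))  # 0.25 + 1.0 = 1.25

# Classification: the one-hot target says the second class is correct.
target = [0, 1, 0]
confident_right = cross_entropy_loss([0.1, 0.8, 0.1], target)
mostly_wrong = cross_entropy_loss([0.6, 0.2, 0.2], target)
print(confident_right < mostly_wrong)  # True: the better prediction has the lower loss
```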

    Optimization algorithm: Gradient descent

    The update is repeated until, at some point, the next value no longer changes.

    The picture looks like this:

    Generally, we want the learning rate to be:

      High enough, so we can reach the closest minimum in a reasonable amount of time

      Low enough, so we don't oscillate around the minimum
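
    The trade-off can be seen in a small one-parameter sketch. It assumes the standard update x <- x - learning_rate * f'(x) and uses f(x) = x**2 as a stand-in function; the function, the starting point and the three learning rates are illustrative choices, not taken from the notes.

```python
def gradient_descent(grad, x0, learning_rate, steps):
    """Apply x <- x - learning_rate * grad(x) repeatedly; return every visited x."""
    xs = [x0]
    for _ in range(steps):
        xs.append(xs[-1] - learning_rate * grad(xs[-1]))
    return xs


def grad(x):
    """Derivative of the illustrative function f(x) = x**2 (minimum at x = 0)."""
    return 2 * x


too_low = gradient_descent(grad, x0=4.0, learning_rate=0.001, steps=20)
moderate = gradient_descent(grad, x0=4.0, learning_rate=0.3, steps=20)
too_high = gradient_descent(grad, x0=4.0, learning_rate=0.95, steps=20)

print(too_low[-1])   # still far from 0: the steps are tiny
print(moderate[-1])  # very close to 0
print(too_high[:4])  # the sign flips on every step: it jumps across the minimum
```

    With the lowest rate the value has barely moved after 20 steps, and with the highest rate it keeps overshooting and oscillating around the minimum.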

    N-parameter gradient descent
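
    With several parameters, the same update is applied to every weight and bias at once. The sketch below fits the linear model from earlier on synthetic data. The data, the true parameters, the learning rate, the step count and the choice of half the mean squared error as the loss (an L2-norm loss scaled by 1 / 2N) are all assumptions made for illustration.

```python
import numpy as np

rng = np.random.default_rng(0)
N, K = 200, 2                           # hypothetical: 200 observations, 2 inputs

X = rng.uniform(-1, 1, size=(N, K))     # synthetic inputs
true_w = np.array([[2.0], [-3.0]])      # parameters we hope to recover
true_b = 5.0
T = X @ true_w + true_b                 # noise-free targets

w = np.zeros((K, 1))                    # start from arbitrary parameters
b = 0.0
learning_rate = 0.1

for _ in range(2000):
    Y = X @ w + b                       # model output
    delta = Y - T                       # error per observation, shape (N, 1)
    grad_w = X.T @ delta / N            # gradient of the loss w.r.t. each weight
    grad_b = delta.mean()               # gradient of the loss w.r.t. the bias
    w -= learning_rate * grad_w         # update all parameters in the same step
    b -= learning_rate * grad_b

print(w.ravel(), b)                     # should approach [2, -3] and 5
```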

  • Original article: https://www.cnblogs.com/Answer1215/p/12324642.html