zoukankan      html  css  js  c++  java
  • [ML] 2. Introduction to neural networks

    Training an algorithm involes four ingredients:

    • Data
    • Model
    • Objective function: We put data input a Model and get output out of it. The value we call it as 'lost'. We want to minimize the 'lost' value.
    • Optimization algorithm: For example the linear model, we will try to optimize y = wx + b, 'w' & 'b' so that it will minimize the 'lost' value.

    Repeat the process...

    Three types of machine learning:

    Supervised: Give feedback

    • Classification: outputs are categories: cats or dogs
    • Regression: output would be numbers.

    Unsupervised: No feedback, find parttens

    Reinforcement: Train the algorithm to works in a enviorment based on the rewords it receives. (Just like training your dog)

    Linear Model:

    f(x) = x * w + b

    x: input

    w: coefficient / weight

    b: intercept / bias

    Linear Model: Multi inputs:

    x, w are both vectors: 

    x: 1 * 2

    w: 2 * 1

    f(x): 1 * 1

    Notice that the lienar model doesn't chage, it is still:

    f(x) = x * w + b

    Lienar Model: multi inputs and multi outputs:

    For 'W', the first index is always the same as X; the second index is always the same as ouput Y.

    If there is K inputs and M outputs, the number of Weigths would be K * M

    The number of bias is equal to the number of ouputs: M

    N * M = (N * K) * (K * M) + 1 * M

    Each model is determined by its weights and biases.

    Objection function:

    Is the measure used to evaluate how well the model's output match the desired correct values.

    • Loss function: the lower the loss function, the higher the level of accuracy (Supervized learning)
    • Reward function: the hight of the reward function, the higher the level of accuracy (Reubfircement learning)

    Loss functions for Supervised learning:

    • Regression: L2-NORM

    • Classification: CROSS-ENTROPY

    Expect cross-entropy should be lower.

    Optimization algorithm: Dradient descent

    Until one point, the following value never update anymore.

    The picture looks like this:

    Generally, we want the learning rate to be:

      High enough, so we can reach the closest minimum in a rational amount of time

      Low enough, so we don't oscillate around the minimum

    N-parameter gradient descent

  • 相关阅读:
    Spring MVC 核心组件详解
    Spring MVC 入门就这一篇
    Spring 事务解决方案
    【UGUI源码分析】Unity遮罩之Mask详细解读
    游戏开发中不同时区下的时间问题
    ARTS第十三周(阅读Tomcat源码)
    Win10 电脑安装.NET低版本提示“这台计算机中已经安装了 .NET Framwork 4.6.2或版本更高的更新”问题
    Dynamics 365 Setup 提示SqlServer 存在
    Dynamics CRM "Verification of prerequisites for Domain Controller promotion failed. Certificate Server is installed."
    Dynamics CRM
  • 原文地址:https://www.cnblogs.com/Answer1215/p/12324642.html
Copyright © 2011-2022 走看看