  • What are fit and overfitting

    1. In statistics, goodness of fit refers to how closely a model's predicted values match the observed (true) values.
    2. Overfitting: a model that has learned the noise instead of the signal is considered "overfit" because it fits the training dataset well but fits new data poorly.
    3. Underfitting occurs when a model is too simple (informed by too few features or regularized too much), which makes it too inflexible to learn the patterns in the dataset.
    4. Simple learners tend to have less variance in their predictions but more bias towards wrong outcomes. Complex learners, on the other hand, tend to have less bias but more variance in their predictions. Both bias and variance are forms of prediction error in machine learning. Typically, we can reduce error from bias but might increase error from variance as a result, or vice versa.
      This trade-off between too simple (high bias) and too complex (high variance) is a key concept in statistics and machine learning, and one that affects all supervised learning algorithms.
    5. How to detect overfitting

    5.1 We can split our initial dataset into separate training and test subsets. If the model performs much better on the training set than on the test set, it is likely overfitting.
    5.2 Another tip is to start with a very simple model to serve as a benchmark. Then, as you try more complex algorithms, you'll have a reference point to see whether the additional complexity is worth it.
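
    The two tips above can be sketched in a few lines of plain Python. This is only an illustration under stated assumptions, not a recipe from the original notes: the toy signal y = 2x + 1 with Gaussian noise, the 70/30 split, and the helper names (make_data, fit_mean, fit_line, fit_1nn) are all made up for this example. It compares a trivial mean-predictor benchmark, a straight-line fit, and a 1-nearest-neighbour model that simply memorizes the training points.

```python
import random


def make_data(n, rng):
    """Noisy samples of a toy signal y = 2x + 1 (an assumption for this sketch)."""
    xs = [rng.uniform(0.0, 10.0) for _ in range(n)]
    return [(x, 2.0 * x + 1.0 + rng.gauss(0.0, 2.0)) for x in xs]


def mse(predict, data):
    """Mean squared error of a prediction function on (x, y) pairs."""
    return sum((predict(x) - y) ** 2 for x, y in data) / len(data)


def fit_mean(train):
    """Benchmark: always predict the mean training target (very high bias)."""
    mean_y = sum(y for _, y in train) / len(train)
    return lambda x: mean_y


def fit_line(train):
    """Ordinary least squares for a single feature."""
    n = len(train)
    mean_x = sum(x for x, _ in train) / n
    mean_y = sum(y for _, y in train) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in train)
    sxx = sum((x - mean_x) ** 2 for x, _ in train)
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return lambda x: slope * x + intercept


def fit_1nn(train):
    """1-nearest neighbour: memorizes every training point, noise included."""
    return lambda x: min(train, key=lambda p: abs(p[0] - x))[1]


rng = random.Random(0)
data = make_data(200, rng)
rng.shuffle(data)
train, test = data[:140], data[140:]  # 70% training, 30% held-out test

results = {}
for name, fit in [("mean benchmark", fit_mean),
                  ("straight line", fit_line),
                  ("1-NN memorizer", fit_1nn)]:
    model = fit(train)
    results[name] = (mse(model, train), mse(model, test))
    train_err, test_err = results[name]
    print(f"{name:<16} train MSE = {train_err:7.3f}   test MSE = {test_err:7.3f}")
```

    The memorizer reproduces every training target exactly, so its training error is zero by construction, while its error on the held-out test set is not. That gap between training and test error is the symptom to look for. The mean benchmark gives the reference point: a more complex model is only worth keeping if it clearly beats the benchmark on the test set.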

  • Original article: https://www.cnblogs.com/qiulinzhang/p/9513414.html