zoukankan      html  css  js  c++  java
  • What is fit and overfit

    1. In statistics, goodness of fit refers to how closely a model's predicted values match the observed(true) values
    2. overfit: A model that has learned the noise instead of the signal is considered "overfit" because it fits the training dataset but has poor fit with new dataset.
    3. Underfitting occurs when a model is too simple —— informed by too few features or regularized too much(由于太少的特征或太多的正则化导致)-----which makes it inflexible in learning the dataset.
    4. simple learners tend to have less variance in their predictions but more bias towards wrong outcome; On the other hand, complex learners tend to have more variance in their predictions. Both bias and variance are forms of prediction error in machine learning. Typically, we can reduce error from bias but might increase error from variance as a result, or vice versa.
      This trade-off between too simple (high bias) vs. too complex (high variance) is a key concept in statistics and machine learning, and one that affects all supervised learning algorithm
    5. How to detect a overfitting

    5.1 We can split our initial dataset into separate training and test subsets
    5.2 Another tip is to start with a very simple model to serve as a benchmark (基准),Then, as you try more complex algorithms, you’ll have a reference point to see if the additional complexity is worth it.

  • 相关阅读:
    pyinstall打包工具
    python中用xlsxwriter创建图表
    python打包工具 cx_Freeze介绍
    JavaScript、Dom和jQuery
    MHA原理
    mysql-mmm实现高可用和部署时须要考虑的问题
    jquery中attr和prop的区别
    FLUSH TABLES WITH READ LOCK 获取锁的速度
    Python Memcached、Redis & RabbitMQ使用
    innodb_flush_log_at_trx_commit
  • 原文地址:https://www.cnblogs.com/qiulinzhang/p/9513414.html
Copyright © 2011-2022 走看看