  • Data mining 4_7

    Well, I really think you have to take notes here, because whenever I come across a new word, I always get lost.

    Let's get down to business. A decision tree is rule-based, but it also uses some statistical methods, in other words, heuristic rules.

     

    Error rate

    You can't rely on the error rate on the training data; what we care about is the error rate on the test data. Consequently, you can use an optimistic or a pessimistic statistical estimate to approximate that error rate.
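
    As a rough illustration of the two estimates, here is a minimal sketch. The notes do not give a formula, so this assumes one common textbook form: the optimistic estimate takes the training error as is, and the pessimistic estimate adds a fixed penalty per leaf (0.5 here, an assumed value). The function names and the numbers are hypothetical.

```python
def optimistic_error(train_errors: int, n_records: int) -> float:
    """Optimistic estimate: assume the training error rate carries over unchanged."""
    return train_errors / n_records


def pessimistic_error(train_errors: int, n_leaves: int, n_records: int,
                      penalty: float = 0.5) -> float:
    """Pessimistic estimate: charge an extra penalty for every leaf.

    The 0.5-per-leaf penalty is an assumption for this sketch, not something
    stated in the notes.
    """
    return (train_errors + penalty * n_leaves) / n_records


# Hypothetical tree: 1000 training records, 10 misclassified, 30 leaves.
opt = optimistic_error(10, 1000)
pes = pessimistic_error(10, 30, 1000)
print(f"optimistic:  {opt:.3f}")
print(f"pessimistic: {pes:.3f}")
```

    With the penalty, a bigger tree has to earn its extra leaves by reducing training errors, which is why the pessimistic estimate is useful when comparing trees of different sizes.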

    Problems from missing values

    Firstly, when there is not much data, the data does not have reliable statistical properties, so you can't use statistical methods to make predictions.

    Secondly, decision trees are not unique: many different trees can fit the same data. Consequently, we use a greedy or heuristic algorithm to find a reasonably good tree.

    Thirdly, some points can't be separated by testing a single attribute at a time; you may need to test an expression over several attributes, such as the boundary x + y = 1.
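
    The third point can be checked on a toy example. The sketch below uses made-up 2-D points whose classes are divided by the line x + y = 1, tries every single-attribute threshold test, and compares the best of them with the combined test x + y > 1. The data and helper names are hypothetical.

```python
# Hypothetical points (x, y, label): label 0 lies below x + y = 1, label 1 above.
points = [
    (0.1, 0.2, 0), (0.8, 0.1, 0), (0.1, 0.8, 0), (0.4, 0.4, 0),
    (0.9, 0.3, 1), (0.3, 0.9, 1), (0.6, 0.6, 1), (0.2, 0.85, 1),
]


def accuracy(predict, data):
    """Fraction of points whose predicted label matches the true label."""
    return sum(predict(x, y) == label for x, y, label in data) / len(data)


def best_axis_split_accuracy(data):
    """Best accuracy of any test of the form 'attribute > threshold'."""
    best = 0.0
    for axis in (0, 1):                  # 0 tests x, 1 tests y
        for row in data:
            threshold = row[axis]        # candidate thresholds come from the data
            for above in (0, 1):         # label assigned to the 'greater than' side
                def predict(x, y, a=axis, t=threshold, lab=above):
                    return lab if (x, y)[a] > t else 1 - lab
                best = max(best, accuracy(predict, data))
    return best


axis_accuracy = best_axis_split_accuracy(points)
oblique_accuracy = accuracy(lambda x, y: 1 if x + y > 1 else 0, points)

print("best single-attribute split:", axis_accuracy)
print("split on x + y > 1:         ", oblique_accuracy)
```

    A tree limited to single-attribute tests would need several levels to approximate this boundary with a staircase, while one test on the combined expression separates the points directly.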

     

    Model Evaluation

    When you create a decision tree, you need model evaluation to know whether it is any good. So we introduce two matrices: the confusion matrix and the cost matrix. Which one matters more depends on the setting the model is used in.
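
    To make the two matrices concrete, here is a small sketch for a binary problem. The labels, the predictions, and the cost values are all invented for illustration; the cost matrix simply assumes that missing a positive (FN) is ten times worse than a false alarm (FP).

```python
from collections import Counter


def confusion_matrix(actual, predicted):
    """Count the four outcomes for a binary problem (1 = positive, 0 = negative)."""
    counts = Counter(zip(actual, predicted))
    return {
        "TP": counts[(1, 1)],  # actual positive, predicted positive
        "FN": counts[(1, 0)],  # actual positive, predicted negative
        "FP": counts[(0, 1)],  # actual negative, predicted positive
        "TN": counts[(0, 0)],  # actual negative, predicted negative
    }


def total_cost(cm, cost):
    """Weight every cell of the confusion matrix by its cost and sum up."""
    return sum(cm[cell] * cost[cell] for cell in cm)


# Hypothetical labels and predictions.
actual    = [1, 1, 1, 1, 0, 0, 0, 0, 0, 0]
predicted = [1, 1, 0, 0, 0, 0, 0, 0, 1, 0]

# Hypothetical cost matrix: a missed positive costs 10, a false alarm costs 1.
cost = {"TP": 0, "FN": 10, "FP": 1, "TN": 0}

cm = confusion_matrix(actual, predicted)
model_accuracy = (cm["TP"] + cm["TN"]) / len(actual)
model_cost = total_cost(cm, cost)

print(cm)
print("accuracy:", model_accuracy)
print("cost:    ", model_cost)
```

    Two models with the same accuracy can have very different total cost, which is the reason for looking at the cost matrix as well as the confusion matrix.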

    Lastly, we introduce three measures for evaluating the model: precision, recall, and the F-measure.
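
    The three measures can be computed directly from confusion-matrix counts. In this sketch, "F" is taken to be the F1 score (the harmonic mean of precision and recall), which is an assumption because the notes do not say which F-measure is meant. The counts are hypothetical.

```python
def precision(tp: int, fp: int) -> float:
    """Of everything predicted positive, the fraction that really is positive."""
    return tp / (tp + fp) if (tp + fp) else 0.0


def recall(tp: int, fn: int) -> float:
    """Of everything that really is positive, the fraction that was found."""
    return tp / (tp + fn) if (tp + fn) else 0.0


def f1_score(p: float, r: float) -> float:
    """Harmonic mean of precision and recall (assumed meaning of 'F')."""
    return 2 * p * r / (p + r) if (p + r) else 0.0


# Hypothetical counts: 2 true positives, 1 false positive, 2 false negatives.
p = precision(tp=2, fp=1)
r = recall(tp=2, fn=2)
f = f1_score(p, r)

print(f"precision = {p:.3f}, recall = {r:.3f}, F1 = {f:.3f}")
```

    Because it is a harmonic mean, F1 stays low unless both precision and recall are reasonably high.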

  • Original article: https://www.cnblogs.com/chuanlong/p/3006622.html