zoukankan      html  css  js  c++  java
  • Something wrong with FTK OCR

    A case about business secret the suspect took lots of photos and screenshots from BOM, RD papers... We have to conduct a keyword search to find out what he/she had stolen. We're not going to spend time to "take a look" at all documents and pictures so we need to use OCR function to figure it out. As you could see that FTK OCR options support file type like pdf,jpg,tiff..etc.

    1.First we have to check OCR function when processing evidence.

    2. Sorry I can't show you guys contents in the evidence. So I use other files instead of files in the evidence. The search critereia is "apple" or "fbi". As we could see that the search result is 20hits on a file - order.pdf. No any other htis on other files in the evidence.

    3.But what about other files??? Actually some pictures also have  "apple" or "fbi", but FTK could not recognize them. You guys could take a look at those two pics as below;

    4.Let's use other OCR tool like JOCR to figure it out. Fortunately JOCR could recognize texts in those two pictures and extract them successfully.

    5. To my surprise FTK could not extract texts on those pics. I used to trust its OCR function, but now my confidence on FTK's OCR function is eroded. 

  • 相关阅读:
    jupyterlab数据处理
    系统监测模块
    登录验证码的实现
    编码格式检测chardet模块
    图像处理pillow模块
    内存数据的读取
    力扣(LeetCode)728. 自除数
    力扣(LeetCode)709. 转换成小写字母
    Java 层序创建和遍历二叉树
    力扣(LeetCode) 849. 到最近的人的最大距离
  • 原文地址:https://www.cnblogs.com/pieces0310/p/5297350.html
Copyright © 2011-2022 走看看