zoukankan      html  css  js  c++  java
  • tesseract-ocr4.0 安装部署及训练验证码识别

    1.  下载最新版本的leptonica,  leptonica-1.74.1.tar.gz  

    2.  编译安装

    tar -zxvf leptonica-1.74.1.tar.gz
    cd leptonica-1.74.1
    ./configure
    make
    sudo make install

    3. 安装相关依赖库

    sudo apt-get install autoconf automake libtool
    sudo apt-get install autoconf-archive
    sudo apt-get install pkg-config
    sudo apt-get install libpng12-dev
    sudo apt-get install libjpeg8-dev
    sudo apt-get install libtiff5-dev
    sudo apt-get install zlib1g-dev
    
    #if you plan to install the training tools, you also need the following libraries:
    
    sudo apt-get install libicu-dev
    sudo apt-get install libpango1.0-dev
    sudo apt-get install libcairo2-dev

    4. 下载编译安装最新版本 tesseract-4.0, 

    git clone --depth 1 https://github.com/tesseract-ocr/tesseract.git
    cd tesseract
    ./autogen.sh
    ./configure --enable-debug
    LDFLAGS="-L/usr/local/lib" CFLAGS="-I/usr/local/include" make
    sudo make install
    sudo ldconfig

    5. 使用

    # 查看版本号
    tesseract -v
    
    # 查看tesseract 支持语言
    tesseract --list-langs
    
    # 识别 test.jpg 图片文字
    tesseract test.jpg out -l eng
    more out.txt
    每天一小步,人生一大步!Good luck~
  • 相关阅读:
    css之深入理解padding
    css布局大杂烩
    css深入理解margin
    css之深入理解border
    css样式画各种图形
    css Sprite雪碧图
    JVM,JRE,JDK
    JAVA 遍历数组
    JAVA 得到数组的长度
    大一对软件工程
  • 原文地址:https://www.cnblogs.com/jkmiao/p/6417167.html
Copyright © 2011-2022 走看看