zoukankan      html  css  js  c++  java
  • Ubuntu tesseract

    安装tesseract:

    sudo apt install tesseract-ocr
    sudo apt install libtesseract-dev

    安装模型

    sudo apt-get install tesseract-ocr-chi-sim

    命令行调用

    tesseract test.jpeg ./test -l chi-sim

    C语言API

    #include <stdio.h>
    
    #include <tesseract/baseapi.h>
    #include <leptonica/allheaders.h>
    
    int main() {
        char *outText;
    
        tesseract::TessBaseAPI *api = new tesseract::TessBaseAPI();
        // Initialize tesseract-ocr with English, without specifying tessdata path
        if (api->Init(NULL, "eng+chi_sim")) {
            fprintf(stderr, "Could not initialize tesseract.
    ");
            exit(1);
        }
    
        // Open input image with leptonica library
        Pix *image = pixRead("./invoice_dialog.png");
        api->SetImage(image);
        // Get OCR result
        outText = api->GetUTF8Text();
        printf("OCR output:
    %s", outText);
    
        // Destroy used object and release memory
        api->End();
        delete [] outText;
        pixDestroy(&image);
    
        return 0;
    }

    编译 

    g++ -o myprogram myprogram.cpp -llept -ltesseract

    出错

    /usr/bin/ld: /tmp/ccIl3vUD.o: undefined reference to symbol '__gxx_personality_v0@@CXXABI_1.3'
    //usr/lib/x86_64-linux-gnu/libstdc++.so.6: 无法添加符号: DSO missing from command line
    collect2: error: ld returned 1 exit status
    The terminal process terminated with exit code: 1

    加入选项

    g++ -o myprogram myprogram.cpp -llept -ltesseract -L/usr/lib/x86_64-linux-gnu -lstdc++

     出错

    fatal error: leptonica/allheaders.h: No such file or directory
    sudo apt install libleptonica-dev
  • 相关阅读:
    NOIP前的一些计划
    回文自动机[学习笔记]
    Luogu-3705 [SDOI2017]新生舞会
    01分数规划[学习笔记]
    jvm参数总结
    读懂jstack
    PRODUCER配置加载
    为什么WAIT必须在同步块中
    NIO学习笔记
    dp求最长递增子序列并输出
  • 原文地址:https://www.cnblogs.com/csig/p/11023185.html
Copyright © 2011-2022 走看看