  • Ubuntu tesseract

    Install tesseract:

    sudo apt install tesseract-ocr
    sudo apt install libtesseract-dev

    Install the Simplified Chinese language model

    sudo apt-get install tesseract-ocr-chi-sim

    Command-line usage

    tesseract test.jpeg ./test -l chi_sim
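
The `-l` option takes one language code, or several joined with `+` (for example `eng+chi_sim`); the C++ API's `Init` call accepts the same string. As a minimal sketch, the string could be assembled from a list of codes like this. `joinLanguages` is a hypothetical helper introduced for illustration, not part of Tesseract.

```cpp
#include <string>
#include <vector>

// Hypothetical helper (not part of Tesseract): joins language codes with '+',
// giving the string format accepted by `tesseract -l` and TessBaseAPI::Init.
std::string joinLanguages(const std::vector<std::string>& codes) {
    std::string joined;
    for (const std::string& code : codes) {
        if (code.empty()) continue;          // skip blank entries
        if (!joined.empty()) joined += '+';  // separator between codes
        joined += code;
    }
    return joined;
}
```

Calling `joinLanguages({"eng", "chi_sim"})` yields the string to pass after `-l`. Each code still needs its language model package installed.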

    C++ API

    #include <stdio.h>
    #include <stdlib.h>

    #include <tesseract/baseapi.h>
    #include <leptonica/allheaders.h>

    int main() {
        tesseract::TessBaseAPI *api = new tesseract::TessBaseAPI();
        // Initialize tesseract-ocr with English and Simplified Chinese,
        // without specifying a tessdata path
        if (api->Init(NULL, "eng+chi_sim")) {
            fprintf(stderr, "Could not initialize tesseract.\n");
            delete api;
            exit(1);
        }

        // Open the input image with the Leptonica library
        Pix *image = pixRead("./invoice_dialog.png");
        if (image == NULL) {
            fprintf(stderr, "Could not read the input image.\n");
            api->End();
            delete api;
            exit(1);
        }
        api->SetImage(image);

        // Get the OCR result
        char *outText = api->GetUTF8Text();
        printf("OCR output:\n%s", outText);

        // Destroy used objects and release memory
        api->End();
        delete api;
        delete [] outText;
        pixDestroy(&image);

        return 0;
    }

    Compile

    g++ -o myprogram myprogram.cpp -llept -ltesseract

    Error

    /usr/bin/ld: /tmp/ccIl3vUD.o: undefined reference to symbol '__gxx_personality_v0@@CXXABI_1.3'
    //usr/lib/x86_64-linux-gnu/libstdc++.so.6: error adding symbols: DSO missing from command line
    collect2: error: ld returned 1 exit status
    The terminal process terminated with exit code: 1

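    Link the C++ standard library explicitly (this error usually means libstdc++ was not pulled in at link time, for example when the build is driven by gcc instead of g++):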

    g++ -o myprogram myprogram.cpp -llept -ltesseract -L/usr/lib/x86_64-linux-gnu -lstdc++

    Error

    fatal error: leptonica/allheaders.h: No such file or directory

    Fix: install the Leptonica development package

    sudo apt install libleptonica-dev
  • Original article: https://www.cnblogs.com/csig/p/11023185.html