zoukankan      html  css  js  c++  java
  • PDFBox 解析PDF文件-解析服务器文件

    1.首先引进pom

    <!-- PDF读取依赖 -->
    <dependency>
    <groupId>org.apache.pdfbox</groupId>
    <artifactId>pdfbox</artifactId>
    <version>2.0.4</version>
    </dependency>

    2.controller层直接代码

    /**
    * PDF解析
    * @return
    */
    @PostMapping("/getPdf")
    public StringBuffer getPdf(@RequestBody JSONObject jsonObject) throws IOException {
    StringBuffer stringBuffer = null;

    //获取服务器地址
    ImportParams params = new ImportParams();
    params.setSaveUrl("/file");
    String filePath = jsonObject.getString("filePath");
    filePath = fileServer + "/" + filePath;
    URL url = new URL(filePath);
    URLConnection connection = url.openConnection();
    InputStream inputStream = connection.getInputStream();
    try {
    PDDocument document;
    PDFParser parser = new PDFParser(new RandomAccessBuffer(inputStream));
    parser.parse();
    document = parser.getPDDocument();
    document.getClass();
    if(!document.isEncrypted()) {
    PDFTextStripperByArea stripper = new PDFTextStripperByArea();
    stripper.setSortByPosition(true);
    PDFTextStripper textStripper = new PDFTextStripper();
    String exposeContent = textStripper.getText(document);
    String[] content = exposeContent.split("\n");
    stringBuffer = new StringBuffer();
    for(String line:content) {
    stringBuffer.append(line);
    }
    }

    } catch (Exception e) {
    e.printStackTrace();

    }
    return stringBuffer;
    }
  • 相关阅读:
    TensorFlow实现LeNet5模型
    jmeter+influxdb+grafana性能测试可视化报告
    jmeter命令执行脚本
    jmeter JSON Extractor使用
    jmeter上传文件
    jenkins登录信息无效,忘记密码
    XML
    JMeter函数和变量
    jmeter配置CSV Data Set Config
    jmeter发送Query String Parameters格式参数报错
  • 原文地址:https://www.cnblogs.com/shxkey/p/12427472.html
Copyright © 2011-2022 走看看