import com.ironsoftware.ironpdf.*; import java.io.IOException; import java.nio.file.Paths; // Apply your license key License.setLicenseKey("YOUR-LICENSE-KEY"); // Set a log path Settings.setLogPath(Paths.get("C:/tmp/IronPdfEngine.log")); // Render the HTML as a PDF. Stored in myPdf as type PdfDocument; PdfDocument myPdf = PdfDocument.renderHtmlAsPdf("<h1> ~Hello World~ </h1> Made with IronPDF!"); // Save the PdfDocument to a file myPdf.saveAs(Paths.get("html_saved.pdf"));

在 JAVA 中使用 IRONPDF

如何在 Java 中读取 PDF 文件

Darrius Serrant

已更新:2025年7月28日

在Java中读取PDF文档可以是任何项目的重要组成部分，从商业应用到数据分析。使用IronPDF库，将PDF处理功能集成到您的Java项目中比以往更加容易。

## 如何用 Java 阅读 PDF 文件

安装 IronPDF 以在 Java 中读取 PDF 文件。
使用 `fromFile` 方法加载现有 PDF 文档
从 HTML 字符串、文件或网络 URL 生成新的 PDF
利用 `extractAllText` 方法从打开的 PDF 中读取文本
将提取的 PDF 文本打印到控制台或保存在 Java 中

IronPDF: 导入Java PDF库

IronPDF for Java PDF库概述是软件开发人员需要快速从HTML生成高质量捕获准备好的PDF的完美解决方案。该库还提供强大的文档操作工具，能够动态控制IronPDF中的页面布局和格式、内容和格式。

让我们看看如何使用IronPDF库在Java程序中读取位于路径中的PDF文件。

使用IronPDF读取PDF

第一步是使用Maven安装IronPDF；可以在IronPDF安装指南中找到更多详细信息。

在Maven中安装IronPDF

以下是在Maven项目中安装IronPDF的步骤：

在您首选的IDE中打开您的Maven项目。

在dependencies部分添加IronPDF库的依赖。


<dependency>
    <groupId>com.ironsoftware</groupId>
    <artifactId>ironpdf</artifactId>
    <version>Your_IronPDF_Version_Here</version>
</dependency>


<dependency>
    <groupId>com.ironsoftware</groupId>
    <artifactId>ironpdf</artifactId>
    <version>Your_IronPDF_Version_Here</version>
</dependency>

XML

保存pom.xml文件，并让Maven下载并安装IronPDF库。

安装完成后，您应该可以在项目中导入和使用IronPDF的类。

Java代码读取PDF文档

这里是您可以使用的代码，无论是否有表格边界，都可以使用IronPDF库来读取文件。

import com.ironsoftware.ironpdf.PdfDocument;
import java.io.IOException;
import java.nio.file.Paths;

/**
 * This class demonstrates how to read text from a PDF document using the IronPDF library.
 */
public class PdfReader {
    public static void main(String[] args) {
        try {
            // Load the PDF document from the specified file path
            PdfDocument pdf = PdfDocument.fromFile(Paths.get("C:\\sample.pdf"));

            // Extract all text content from the loaded PDF document
            String text = pdf.extractAllText();

            // Print the extracted text to the console
            System.out.println(text);
        } catch (IOException e) {
            // Handle exceptions that may occur during file loading or reading.
            e.printStackTrace();
        }
    }
}

import com.ironsoftware.ironpdf.PdfDocument;
import java.io.IOException;
import java.nio.file.Paths;

/**
 * This class demonstrates how to read text from a PDF document using the IronPDF library.
 */
public class PdfReader {
    public static void main(String[] args) {
        try {
            // Load the PDF document from the specified file path
            PdfDocument pdf = PdfDocument.fromFile(Paths.get("C:\\sample.pdf"));

            // Extract all text content from the loaded PDF document
            String text = pdf.extractAllText();

            // Print the extracted text to the console
            System.out.println(text);
        } catch (IOException e) {
            // Handle exceptions that may occur during file loading or reading.
            e.printStackTrace();
        }
    }
}

JAVA

在这个程序中，IronPDF中的PdfDocument对象。然后在这个对象上调用String。提取的文本将打印到控制台。程序包括使用try-catch块的错误处理以管理潜在的IOException。

程序输出