How to View PDF Files in C++

Curtis Chau

Updated: October 16, 2025

PDF files are a widely used format for document exchange because they preserve formatting across platforms. In many applications, reading the contents of PDF files programmatically is invaluable.

In this article, we will learn how to view text from PDF files in C++ using the Xpdf command-line tool. Xpdf provides a suite of command-line utilities and C++ libraries for working with PDF files, including text extraction. By integrating Xpdf into our C++ PDF viewer program, we can efficiently view text content from PDF files and process it programmatically.

`Xpdf` - C++ Library and Command-line Tools

Xpdf is an open-source software suite that offers a range of tools and libraries for working with PDF files. It includes several command-line tools and C++ libraries that provide PDF-related functionality such as parsing, rendering, printing, and text extraction. The Xpdf command-line tools also make it possible to inspect PDF files directly from the terminal.

One of the key components of Xpdf is pdftotext, which is primarily known for extracting text content from PDF files. However, when used in combination with other tools like pdftops and pdfimages, Xpdf allows users to view the PDF content in different ways. The pdftotext tool proves valuable for extracting textual information from PDFs for further processing or analysis, and it offers options to specify which pages to extract text from.
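As a small illustration of the page-selection options mentioned above, the sketch below builds a pdftotext command line limited to a page range using the tool's `-f` (first page) and `-l` (last page) options. The helper name `buildPageRangeCommand` is hypothetical and not part of Xpdf; it only assembles the string that would be passed to `system`.

```cpp
#include <string>

// Hypothetical helper (not part of Xpdf): builds a pdftotext command line
// restricted to the pages firstPage..lastPage.
// -f and -l are pdftotext's "first page" and "last page" options.
std::string buildPageRangeCommand(const std::string& pdfPath,
                                  const std::string& txtPath,
                                  int firstPage,
                                  int lastPage)
{
    // Both paths are wrapped in double quotes so that spaces survive the shell.
    return "pdftotext -f " + std::to_string(firstPage) +
           " -l " + std::to_string(lastPage) +
           " \"" + pdfPath + "\" \"" + txtPath + "\"";
}
```

For example, `buildPageRangeCommand("input.pdf", "output.txt", 2, 5)` yields a command that should extract only pages 2 through 5.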

Prerequisites

Before we begin, make sure you meet the following prerequisites:

A C++ compiler such as GCC or Clang installed on your system. In this tutorial, we will use the Code::Blocks IDE.
The Xpdf command-line tools installed and accessible from the command line. Download Xpdf and install the version appropriate for your environment. Then add Xpdf's bin directory to the system's PATH environment variable so that the tools can be run from any location in the file system.
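One way to confirm the PATH setup from inside a program is to look for the pdftotext executable in each PATH directory before trying to run it. The following is a minimal sketch, assuming a C++17 compiler; the helper names `isOnPath` and `isPdftotextAvailable` are hypothetical, the check only tests that a regular file with that name exists (not that it is executable), and on Windows it assumes the file is named with an `.exe` suffix.

```cpp
#include <cstdlib>
#include <filesystem>
#include <sstream>
#include <string>
#include <system_error>

namespace fs = std::filesystem;

// Hypothetical helper: returns true if a file called `name` exists in one of
// the directories listed in `pathVar` (the contents of the PATH variable).
bool isOnPath(const std::string& name, const std::string& pathVar)
{
#ifdef _WIN32
    const char separator = ';';
    const std::string fileName = name + ".exe"; // assumption: standard suffix
#else
    const char separator = ':';
    const std::string fileName = name;
#endif
    std::istringstream dirs(pathVar);
    std::string dir;
    while (std::getline(dirs, dir, separator)) {
        if (dir.empty()) {
            continue; // skip empty PATH entries
        }
        std::error_code ec; // non-throwing overload: errors count as "not found"
        if (fs::is_regular_file(fs::path(dir) / fileName, ec)) {
            return true;
        }
    }
    return false;
}

// Convenience wrapper that reads the real PATH environment variable.
bool isPdftotextAvailable()
{
    const char* pathVar = std::getenv("PATH");
    return pathVar != nullptr && isOnPath("pdftotext", pathVar);
}
```

Calling `isPdftotextAvailable()` at startup lets a program print a clear "Xpdf not found" message instead of failing later with a generic error.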

Creating a PDF Viewer Project

Open Code::Blocks: Launch the Code::Blocks IDE on your computer.
Create a New Project: Click "File" in the top menu and select "New" from the drop-down menu. Then click "Project" in the submenu.
Select Project Type: In the "New from template" window, select "Console application" and click "Go." Then select the "C/C++" language and click "Next."
Enter Project Details: In the "Project title" field, give your project a name (e.g., "PDFViewer"). Choose the location where you want to save the project files, and click "Next."
Select Compiler: Choose the compiler you want to use for your project. By default, Code::Blocks should have automatically detected the available compilers on your system. If not, select a suitable compiler from the list, and click "Finish."

Steps to View Text from PDF in C++

Include the Necessary Headers

First, let's add the required header files to our main.cpp file:

#include <cstdlib>   // system
#include <cstdio>    // remove
#include <fstream>   // ifstream
#include <iostream>  // cout
#include <string>    // string

using namespace std; // Use standard namespace for convenience

Set Input and Output Paths

string pdfPath = "input.pdf";
string outputFilePath = "output.txt";

In the main function, we declare two strings: pdfPath and outputFilePath. pdfPath stores the path to the input PDF file, and outputFilePath stores the path where the extracted text will be saved as a plain text file.
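Instead of hard-coding both paths, the output path can be derived from the input path. The sketch below assumes a C++17 compiler and uses `std::filesystem::path::replace_extension`; the helper name `deriveOutputPath` is hypothetical and introduced only for illustration.

```cpp
#include <filesystem>
#include <string>

// Hypothetical helper: swaps the extension of the input path for ".txt",
// e.g. "reports/input.pdf" -> "reports/input.txt".
std::string deriveOutputPath(const std::string& pdfPath)
{
    std::filesystem::path out(pdfPath);
    out.replace_extension(".txt"); // replaces ".pdf" (or appends if there is none)
    return out.string();
}
```

With this helper, only `pdfPath` needs to be set by hand, and `outputFilePath` can be initialized as `deriveOutputPath(pdfPath)`.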

Input file is as follows:

How to View PDF Files in C++: Figure 1

Execute the `pdftotext` Command

// Construct the pdftotext command; quote both paths so spaces in file names are preserved
string command = "pdftotext \"" + pdfPath + "\" \"" + outputFilePath + "\"";

// Execute the command using the system function and capture its status
int status = system(command.c_str());

Here, we construct the pdftotext command from the pdfPath and outputFilePath variables; when run, it converts the text of the PDF file into a plain text file. The system function is then called to execute the command, and its return value is stored in the status variable.

Check Text Extraction Status

if (status == 0) 
{
    cout << "Text extraction successful." << endl;
} 
else 
{ 
    cout << "Text extraction failed." << endl; 
}

We check the status variable to see if the pdftotext command executed successfully. If status is equal to 0, it means the text extraction was successful, and we print a success message. If the status is non-zero, it indicates an error, and we print an error message.
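A non-zero status can be made more informative. The value returned by `system` is implementation-defined; on POSIX systems it is a wait status that has to be decoded with `WIFEXITED` and `WEXITSTATUS`. The sketch below shows one possible way to decode it and map the result to a message. The function names are hypothetical, and the exit-code meanings are taken from the Xpdf man page as an assumption; verify them against the version you have installed.

```cpp
#include <cstdlib>
#include <string>
#ifndef _WIN32
#include <sys/wait.h> // WIFEXITED, WEXITSTATUS
#endif

// Converts the raw value returned by system() into the child's exit code.
// Returns -1 if the command could not be started or did not exit normally.
int toExitCode(int status)
{
    if (status == -1) {
        return -1;
    }
#ifdef _WIN32
    return status; // on Windows, system() returns the exit code directly
#else
    return WIFEXITED(status) ? WEXITSTATUS(status) : -1;
#endif
}

// Assumed meanings of pdftotext exit codes (per the Xpdf man page).
std::string describePdftotextExit(int exitCode)
{
    switch (exitCode) {
        case 0:  return "No error.";
        case 1:  return "Error opening the PDF file.";
        case 2:  return "Error opening the output file.";
        case 3:  return "Error related to PDF permissions.";
        case 99: return "Other error.";
        default: return "Unknown exit code.";
    }
}
```

In the failure branch, `describePdftotextExit(toExitCode(status))` could be printed next to the generic "Text extraction failed." message.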

Read Extracted Text and Display

// Open the output file to read the extracted text
ifstream outputFile(outputFilePath);
if (outputFile.is_open()) 
{ 
    string textContent;
    string line;
    while (getline(outputFile, line)) 
    {
        textContent += line + "\n"; // Concatenate each line to the text content
    }
    outputFile.close();
    cout << "Text content extracted from PDF:" << endl;
    cout << textContent << endl;
} 
else 
{
    cout << "Failed to open output file." << endl;
}

In the above sample code, we open the outputFile (the text file generated by pdftotext), read its content line by line, and store it in the textContent string. Finally, we close the file and print the extracted text content on the console.
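If the intermediate text file is not needed at all, an alternative is to let pdftotext write to standard output (by passing `-` as the output file name) and read that output through a pipe. This is a different technique from the file-based approach described in this article, sketched here under the assumption that `popen` (POSIX) or `_popen` (Windows) is available; `runAndCapture` and `extractTextToString` are hypothetical names.

```cpp
#include <array>
#include <cstdio>
#include <stdexcept>
#include <string>

#ifdef _WIN32
#define GR_POPEN _popen
#define GR_PCLOSE _pclose
#else
#define GR_POPEN popen
#define GR_PCLOSE pclose
#endif

// Runs a shell command and returns everything it wrote to standard output.
std::string runAndCapture(const std::string& command)
{
    FILE* pipe = GR_POPEN(command.c_str(), "r");
    if (!pipe) {
        throw std::runtime_error("Could not start: " + command);
    }

    std::string output;
    std::array<char, 4096> buffer{};
    std::size_t bytesRead = 0;
    // Read the pipe in chunks until the command closes its output.
    while ((bytesRead = std::fread(buffer.data(), 1, buffer.size(), pipe)) > 0) {
        output.append(buffer.data(), bytesRead);
    }
    GR_PCLOSE(pipe);
    return output;
}

// Hypothetical wrapper: "-" as the output name asks pdftotext to print to stdout.
std::string extractTextToString(const std::string& pdfPath)
{
    return runAndCapture("pdftotext \"" + pdfPath + "\" -");
}
```

The trade-off is portability: `system` is part of the C++ standard library, whereas `popen` is a platform facility, so the file-based version remains the more portable choice.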

Remove Output File

If you do not need the output text file or want to free up disk space, simply delete it at the end of the program, before the main function returns, with the following call:

// Remove the output file to free up disk space and if output is not needed
remove(outputFilePath.c_str());
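A call to `remove` at the end of main is skipped if the function returns early. One possible alternative is a small RAII guard whose destructor deletes the file whenever the guard goes out of scope. The class name `ScopedFileRemover` is hypothetical; this is a sketch, not something Xpdf or the standard library provides.

```cpp
#include <cstdio>
#include <string>
#include <utility>

// Hypothetical RAII helper: deletes the named file when the object is destroyed,
// including on early returns from the enclosing scope.
class ScopedFileRemover
{
public:
    explicit ScopedFileRemover(std::string path) : path_(std::move(path)) {}

    // std::remove returns non-zero on failure (e.g. the file never existed);
    // that result is deliberately ignored here.
    ~ScopedFileRemover() { std::remove(path_.c_str()); }

    // Non-copyable, so the file is only removed once.
    ScopedFileRemover(const ScopedFileRemover&) = delete;
    ScopedFileRemover& operator=(const ScopedFileRemover&) = delete;

private:
    std::string path_;
};
```

Declaring `ScopedFileRemover cleanup(outputFilePath);` right after the paths are set would make the explicit `remove` call unnecessary.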

Compiling and Running the Program

Build the code using the "Ctrl+F9" shortcut key. Upon successful compilation, running the executable will extract the text content from the specified PDF document and display it on the console. The output is as follows:

How to View PDF Files in C++: Figure 2
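For reference, the individual steps of this walkthrough can be combined into a single function. The sketch below is one possible arrangement rather than the article's exact program: the function name `viewPdfText` and its parameters are hypothetical, and it assumes pdftotext is reachable through PATH.

```cpp
#include <cstdio>
#include <cstdlib>
#include <fstream>
#include <iostream>
#include <string>

// Hypothetical wrapper around the steps above: run pdftotext, print the
// extracted text to `out`, then delete the temporary text file.
// Returns true on success.
bool viewPdfText(const std::string& pdfPath,
                 const std::string& outputFilePath,
                 std::ostream& out)
{
    // Quote both paths so that spaces in file names survive the shell.
    const std::string command =
        "pdftotext \"" + pdfPath + "\" \"" + outputFilePath + "\"";

    // A non-zero status means the tool failed or could not be started.
    if (std::system(command.c_str()) != 0) {
        out << "Text extraction failed." << std::endl;
        return false;
    }

    std::ifstream outputFile(outputFilePath);
    if (!outputFile.is_open()) {
        out << "Failed to open output file." << std::endl;
        return false;
    }

    out << "Text content extracted from PDF:" << std::endl;
    std::string line;
    while (std::getline(outputFile, line)) {
        out << line << '\n';
    }
    outputFile.close();

    // The text file was only an intermediate result, so remove it.
    std::remove(outputFilePath.c_str());
    return true;
}
```

A main function could then consist of a single call such as `return viewPdfText("input.pdf", "output.txt", std::cout) ? 0 : 1;`.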

View PDF Files in C#

IronPDF .NET C# Library is a powerful .NET C# PDF library that allows users to easily view PDF files within their C# applications. Leveraging the Chromium web browser engine, IronPDF accurately renders and displays PDF content, including images, fonts, and complex formatting. With its user-friendly interface and extensive functionalities, developers can seamlessly integrate IronPDF into their C# projects, enabling users to view PDF documents efficiently and interactively. Whether it's for displaying reports, invoices, or any other PDF content, IronPDF provides a robust solution for creating feature-rich PDF viewers in C#.

To install the IronPDF NuGet package in Visual Studio, follow these steps:

Open Visual Studio: Launch Visual Studio or any other IDE of your preference.
Create or Open Your Project: Create a new C# project or open an existing one where you want to install the IronPDF package.
Open the NuGet Package Manager: In Visual Studio, go to "Tools" > "NuGet Package Manager" > "Manage NuGet Packages for Solution". Alternatively, right-click the solution in Solution Explorer and select "Manage NuGet Packages for Solution".
Search for IronPDF: In the "NuGet Package Manager" window, click on the "Browse" tab, and then search for "IronPDF" in the search bar. Alternatively, visit the NuGet IronPDF Package and directly download the latest version of "IronPDF".
Select IronPDF Package: Find the "IronPDF" package and click on it to select it for your project.
Install IronPDF: Click the "Install" button to install the selected package.
Alternatively, you can install IronPDF from the NuGet Package Manager Console with the following command:
```
Install-Package IronPdf
```

Using IronPDF, we can extract text and images from PDF documents and display them in the console for viewing. The following code helps accomplish this task:

using IronPdf;
using IronSoftware.Drawing;
using System.Collections.Generic;

// Extracting Image and Text content from Pdf Documents
// Open a 128-bit encrypted PDF
var pdf = PdfDocument.FromFile("encrypted.pdf", "password");

// Get all text to put in a search index
string text = pdf.ExtractAllText();

// Get all Images
var allImages = pdf.ExtractAllImages();

// Or even find the precise text and images for each page in the document
for (var index = 0 ; index < pdf.PageCount ; index++)
{
    int pageNumber = index + 1;
    text = pdf.ExtractTextFromPage(index);
    List<AnyBitmap> images = pdf.ExtractBitmapsFromPage(index);
    // Further processing here...
}

For more detailed information on IronPDF, please visit the IronPDF Documentation.

Conclusion

In this article, we learned how to extract and view the contents of a PDF document in C++ using the Xpdf command-line tool. This approach allows us to process and analyze the extracted text within our C++ applications seamlessly.

A free trial license is available for testing IronPDF in commercial projects.

Curtis Chau

Technical Writer

Curtis Chau holds a bachelor's degree in computer science (Carleton University) and specializes in front-end development, with expertise in Node.js, TypeScript, JavaScript, and React. Passionate about creating intuitive and aesthetically pleasing user interfaces, Curtis enjoys working with modern frameworks and creating well-organized, visually appealing...
