Test in production without watermarks.
Works wherever you need it to.
Get 30 days of fully functional product.
Have it up and running in minutes.
Full access to our support engineering team during your product trial
In today's digital era, PDF files are integral to many workflows across education, business, and personal use. They are a standard format for sharing and presenting diverse data, including text, images, and tables.
While displaying full PDF documents with complete visual fidelity within a C# Windows Forms application can involve dedicated rendering components, developers often have other needs. Sometimes, the goal is to read PDF text in C#, extract data, or display the textual content of a PDF for quick review, indexing, or accessibility.
This article will guide you through creating an application that focuses on this specific task: building a simple C# PDF text content viewer using IronPDF, a powerful .NET library. You'll learn how to use IronPDF to load a PDF and effectively extract and display its text content in a Windows Forms application.
IronPDF is a comprehensive C# library that empowers .NET developers to create, edit, and process PDF files within their applications. It allows users to convert HTML, images, and SVG to PDF documents, and importantly for this tutorial, to read and extract content from existing PDFs. IronPDF is designed for ease of use and provides a wide range of features to manipulate PDF files.
To create this C# PDF text display application, you will need:
IronPDF can also create PDFs from HTML, a separate feature from the text extraction shown in this tutorial.
To begin, launch Visual Studio and click on "Create a new project." Select "Windows Forms App (.NET Framework)" or a similar .NET template from the list.
Visual Studio New Project Creation
Next, provide a name for your project (e.g., CSharpPdfTextReader
) and click the Create button. This will set up a new Windows Forms Application project.
IronPdf
package and click "Install". Installing IronPDF via NuGet Package Manager
Alternatively, open the Package Manager Console (Tools > NuGet Package Manager > Package Manager Console) and run the command:
Install-Package IronPdf
This will download and install IronPDF and its dependencies into your project.
We will use a RichTextBox
control to display the extracted text content from the PDF. A RichTextBox
is suitable for showing formatted text, though for this tutorial, its primary role is to present the plain text extracted by IronPDF. It effectively shows the textual information without attempting to render the PDF's original visual layout.
To add a RichTextBox
:
RichTextBox
under "Common Controls," drag it onto your form.Name
(e.g., pdfDataRichTextBox
) and set its Dock
property to Fill
if you want it to take up most of the form. Adding a RichTextBox to Form1 to display extracted PDF text
Add a Button
control to your form. Users will click this button to open a file dialog and select a PDF file for text extraction.
Button
from the Toolbox onto your form.Name
(e.g., openBtn
) and Text
(e.g., "Open PDF & Display Text"). Adding a Button to Form1 to trigger PDF selection
Double-click the button you just added ("Open PDF & Display Text") to create its Click
event handler in Form1.cs
.
First, ensure you have the IronPDF namespace imported at the top of your Form1.cs
file:
using IronPdf;
using System; // For EventArgs, Exception
using System.Windows.Forms; // For OpenFileDialog, MessageBox, DialogResult, etc.
using IronPdf;
using System; // For EventArgs, Exception
using System.Windows.Forms; // For OpenFileDialog, MessageBox, DialogResult, etc.
Imports IronPdf
Imports System ' For EventArgs, Exception
Imports System.Windows.Forms ' For OpenFileDialog, MessageBox, DialogResult, etc.
Now, implement the event handler for the button click. This code will:
ExtractAllText()
method to get all text from the PDF.RichTextBox
.private void openBtn_Click(object sender, EventArgs e)
{
// Create an OpenFileDialog to open PDF files
var openFileDialog = new OpenFileDialog
{
Filter = "PDF files (*.pdf)|*.pdf|All files (*.*)|*.*", // Filter to show only PDFs
Title = "Select a PDF file to extract text from" // Dialog title
};
// Show dialog and check if the user selected a file
if (openFileDialog.ShowDialog() == DialogResult.OK)
{
try
{
// It's recommended to set your license key once at application startup.
// License.LicenseKey = "YourIronPdfLicenseKey";
// If no key is set, IronPDF runs in trial mode (watermarks on output, time limits).
// For text extraction, the trial is fully functional for development.
// Load the selected PDF using IronPDF
var pdf = PdfDocument.FromFile(openFileDialog.FileName);
// Extract all text content from the PDF using IronPDF
string extractedText = pdf.ExtractAllText();
// Display the extracted text in the RichTextBox
// (Assuming your RichTextBox is named pdfDataRichTextBox, change if different)
pdfDataRichTextBox.Text = extractedText;
}
catch (Exception ex)
{
// Show error message if an exception occurs
MessageBox.Show("An error occurred while processing the PDF file: " + ex.Message, "Error", MessageBoxButtons.OK, MessageBoxIcon.Error);
}
}
}
private void openBtn_Click(object sender, EventArgs e)
{
// Create an OpenFileDialog to open PDF files
var openFileDialog = new OpenFileDialog
{
Filter = "PDF files (*.pdf)|*.pdf|All files (*.*)|*.*", // Filter to show only PDFs
Title = "Select a PDF file to extract text from" // Dialog title
};
// Show dialog and check if the user selected a file
if (openFileDialog.ShowDialog() == DialogResult.OK)
{
try
{
// It's recommended to set your license key once at application startup.
// License.LicenseKey = "YourIronPdfLicenseKey";
// If no key is set, IronPDF runs in trial mode (watermarks on output, time limits).
// For text extraction, the trial is fully functional for development.
// Load the selected PDF using IronPDF
var pdf = PdfDocument.FromFile(openFileDialog.FileName);
// Extract all text content from the PDF using IronPDF
string extractedText = pdf.ExtractAllText();
// Display the extracted text in the RichTextBox
// (Assuming your RichTextBox is named pdfDataRichTextBox, change if different)
pdfDataRichTextBox.Text = extractedText;
}
catch (Exception ex)
{
// Show error message if an exception occurs
MessageBox.Show("An error occurred while processing the PDF file: " + ex.Message, "Error", MessageBoxButtons.OK, MessageBoxIcon.Error);
}
}
}
Private Sub openBtn_Click(ByVal sender As Object, ByVal e As EventArgs)
' Create an OpenFileDialog to open PDF files
Dim openFileDialog As New OpenFileDialog With {
.Filter = "PDF files (*.pdf)|*.pdf|All files (*.*)|*.*",
.Title = "Select a PDF file to extract text from"
}
' Show dialog and check if the user selected a file
If openFileDialog.ShowDialog() = DialogResult.OK Then
Try
' It's recommended to set your license key once at application startup.
' License.LicenseKey = "YourIronPdfLicenseKey";
' If no key is set, IronPDF runs in trial mode (watermarks on output, time limits).
' For text extraction, the trial is fully functional for development.
' Load the selected PDF using IronPDF
Dim pdf = PdfDocument.FromFile(openFileDialog.FileName)
' Extract all text content from the PDF using IronPDF
Dim extractedText As String = pdf.ExtractAllText()
' Display the extracted text in the RichTextBox
' (Assuming your RichTextBox is named pdfDataRichTextBox, change if different)
pdfDataRichTextBox.Text = extractedText
Catch ex As Exception
' Show error message if an exception occurs
MessageBox.Show("An error occurred while processing the PDF file: " & ex.Message, "Error", MessageBoxButtons.OK, MessageBoxIcon.Error)
End Try
End If
End Sub
Code Breakdown:
openFileDialog
: A standard dialog for file selection, filtered for PDF files.PdfDocument.FromFile(openFileDialog.FileName)
: This IronPDF method loads the chosen PDF into a PdfDocument
object.pdf.ExtractAllText()
: This is the key IronPDF function for this tutorial. It reads through the entire PDF and extracts all discernible text content into a single string. This is incredibly useful for C# parse PDF text scenarios.pdfDataRichTextBox.Text = extractedText;
: The extracted text is then assigned to the Text
property of your RichTextBox
(ensure the name pdfDataRichTextBox
matches the name you gave your RichTextBox control).This demonstrates how IronPDF simplifies reading PDF text in C#, allowing developers to access PDF content programmatically with minimal effort.
Your application window will appear. Click the "Open PDF & Display Text" button, select a PDF file from your computer, and click "Open."
Running the C# PDF Text Viewer Application
The RichTextBox
will then display the text content extracted from the selected PDF file.
Text content extracted from the PDF and displayed in the
RichTextBox
For information on visually rendering PDFs in MAUI applications (which is different from this tutorial's text extraction focus), you might explore "PDF Viewing in MAUI Tutorial".
By following these steps, you've created a C# Windows Forms application that effectively extracts and displays text content from PDF files using IronPDF. This approach is valuable when you need to programmatically access the textual information within PDFs for display, analysis, or further processing in your .NET applications.
IronPDF provides robust capabilities for C# PDF text extraction, and it's just one part of its comprehensive feature set. You can also use IronPDF for more advanced tasks like text searching within PDFs, adding annotations, printing PDF documents, PDF encryption and decryption, and editing PDF forms.
Remember, this tutorial focused on one specific use case: making PDF text accessible in a C# application. You can adapt and expand upon this foundation to meet more complex requirements.
If you're interested in exploring the full potential of IronPDF:
IronPDF is a comprehensive C# library that empowers .NET developers to create, edit, and process PDF files within their applications. It allows users to convert HTML, images, and SVG to PDF documents, and to read and extract content from existing PDFs.
To create a C# PDF text display application, you will need Visual Studio as your Integrated Development Environment and IronPDF, which is a NuGet package providing functionality to read, create, and manipulate PDF documents.
To start a new Windows Forms Application, launch Visual Studio and select 'Create a new project.' Choose 'Windows Forms App (.NET Framework)' or a similar .NET template, provide a project name, and click 'Create.'
You can install IronPDF using the NuGet Package Manager GUI by right-clicking your project in Solution Explorer, selecting 'Manage NuGet Packages,' and searching for 'IronPdf.' Alternatively, use the Package Manager Console with the command 'Install-Package IronPdf.'
A RichTextBox control is used to display the extracted text content from the PDF. It is suitable for showing formatted text and can be added to your form from the Toolbox.
Drag a Button control from the Toolbox onto your form. Set its Name and Text properties in the Properties window. This button will be used to open a file dialog for PDF selection.
IronPDF extracts text from a PDF using the method ExtractAllText(). This method reads the entire PDF and extracts all discernible text content into a single string.
You can handle exceptions by wrapping the PDF processing code in a try-catch block. Display an error message using MessageBox.Show if an exception occurs during PDF processing.
To build and run the application, select 'Build Solution' from the Build menu in Visual Studio. Once built successfully, press F5 or click 'Start' to run the application. Use the button to select a PDF and display its text.
Beyond text extraction, IronPDF offers features like HTML to PDF conversion, text searching, adding annotations, printing PDFs, PDF encryption and decryption, and editing PDF forms.