Read PDF Files in C#

C#

using IronPdf;
using IronSoftware.Drawing;
using System.Collections.Generic;

// Extract image and text content from PDF documents

// Open a 128-bit encrypted PDF
var pdf = PdfDocument.FromFile("encrypted.pdf", "password");

// Get all text to put in a search index
string text = pdf.ExtractAllText();

// Get all images
var allImages = pdf.ExtractAllImages();

// Or find the text and images on each individual page of the document
for (var index = 0; index < pdf.PageCount; index++)
{
    int pageNumber = index + 1; // 1-based page number, e.g. for display
    text = pdf.ExtractTextFromPage(index); // the page index is zero-based
    List<AnyBitmap> images = pdf.ExtractBitmapsFromPage(index);
    //...
}

VB

Imports IronPdf
Imports IronSoftware.Drawing
Imports System.Collections.Generic

' Extract image and text content from PDF documents

' Open a 128-bit encrypted PDF
Dim pdf = PdfDocument.FromFile("encrypted.pdf", "password")

' Get all text to put in a search index
Dim text As String = pdf.ExtractAllText()

' Get all images
Dim allImages = pdf.ExtractAllImages()

' Or find the text and images on each individual page of the document
For index = 0 To pdf.PageCount - 1
	Dim pageNumber As Integer = index + 1 ' 1-based page number, e.g. for display
	text = pdf.ExtractTextFromPage(index) ' the page index is zero-based
	Dim images As List(Of AnyBitmap) = pdf.ExtractBitmapsFromPage(index)
	'...
Next index

Install-Package IronPdf

Read PDF Files in C#

The PdfDocument.ExtractAllText method from the IronPDF C# PDF library returns the text of a whole PDF document as a single string, which suits straightforward text-reading tasks such as building a search index. The method is designed to handle whitespace and encoding discrepancies within source PDF documents.

PdfDocument.ExtractTextFromPage reads the text from a single page of a PDF. In the example above, it is called in a loop with a zero-based page index to retrieve the text of each page in turn.
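Once the text of each page is available, finding which pages mention a given word is ordinary string searching. The sketch below is one possible way to do it and is not part of IronPDF: the class PageSearch and the method FindPagesContaining are hypothetical names introduced for illustration, and the helper works on plain strings so that it does not depend on the library.

```csharp
using System;
using System.Collections.Generic;

// Hypothetical helper (not an IronPDF type): searches already-extracted page text.
public static class PageSearch
{
    // Returns the 1-based numbers of the pages whose text contains the keyword.
    // The comparison is case-insensitive; an empty keyword matches nothing.
    public static List<int> FindPagesContaining(IReadOnlyList<string> pageTexts, string keyword)
    {
        var hits = new List<int>();
        if (string.IsNullOrEmpty(keyword))
        {
            return hits;
        }

        for (var index = 0; index < pageTexts.Count; index++)
        {
            // Treat a missing page text as empty rather than failing.
            var text = pageTexts[index] ?? string.Empty;
            if (text.IndexOf(keyword, StringComparison.OrdinalIgnoreCase) >= 0)
            {
                hits.Add(index + 1); // convert the zero-based index to a page number
            }
        }

        return hits;
    }
}

// Possible usage with the calls from the example above ("invoice" is an assumed keyword):
//   var pages = new List<string>();
//   for (var i = 0; i < pdf.PageCount; i++) pages.Add(pdf.ExtractTextFromPage(i));
//   List<int> matches = PageSearch.FindPagesContaining(pages, "invoice");
```

Keeping the search separate from the extraction step means the same helper can be reused for text obtained in other ways, for example text cached in a search index.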

IronPDF can also extract images from PDFs. For this, use any of the following methods from the PdfDocument class:

ExtractAllImages: returns all images embedded in a PDF as IronSoftware.Drawing.AnyBitmap objects.
ExtractAllRawImages: retrieves all embedded images as a list of raw byte arrays (byte[]).
ExtractImagesFromPage: extracts the images contained on the page at a given index.
ExtractImagesFromPages: same as ExtractImagesFromPage, but for a specific page range or a list of individual pages.
ExtractRawImagesFromPage and ExtractRawImagesFromPages: work the same as the previous two methods, but return the extracted images as byte arrays instead of as IronSoftware.Drawing.AnyBitmap objects.
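The raw-byte variants are convenient when the images only need to be stored rather than processed. The sketch below shows one way to write such byte arrays to disk. RawImageWriter and SaveAll are hypothetical names, not IronPDF APIs. The sketch also assumes that the list returned by ExtractAllRawImages can be enumerated as byte[] items, as the description above suggests. The encoding of each image is not known here, so the files get a neutral .bin extension.

```csharp
using System.Collections.Generic;
using System.IO;

// Hypothetical helper (not an IronPDF type): writes raw image bytes to numbered files.
public static class RawImageWriter
{
    // Writes each byte array to outputDirectory as image-1.bin, image-2.bin, ...
    // and returns the paths of the files that were written, in order.
    public static List<string> SaveAll(IEnumerable<byte[]> rawImages, string outputDirectory)
    {
        // Creates the directory if it is missing; does nothing if it already exists.
        Directory.CreateDirectory(outputDirectory);

        var paths = new List<string>();
        var counter = 0;
        foreach (var bytes in rawImages)
        {
            counter++;
            var path = Path.Combine(outputDirectory, $"image-{counter}.bin");
            File.WriteAllBytes(path, bytes);
            paths.Add(path);
        }

        return paths;
    }
}

// Possible usage, assuming pdf was opened as in the example above
// ("extracted-images" is an assumed directory name):
//   RawImageWriter.SaveAll(pdf.ExtractAllRawImages(), "extracted-images");
```

If the image format matters, the AnyBitmap-returning methods are probably the better starting point, since they give typed image objects rather than undifferentiated bytes.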

How to Read PDF Files in C#

Download Read and Write PDF C# Library
Extract Images or Text from PDF
Read and Find Words in Specific Documents
View PDF Output from your original document

Ready to get started? Version: 2024.7 just released
