iTextSharp Read PDF Alternatives (Developer Tutorial)

PDF (Portable Document Format) is a widely used file format for sharing documents consistently and securely. Reading and manipulating such files in C# is a common requirement in various applications, such as document management systems, reporting tools, and more. In this article, we will compare two popular libraries for reading PDF files in C#: IronPDF and iTextSharp (the latest .NET library iText).

IronPDF is a comprehensive C# library from Iron Software that provides a wide range of features for working with PDF files. It allows developers to create, edit, and manipulate PDF documents seamlessly. IronPDF is known for its simplicity and ease of use, making it an excellent choice for developers who need to integrate PDF functionality into their applications quickly.

iTextSharp is another popular library for working with PDF files in C#. It has been around for quite some time and is widely used in the industry. iText provides a rich set of features for creating and manipulating PDF documents. It is known for its flexibility and extensibility, making it suitable for complex PDF-related tasks.

How to Use IronPDF Vs iTextSharp to Read PDFs in C#

  1. Create a new C# project in Visual Studio to compare IronPDF Vs iTextSharp for reading PDF file.
  2. Install IronPDF and iTextSharp libraries to the project.
  3. Read PDF files using IronPDF.
  4. Read PDF files using iTextSharp.

Prerequisites

  1. Visual Studio: Ensure you have Visual Studio or any other C# development environment installed.
  2. NuGet Package Manager: Make sure you can use NuGet to manage packages in your project.

Step 1: Create a new C# project in Visual Studio to compare IronPDF Vs iTextSharp read PDF files

Begin by setting up a C# Console Application. Open Visual Studio and select Create a new project. Select Console Application type.

iTextSharp Read PDF Alternatives (Developer Tutorial): Figure 1 - Console App

Provide the project name as shown below.

iTextSharp Read PDF Alternatives (Developer Tutorial): Figure 2 - Project Configuration

Select the required .NET version for the project.

iTextSharp Read PDF Alternatives (Developer Tutorial): Figure 3 - Framework

Once this is done, Visual Studio will generate a new project.

Step 2: Install IronPDF and iTextSharp libraries to the project

iTextSharp can be installed from the NuGet Package Manager. The latest version is available as an iText package.

iTextSharp Read PDF Alternatives (Developer Tutorial): Figure 4 - iText

Or from the Visual Studio Package Manager as shown below. Search for iText in Package Manager and click Install.

iTextSharp Read PDF Alternatives (Developer Tutorial): Figure 5 - NuGet Package Manager

IronPDF can be installed from the NuGet Package Manager as shown below.

iTextSharp Read PDF Alternatives (Developer Tutorial): Figure 6 - IronPDF

Or from the Visual Studio package manager as shown below. Search for IronPDF in Package Manager and click Install.

iTextSharp Read PDF Alternatives (Developer Tutorial): Figure 7 - Install IronPDF

Step 3: Read Text from a PDF file using IronPDF

Add the below code to the program.cs file and provide a sample PDF document which has the below content.

iTextSharp Read PDF Alternatives (Developer Tutorial): Figure 8 - PDF Input

using IronPdf;
Console.WriteLine("Comparison of IronPDF And itextsharp Read PDF Files in C#");
// pdfreader reader
ReadUsingIronPDF.Read();
public class ReadUsingIronPDF
{
    public static void Read()
    {
        // read from specific location
        string filename = "C:\\code\\articles\\ITextSharp\\ITextSharpIronPdfDemo\\Example.pdf";
        var pdfReader = PdfDocument.FromFile(filename);
        // Get all text to put in a search index using new simpletextextractionstrategy
        var allText = pdfReader.ExtractAllText();
        Console.WriteLine("------------------Text From PDF-----------------");
        Console.WriteLine(allText);
        Console.WriteLine("------------------Text From PDF-----------------");
        var allIMages = pdfReader.ExtractAllImages();
        Console.WriteLine("------------------Image Count From PDF-----------------");
        Console.WriteLine($"Total Images={allIMages.Count()}");
        Console.WriteLine("------------------Image Count From PDF-----------------");
        Console.WriteLine("------------------one Page Text From PDF page-----------------");
        var pageCount = pdfReader.PageCount;
        for (int page = 0; page < pageCount; page++)
        {
            string Text = pdfReader.ExtractTextFromPage(page);
            Console.WriteLine(Text);
        }    
    }
}
using IronPdf;
Console.WriteLine("Comparison of IronPDF And itextsharp Read PDF Files in C#");
// pdfreader reader
ReadUsingIronPDF.Read();
public class ReadUsingIronPDF
{
    public static void Read()
    {
        // read from specific location
        string filename = "C:\\code\\articles\\ITextSharp\\ITextSharpIronPdfDemo\\Example.pdf";
        var pdfReader = PdfDocument.FromFile(filename);
        // Get all text to put in a search index using new simpletextextractionstrategy
        var allText = pdfReader.ExtractAllText();
        Console.WriteLine("------------------Text From PDF-----------------");
        Console.WriteLine(allText);
        Console.WriteLine("------------------Text From PDF-----------------");
        var allIMages = pdfReader.ExtractAllImages();
        Console.WriteLine("------------------Image Count From PDF-----------------");
        Console.WriteLine($"Total Images={allIMages.Count()}");
        Console.WriteLine("------------------Image Count From PDF-----------------");
        Console.WriteLine("------------------one Page Text From PDF page-----------------");
        var pageCount = pdfReader.PageCount;
        for (int page = 0; page < pageCount; page++)
        {
            string Text = pdfReader.ExtractTextFromPage(page);
            Console.WriteLine(Text);
        }    
    }
}
Imports IronPdf
Console.WriteLine("Comparison of IronPDF And itextsharp Read PDF Files in C#")
' pdfreader reader
ReadUsingIronPDF.Read()
'INSTANT VB TODO TASK: Local functions are not converted by Instant VB:
'public class ReadUsingIronPDF
'{
'	public static void Read()
'	{
'		' read from specific location
'		string filename = "C:\code\articles\ITextSharp\ITextSharpIronPdfDemo\Example.pdf";
'		var pdfReader = PdfDocument.FromFile(filename);
'		' Get all text to put in a search index using new simpletextextractionstrategy
'		var allText = pdfReader.ExtractAllText();
'		Console.WriteLine("------------------Text From PDF-----------------");
'		Console.WriteLine(allText);
'		Console.WriteLine("------------------Text From PDF-----------------");
'		var allIMages = pdfReader.ExtractAllImages();
'		Console.WriteLine("------------------Image Count From PDF-----------------");
'		Console.WriteLine(string.Format("Total Images={0}", allIMages.Count()));
'		Console.WriteLine("------------------Image Count From PDF-----------------");
'		Console.WriteLine("------------------one Page Text From PDF page-----------------");
'		var pageCount = pdfReader.PageCount;
'		for (int page = 0; page < pageCount; page++)
'		{
'			string Text = pdfReader.ExtractTextFromPage(page);
'			Console.WriteLine(Text);
'		}
'	}
'}
VB   C#

Code Explanation

  1. To create a text PDF create a Word document and add the above text to the Word document and save it as a PDF document named Example.pdf
  2. In the code, we create a PDFReader from the PDF file path and extract all text
  3. The images in PDF can be extracted using the ExtractImages method
  4. Each page in PDF documents can be read using the ExtractTextFromPage method

Output

iTextSharp Read PDF Alternatives (Developer Tutorial): Figure 9 - Read PDF Using IronPDF Output

Step 3: Read Text from a PDF file using iTextSharp

Now to compare the read text from iTextSharp add the below code to the same program.cs file. For simplicity, we have not separated the classes into different files.

using IronPdf;
using iText.Kernel.Pdf;
using iText.Kernel.Pdf.Canvas.Parser.Listener;
using iText.Kernel.Pdf.Canvas.Parser;
Console.WriteLine("Comparison of IronPDF And Itextsharp Read PDF Files in C#");
//ReadUsingIronPDF.Read();
ReadUsingITextSharp.Read();
public class ReadUsingITextSharp
{
    public static void Read()
    {
        string pdfFile = "C:\\code\\articles\\ITextSharp\\ITextSharpIronPdfDemo\\Example.pdf";
        // Create a PDF reader
        PdfReader pdfReader = new PdfReader(pdfFile);
        iText.Kernel.Pdf.PdfDocument pdfDocument = new iText.Kernel.Pdf.PdfDocument(pdfReader);
        // Extract plain text from the PDF
        LocationTextExtractionStrategy strategy = new LocationTextExtractionStrategy();
        string pdfText = PdfTextExtractor.GetTextFromPage(pdfDocument.GetPage(1), strategy);
        // Display or manipulate the extracted text as needed
        Console.WriteLine(pdfText);
    }
}
using IronPdf;
using iText.Kernel.Pdf;
using iText.Kernel.Pdf.Canvas.Parser.Listener;
using iText.Kernel.Pdf.Canvas.Parser;
Console.WriteLine("Comparison of IronPDF And Itextsharp Read PDF Files in C#");
//ReadUsingIronPDF.Read();
ReadUsingITextSharp.Read();
public class ReadUsingITextSharp
{
    public static void Read()
    {
        string pdfFile = "C:\\code\\articles\\ITextSharp\\ITextSharpIronPdfDemo\\Example.pdf";
        // Create a PDF reader
        PdfReader pdfReader = new PdfReader(pdfFile);
        iText.Kernel.Pdf.PdfDocument pdfDocument = new iText.Kernel.Pdf.PdfDocument(pdfReader);
        // Extract plain text from the PDF
        LocationTextExtractionStrategy strategy = new LocationTextExtractionStrategy();
        string pdfText = PdfTextExtractor.GetTextFromPage(pdfDocument.GetPage(1), strategy);
        // Display or manipulate the extracted text as needed
        Console.WriteLine(pdfText);
    }
}
Imports IronPdf
Imports iText.Kernel.Pdf
Imports iText.Kernel.Pdf.Canvas.Parser.Listener
Imports iText.Kernel.Pdf.Canvas.Parser
Console.WriteLine("Comparison of IronPDF And Itextsharp Read PDF Files in C#")
'ReadUsingIronPDF.Read();
ReadUsingITextSharp.Read()
'INSTANT VB TODO TASK: Local functions are not converted by Instant VB:
'public class ReadUsingITextSharp
'{
'	public static void Read()
'	{
'		string pdfFile = "C:\code\articles\ITextSharp\ITextSharpIronPdfDemo\Example.pdf";
'		' Create a PDF reader
'		PdfReader pdfReader = New PdfReader(pdfFile);
'		iText.Kernel.Pdf.PdfDocument pdfDocument = New iText.Kernel.Pdf.PdfDocument(pdfReader);
'		' Extract plain text from the PDF
'		LocationTextExtractionStrategy strategy = New LocationTextExtractionStrategy();
'		string pdfText = PdfTextExtractor.GetTextFromPage(pdfDocument.GetPage(1), strategy);
'		' Display or manipulate the extracted text as needed
'		Console.WriteLine(pdfText);
'	}
'}
VB   C#

Output

iTextSharp Read PDF Alternatives (Developer Tutorial): Figure 10 - Read PDF using iTextSharp Output

iTextSharp Limitations

  1. Learning Curve: iTextSharp has a steeper learning curve, especially for beginners.
  2. Licensing: iTextSharp's licensing model may not be suitable for all projects, especially those with budget constraints.

IronPDF Benefits

  1. Ease of Use: IronPDF is known for its straightforward API, making it easy for developers to get started.
  2. Document Rendering: IronPDF provides accurate rendering of PDF documents, ensuring that the extracted text is faithful to the original.

Licensing (Free Trial Available)

To use IronPDF. Insert this key into the appsettings.json file.

"IronPdf.LicenseKey": "your license key"

To receive a trial license, please provide your email.

Conclusion

Choosing between IronPDF and iTextSharp depends on the specific requirements of your project. If you need a straightforward and easy-to-use library for common PDF operations, IronPDF might be the better choice. Consider factors like your application's complexity, budget, and the learning curve when making your decision.
IronPDF is designed to seamlessly integrate PDF generation into your application, effortlessly handling the conversion of formatted documents into PDFs. This versatile tool allows you to convert web forms, local HTML pages, and other web content to PDF using .NET. Users can conveniently download, email, or store documents in the cloud. Whether you need to produce invoices, quotes, reports, contracts, or other professional documents, IronPDF has you covered. Elevate your application with IronPDF's intuitive and efficient PDF generation capabilities.