Class PageText
Text content for a single page
Contains text extracted from a single page of a PDF document.
Includes both the raw text and positioned lines for layout reconstruction.
Inheritance
System.Object
PageText
Namespace: IronPdf.Extractions
Assembly: IronPdf.dll
Syntax
public class PageText : Object
Constructors
PageText()
Declaration
public PageText()
Properties
PageBounds
Physical dimensions of the page
Declaration
public < 7 b56d8b3 - e75c - 452 b - af2b - e32243b44b60 > PdfRectangle PageBounds { get; }
Property Value
| Type | Description |
|---|---|
| <7b56d8b3-e75c-452b-af2b-e32243b44b60>PdfRectangle |
PageNumber
Page number (1-based)
Declaration
public int PageNumber { get; }
Property Value
| Type | Description |
|---|---|
| System.Int32 |
PositionedLines
List of positioned text lines
Each item contains the Y and X coordinates of the line and the line text.
Used for layout reconstruction and ordering of text content.
Declaration
public List<(double, double, string)> PositionedLines { get; }
Property Value
| Type | Description |
|---|---|
| System.Collections.Generic.List<System.ValueTuple<System.Double, System.Double, System.String>> |
Text
Text content of the page
Contains the raw text extracted from the page.
Declaration
public string Text { get; }
Property Value
| Type | Description |
|---|---|
| System.String |