.NET 幫助

Parseint C＃（對開發人員的工作原理）

Name: IronPDF
Brand: Iron Software
Availability: InStock
Rating: 4.87 (307 reviews)

奇佩戈·卡林达

2025年1月14日

在使用 C# 處理資料時，開發人員經常需要將數字的文本表示轉換為整數。這個被稱為「解析整數」的任務，對於各種應用來說都是至關重要的，從處理使用者輸入到從像PDF這樣的文件中提取數據。儘管 C# 提供強大的解析整數的方法，但在處理如 PDF 中的非結構化或半結構化數據時，過程可能變得更加複雜。

這就是IronPDF，一個供 .NET 開發人員使用的強大 PDF 庫，發揮作用的地方。使用 IronPDF，您可以從 PDF 中提取文本，並利用 C# 的解析功能將這些文本轉換為可用的數據。無論您是在分析發票、報告還是表格，結合 C# 的解析工具與 IronPDF 可以簡化 PDF 資料的處理，讓您能夠將字串格式的數字轉換為整數。

在本文中，我們將深入探討如何在 C# 中使用 ParseInt 將數字的字串表示轉換為整數，以及 IronPDF 如何簡化從 PDF 中提取和解析數據的過程。

在 C# 中，什麼是 ParseInt？

解析整數的基礎知識

在 C# 中，將字串值（例如 "123"）轉換為整數通常使用 int.Parse() 或 Convert.ToInt32()。這些方法幫助開發人員將文字數據轉換為可用於計算和驗證的數值。

int.Parse(string s)：將字串轉換為整數。如果字串不是有效的整數，則拋出例外。
Convert.ToInt32(string s)：將字串轉換為整數，並以不同方式處理 null 輸入。
以下是一個使用 int.Parse() 轉換字串的範例：

string numberString = "123";
int num = int.Parse(numberString);
Console.WriteLine(num); // Output: 123

string numberString = "123";
int num = int.Parse(numberString);
Console.WriteLine(num); // Output: 123

Dim numberString As String = "123"
Dim num As Integer = Integer.Parse(numberString)
Console.WriteLine(num) ' Output: 123

$vbLabelText $csharpLabel

或者，使用 Convert 類別：

string numericString = "123";
int i = Convert.ToInt32(numericString);
Console.WriteLine(result); // Outputs: 123

string numericString = "123";
int i = Convert.ToInt32(numericString);
Console.WriteLine(result); // Outputs: 123

Dim numericString As String = "123"
Dim i As Integer = Convert.ToInt32(numericString)
Console.WriteLine(result) ' Outputs: 123

$vbLabelText $csharpLabel

Convert 類別允許您安全地轉換字串和其他資料類型。當字串變數可能表示 null 或無效值時，這特別有用，因為 Convert.ToInt32() 會返回預設值（此例中為 0），而不是拋出例外。

預設值和錯誤處理

開發人員在將字串轉換為整數時經常面臨的一個問題是處理無效或非數字的輸入。如果數字的字串表示形式格式不正確，像 int.Parse() 這樣的方法將會拋出異常。然而，Convert.ToInt32() 具有一個內建的回退機制，用於無效字串。

以下是一個示例，說明在解析時如何處理預設值：

string invalidString = "abc";
int result = Convert.ToInt32(invalidString); // Returns 0 (default value) instead of throwing an error.
Console.WriteLine(result); // Outputs: 0

string invalidString = "abc";
int result = Convert.ToInt32(invalidString); // Returns 0 (default value) instead of throwing an error.
Console.WriteLine(result); // Outputs: 0

Dim invalidString As String = "abc"
Dim result As Integer = Convert.ToInt32(invalidString) ' Returns 0 (default value) instead of throwing an error.
Console.WriteLine(result) ' Outputs: 0

$vbLabelText $csharpLabel

如果您想要更精確地轉換字符串，可以使用int.TryParse()，它返回一個布林值以指示轉換是否成功：

string invalidInput = "abc";
if (int.TryParse(invalidInput, out int result))
{
    Console.WriteLine(result);
}
else
{
    Console.WriteLine("Parsing failed.");
}

string invalidInput = "abc";
if (int.TryParse(invalidInput, out int result))
{
    Console.WriteLine(result);
}
else
{
    Console.WriteLine("Parsing failed.");
}

Dim invalidInput As String = "abc"
Dim result As Integer
If Integer.TryParse(invalidInput, result) Then
	Console.WriteLine(result)
Else
	Console.WriteLine("Parsing failed.")
End If

$vbLabelText $csharpLabel

在這種情況下，TryParse() 使用 out 參數來存儲轉換後的整數，這樣即使轉換失敗，方法也能返回一個值而不拋出異常，若轉換失敗則會執行 else 語句，而不是簡單地使程式崩潰。否則，程序將顯示成功解析的輸入字串中的數字結果。使用 int.TryParse 在轉換失敗可能預期的情況下很有幫助，並且您想避免程式崩潰。

使用 IronPDF 解析 PDF 中的數據

為什麼使用 IronPDF 來解析數據？

Parseint C#（對開發人員的工作原理）：圖 1

在處理PDF文件時，您可能會遇到包含字串形式數據的表格或非結構化文本。要提取和處理這些數據，將字串轉換為整數是至關重要的。 IronPDF 使這個過程變得簡單，提供了靈活性和強大的功能來讀取 PDF 內容，並執行將字串轉換為數值等操作。

以下是IronPDF提供的一些主要功能：

HTML 轉 PDF 轉換：IronPDF 可以將HTML內容（包括 CSS、圖像和 JavaScript）轉換成完整格式的 PDF。這對於將動態網頁或報告渲染為PDF特別有用。
PDF 編輯：使用 IronPDF，您可以通過新增文字、圖片和圖形來操作現有的 PDF 文件，亦可編輯現有頁面的內容。
文字及圖片擷取：此工具庫允許您從PDF中擷取文字及圖片，使解析和分析PDF內容變得簡單。
水印：您也可以為 PDF 文件添加水印以進行品牌推廣或版權保護。

入門 IronPDF

要開始使用IronPDF，您首先需要安裝它。如果已經安裝，則可以跳到下一部分。否則，以下步驟將介紹如何安裝IronPDF庫。

透過 NuGet 套件管理器主控台

若要使用 NuGet 套件管理器主控台安裝 IronPDF，請開啟 Visual Studio 並導航至套件管理器主控台。然後執行以下命令：

Install-Package IronPdf

Install-Package IronPdf

'INSTANT VB TODO TASK: The following line uses invalid syntax:
'Install-Package IronPdf

$vbLabelText $csharpLabel

透過 NuGet 封裝管理器為方案進行操作

打開 Visual Studio，前往「工具 -> NuGet 套件管理員 -> 為方案管理 NuGet 套件」並搜尋 IronPDF。從這裡開始，您只需選擇您的專案並點擊「安裝」，IronPDF 就會被添加到您的專案中。

Parseint C#（對開發者來說是如何運作的）：圖2

安裝 IronPDF 後，您只需在程式碼的頂部新增正確的 using 語句即可開始使用 IronPDF：

using IronPdf;

using IronPdf;

Imports IronPdf

$vbLabelText $csharpLabel

解鎖免費試用

IronPDF 提供免費試用，可完整使用其功能。訪問IronPDF 網站下載試用版，開始將先進的 PDF 處理集成到您的 .NET 專案中。

範例：從 PDF 中提取並解析數字

以下 C# 程式碼演示如何使用 IronPDF 從 PDF 中提取文本，然後使用正則表達式在提取的文本中查找和解析所有數值。該程式碼處理整數和小數，清除貨幣符號等非數字字符。

using IronPdf;
using System.Text.RegularExpressions;
public class Program
{
    public static void Main(string[] args)
    {
        // Load a PDF file
        PdfDocument pdf = PdfDocument.FromFile("example.pdf");
        // Extract all text from the PDF
        string text = pdf.ExtractAllText();
        // Print the extracted text (for reference)
        Console.WriteLine("Extracted Text: ");
        Console.WriteLine(text);
        // Parse and print all numbers found in the extracted text
        Console.WriteLine("\nParsed Numbers:");
        // Use regular expression to find all number patterns, including integers and decimals
        var numberMatches = Regex.Matches(text, @"\d+(\.\d+)?");
        // Iterate through all matched numbers and print them
        foreach (Match match in numberMatches)
        {
            // Print each matched number
            Console.WriteLine($"{match.Value}");
        }
    }
}

using IronPdf;
using System.Text.RegularExpressions;
public class Program
{
    public static void Main(string[] args)
    {
        // Load a PDF file
        PdfDocument pdf = PdfDocument.FromFile("example.pdf");
        // Extract all text from the PDF
        string text = pdf.ExtractAllText();
        // Print the extracted text (for reference)
        Console.WriteLine("Extracted Text: ");
        Console.WriteLine(text);
        // Parse and print all numbers found in the extracted text
        Console.WriteLine("\nParsed Numbers:");
        // Use regular expression to find all number patterns, including integers and decimals
        var numberMatches = Regex.Matches(text, @"\d+(\.\d+)?");
        // Iterate through all matched numbers and print them
        foreach (Match match in numberMatches)
        {
            // Print each matched number
            Console.WriteLine($"{match.Value}");
        }
    }
}

Imports Microsoft.VisualBasic
Imports IronPdf
Imports System.Text.RegularExpressions
Public Class Program
	Public Shared Sub Main(ByVal args() As String)
		' Load a PDF file
		Dim pdf As PdfDocument = PdfDocument.FromFile("example.pdf")
		' Extract all text from the PDF
		Dim text As String = pdf.ExtractAllText()
		' Print the extracted text (for reference)
		Console.WriteLine("Extracted Text: ")
		Console.WriteLine(text)
		' Parse and print all numbers found in the extracted text
		Console.WriteLine(vbLf & "Parsed Numbers:")
		' Use regular expression to find all number patterns, including integers and decimals
		Dim numberMatches = Regex.Matches(text, "\d+(\.\d+)?")
		' Iterate through all matched numbers and print them
		For Each match As Match In numberMatches
			' Print each matched number
			Console.WriteLine($"{match.Value}")
		Next match
	End Sub
End Class

$vbLabelText $csharpLabel

輸入 PDF

Parseint C#（開發人員如何使用）：圖3

控制台輸出

Parseint C# （如何運作給開發者使用）：圖 4

程式碼說明：

從 PDF 提取文本：
該程式碼首先使用IronPDF加載PDF檔案。然後從 PDF 中提取所有文字。
使用正則表達式尋找數字：
此代碼使用正則表達式（用於匹配文本的模式）搜尋提取的文本並查找任何數字。正則表達式同時尋找整數（例如，12345）和小數（例如，50.75）。
解析和打印數字：
一旦找到這些數字，程式會將每個數字打印到控制台。這包括整數和小數。
為什麼使用正則表達式：
正規表示式被使用是因為它們是尋找文本中模式（如數字）的強大工具。它們可以處理帶有符號的數字（如貨幣符號 $），使過程更加靈活。

常見挑戰及IronPDF的解決方案

從複雜的 PDF 結構中提取乾淨數據通常會產生可能需要進一步處理的字串值，例如將字串轉換為整數。以下是一些常見的挑戰，以及IronPDF如何提供幫助：

PDF中的格式錯誤

PDF 文件通常包含格式為文字的數字（例如，"1,234.56" 或 "12,345 USD"）。要正確處理這些內容，需要確保數字的字串表示形式是正確的解析格式。 IronPDF 允許您乾淨地提取文本，並且您可以使用字串操作方法（例如：Replace()）在轉換前調整格式。

範例：

string formattedNumber = "1,234.56"; // String value with commas
string cleanNumber = formattedNumber.Replace(",", ""); // Remove commas
int result = Convert.ToInt32(Convert.ToDouble(cleanNumber)); // Convert to integer
Console.WriteLine(result); // Outputs: 1234

string formattedNumber = "1,234.56"; // String value with commas
string cleanNumber = formattedNumber.Replace(",", ""); // Remove commas
int result = Convert.ToInt32(Convert.ToDouble(cleanNumber)); // Convert to integer
Console.WriteLine(result); // Outputs: 1234

Dim formattedNumber As String = "1,234.56" ' String value with commas
Dim cleanNumber As String = formattedNumber.Replace(",", "") ' Remove commas
Dim result As Integer = Convert.ToInt32(Convert.ToDouble(cleanNumber)) ' Convert to integer
Console.WriteLine(result) ' Outputs: 1234

$vbLabelText $csharpLabel

在文本中處理多個數值

在複雜的 PDF 中，數值可能會以不同的格式出現，或分散在不同的位置。使用 IronPDF，您可以提取所有文本，然後使用正則表達式高效地查找和將字串轉換為整數。

結論

在 C# 中解析整數是開發人員的一項基本技能，特別是在處理用戶輸入或從各種來源提取數據時。雖然內建的方法如 int.Parse() 和 Convert.ToInt32() 很有用，但處理像是 PDF 中找到的文字這樣的非結構化或半結構化資料時，可能會帶來額外的挑戰。這就是 IronPDF 發揮作用的地方，提供了一種強大且簡單的方法，用於從 PDF 中提取文字並在 .NET 應用程式中使用。

通過使用IronPDF，您可以輕鬆地從包括掃描文件在內的複雜PDF中提取文本，並將這些數據轉換為可用的數值。借助掃描 PDF 的 OCR 功能和強大的文本提取工具，IronPDF 使您能夠簡化數據處理，即使是在具有挑戰性的格式中。

無論您正在處理發票、財務報告或其他包含數據的文檔，將 C# 的 ParseInt 方法與 IronPDF 結合使用，將幫助您更高效且準確地工作。

不要讓複雜的PDF文件拖慢您的開發過程—開始使用IronPDF是探索IronPDF如何提升您的工作流程的完美機會，為什麼不試試看它如何簡化您的下一個項目呢？

奇佩戈·卡林达

立即與工程團隊聊天

軟體工程師

Chipego 擁有天生的傾聽技能，這幫助他理解客戶問題，並提供智能解決方案。他在獲得信息技術理學學士學位後，于 2023 年加入 Iron Software 團隊。IronPDF 和 IronOCR 是 Chipego 專注的兩個產品，但隨著他每天找到新的方法來支持客戶，他對所有產品的了解也在不斷增長。他喜歡在 Iron Software 的協作生活，公司內的團隊成員從各自不同的經歷中共同努力，創造出有效的創新解決方案。當 Chipego 離開辦公桌時，他常常享受讀好書或踢足球的樂趣。

< 上一頁
C# 時間跨度格式（開發人員如何使用）

下一個 >
C# MySQL 連線（開發人員的運作方式）