Invoice Processing in C#: Generate, Extract, and Automate PDF Invoices with .NET
Przetwarzanie faktur w C# .NET z IronPDF obejmuje pełny cykl życia dokumentu: generowanie profesjonalnych faktur PDF z szablonów HTML, zgodność z normami faktur elektronicznych ZUGFeRD i Factur-X, ekstrakcję danych strukturalnych z otrzymanych faktur z wykorzystaniem parsingu tekstu oraz przetwarzania zasilanego przez AI, oraz tworzenie potoków automatyzacji wsadowej integrujących się z systemami księgowymi, jak QuickBooks, Xero i SAP.
TL;DR: Przewodnik Quickstart
Ten samouczek obejmuje generowanie, ekstrakcję i automatyzację faktur PDF w C# .NET, w tym zgodność z fakturami elektronicznymi, parsowanie wspomagane przez AI oraz integrację systemów księgowych.
- Dla kogo: Dla deweloperów .NET budujących moduły fakturowania, automatyzację rachunków do zapłacenia lub zgodność z fakturami elektronicznymi.
- Co zbudujesz: Generowanie faktur z szablonów HTML z pozycjami i obliczeniami podatkowymi, kody QR dla linków do płatności, zgodne z ZUGFeRD/Factur-X pliki PDF/A-3, ekstrakcję tekstu z regex, parsowanie faktur zasilane przez AI oraz przetwarzanie wsadowe z integracją systemów księgowych.
- Gdzie to działa: .NET 10, .NET 8 LTS, .NET Framework 4.6.2+ i .NET Standard 2.0. Bez zależności od zewnętrznych usług.
- Kiedy stosować to podejście: Kiedy zachodzi potrzeba generowania faktur PDF, spełniania unijnych wymagań dotyczących fakturowania elektronicznego lub wyodrębniania danych z faktur dostawcy dla rachunków do zapłacenia.
- Dlaczego to ważne technicznie: IronPDF renderuje HTML do PDF z dokładnością do pikseli, obsługuje PDF/A-3 dla osadzonego XML oraz dostarcza API do ekstrakcji tekstu, które można łączyć z regex lub AI, aby przekształcać niestrukturalne faktury w dane strukturalne.
Wygeneruj pierwszą fakturę PDF przy użyciu zaledwie kilku linii kodu:
-
Install IronPDF with NuGet Package Manager
PM > Install-Package IronPdf -
Skopiuj i uruchom ten fragment kodu.
var renderer = new IronPdf.ChromePdfRenderer(); var pdf = renderer.RenderHtmlAsPdf("<h1>Invoice #1001</h1><p>Total: $500.00</p>"); pdf.SaveAs("invoice.pdf"); -
Wdrożenie do testowania w środowisku produkcyjnym
Rozpocznij używanie IronPDF w swoim projekcie już dziś z darmową wersją próbną
Po zakupie lub zapisaniu się na 30-dniowy okres próbny IronPDF, dodaj swój klucz licencyjny na początku aplikacji.
IronPdf.License.LicenseKey = "KEY";
IronPdf.License.LicenseKey = "KEY";
Imports IronPdf
IronPdf.License.LicenseKey = "KEY"
Rozpocznij używanie IronPDF w swoim projekcie już dziś dzięki darmowej wersji próbnej.
Spis treści
- TL;DR: Przewodnik Szybkiego Startu
- Generowanie Profesjonalnych Faktur PDF
- Spełniaj standardy faktur elektronicznych
- Wyodrębnij dane z faktur PDF
- Przetwarzanie faktur zasilane przez AI
- Integracja z systemami księgowymi
Czym jest cykl życia faktury i dlaczego PDF pozostaje standardem?
Przed zagłębianiem się w kod, warto zrozumieć pełną podróż, jaką faktura odbywa w nowoczesnym systemie biznesowym. Cykl życia faktury składa się z pięciu odrębnych faz: generowanie, dystrybucja, odbiór, ekstrakcja danych i integracja księgowa.
Proces fakturowania zaczyna się od generowania. Firma tworzy fakturę zawierającą pozycje, ceny, obliczenia podatków, warunki płatności i elementy brandingowe. Faktura musi wyglądać profesjonalnie i spełniać wszystkie wymagania prawne. Następnie następuje dystrybucja, gdzie faktura jest wysyłana do klienta e-mailem, poprzez portal klienta lub tradycyjną pocztą. Gdy klient ja otrzymuje, zespół zajmujący się rozrachunkami do zapłacenia rejestruje dokument i przygotowuje go do przetwarzania. Ekstrakcja danych wyciąga kluczowe informacje z faktury, takie jak dane dostawcy, pozycje, sumy i terminy płatności, aby można było je porównać i dopasować do zamówień zakupu. Na koniec, integracja księgowa przenosi te dane do systemów finansowych, takich jak QuickBooks, Xero czy SAP, w celu dokonania płatności i archiwizacji.
Dlaczego nadal PDF jest najszerzej używanym formatem po tylu latach? Chodzi o unikalne połączenie zalet. PDF-y utrzymują spójność formatowania twojej faktury, niezależnie od urządzenia czy systemu operacyjnego, z którego korzystasz. Czy ktoś otworzy twoją fakturę na Windowsie, Macu, czy na telefonie, wygląda ona dokładnie tak, jak ją zaprojektowałeś. PDF-y są również trudniejsze do przypadkowych zmian, co lepiej chroni integralność twoich dokumentów w porównaniu z formatami takimi jak Word lub Excel. Możesz dodać podpisy cyfrowe dla autentyczności i użyć szyfrowania dla bezpieczeństwa. Najważniejsze, PDF-y stały się uniwersalnym standardem, który każdy system biznesowy uznaje i obsługuje.
Oczywiście jest pewne wyzwanie. PDF-y są zaprojektowane tak, aby były łatwe do czytania przez ludzi, a nie do przetwarzania przez komputery. Zamiast przechowywać informacje w strukturze danych, PDF zapisuje teksty, linie, kształty i obrazy na podstawie ich położenia na stronie. Dlatego narzędzia takie jak IronPDF są tak pomocne: umożliwiają przekształcenie dokumentów przyjaznych dla ludzi w dane, z którymi oprogramowanie może pracować.
Jak generować profesjonalne faktury PDF w C
Generowanie faktur programowo wymaga przekształcenia danych strukturalnych, takich jak informacje o kliencie, pozycje i obliczenia, w elegancki dokument PDF. IronPDF czyni to niezwykle prostym, wykorzystując HTML i CSS, technologie, które większość programistów już dobrze zna.
W tym samouczku przejdziemy przez scenariusze, które prawdopodobnie napotkasz w rzeczywistości. Można również pobrać projekt pokazany poniżej tutaj.
Jak zbudować szablon HTML faktury
Fundamentem generowania faktur z IronPDF jest HTML. Zamiast zmagać się z niskopoziomowymi poleceniami rysowania PDF, projektujesz swoja fakture uzywajac standardowego HTML i CSS, a potem pozwalasz Chrome-owi IronPDF przekształcić ją w perfekcyjny PDF.
Oto podstawowy szablon faktury demonstrujący to podejście:
:path=/static-assets/pdf/content-code-examples/tutorials/csharp-invoice-processing/basic-invoice-template.cs
using IronPdf;
// Define the HTML template for a basic invoice
// Uses inline CSS for styling headers, tables, and totals
string invoiceHtml = @"
E html>
le>
body { font-family: Arial, sans-serif; padding: 40px; }
.header { text-align: right; margin-bottom: 40px; }
.company-name { font-size: 24px; font-weight: bold; color: #333; }
.invoice-title { font-size: 32px; margin: 20px 0; }
.bill-to { margin: 20px 0; }
table { width: 100%; border-collapse: collapse; margin: 20px 0; }
th { background-color: #2A95D5; color: white; padding: 10px; text-align: left; }
td { padding: 10px; border-bottom: 1px solid #ddd; }
.total { text-align: right; font-size: 20px; font-weight: bold; margin-top: 20px; }
yle>
class='header'>
<div class='company-name'>Your Company Name</div>
<div>123 Business Street</div>
<div>City, State 12345</div>
v>
class='invoice-title'>INVOICE</div>
class='bill-to'>
<strong>Bill To:</strong><br>
Customer Name<br>
456 Customer Avenue<br>
City, State 67890
v>
le>
<tr>
<th>Description</th>
<th>Quantity</th>
<th>Price</th>
<th>Total</th>
</tr>
<tr>
<td>Web Development Services</td>
<td>10 hours</td>
<td>$100.00</td>
<td>$1,000.00</td>
</tr>
<tr>
<td>Consulting</td>
<td>5 hours</td>
<td>$150.00</td>
<td>$750.00</td>
</tr>
ble>
class='total'>Total: $1,750.00</div>
;
// Initialize the Chrome-based PDF renderer
var renderer = new ChromePdfRenderer();
// Convert the HTML string to a PDF document
var pdf = renderer.RenderHtmlAsPdf(invoiceHtml);
// Save the generated PDF to disk
pdf.SaveAs("basic-invoice.pdf");
Imports IronPdf
' Define the HTML template for a basic invoice
' Uses inline CSS for styling headers, tables, and totals
Dim invoiceHtml As String = "
<!DOCTYPE html>
<html>
<head>
<style>
body { font-family: Arial, sans-serif; padding: 40px; }
.header { text-align: right; margin-bottom: 40px; }
.company-name { font-size: 24px; font-weight: bold; color: #333; }
.invoice-title { font-size: 32px; margin: 20px 0; }
.bill-to { margin: 20px 0; }
table { width: 100%; border-collapse: collapse; margin: 20px 0; }
th { background-color: #2A95D5; color: white; padding: 10px; text-align: left; }
td { padding: 10px; border-bottom: 1px solid #ddd; }
.total { text-align: right; font-size: 20px; font-weight: bold; margin-top: 20px; }
</style>
</head>
<body>
<div class='header'>
<div class='company-name'>Your Company Name</div>
<div>123 Business Street</div>
<div>City, State 12345</div>
</div>
<div class='invoice-title'>INVOICE</div>
<div class='bill-to'>
<strong>Bill To:</strong><br>
Customer Name<br>
456 Customer Avenue<br>
City, State 67890
</div>
<table>
<tr>
<th>Description</th>
<th>Quantity</th>
<th>Price</th>
<th>Total</th>
</tr>
<tr>
<td>Web Development Services</td>
<td>10 hours</td>
<td>$100.00</td>
<td>$1,000.00</td>
</tr>
<tr>
<td>Consulting</td>
<td>5 hours</td>
<td>$150.00</td>
<td>$750.00</td>
</tr>
</table>
<div class='total'>Total: $1,750.00</div>
</body>
</html>
"
' Initialize the Chrome-based PDF renderer
Dim renderer As New ChromePdfRenderer()
' Convert the HTML string to a PDF document
Dim pdf = renderer.RenderHtmlAsPdf(invoiceHtml)
' Save the generated PDF to disk
pdf.SaveAs("basic-invoice.pdf")
Przykładowy Wynik
To podejście daje ogromną elastyczność. Każdy CSS, który działa w Chrome, będzie działać w wynikowym pliku PDF, w tym nowoczesne funkcje jak flexbox, układy siatki i czcionki niestandardowe. Można nawet używać zewnętrznych arkuszy stylów i obrazów, podając URL-e lub lokalne ścieżki plików.
Jak dodawać dynamiczne pozycje i obliczenia sumaryczne
Rzeczywiste faktury rzadko mają statyczną treść. Musisz uzupełnić pozycje z bazy danych, obliczyć sumy częściowe, zastosować stawki podatku i formatować wartości walutowe. Następujący przykład demonstruje gotowy do produkcji wzorzec dla dynamicznego generowania faktur:
using IronPdf;
using System;
using System.Collections.Generic;
using System.Linq;
// Represents a single line item on an invoice
public class InvoiceLineItem
{
public string Description { get; set; }
public decimal Quantity { get; set; }
public decimal UnitPrice { get; set; }
// Auto-calculates line total from quantity and unit price
public decimal Total => Quantity * UnitPrice;
}
// Represents a complete invoice with customer details and line items
public class Invoice
{
public string InvoiceNumber { get; set; }
public DateTime InvoiceDate { get; set; }
public string CustomerName { get; set; }
public string CustomerAddress { get; set; }
public List<InvoiceLineItem> LineItems { get; set; }
// Computed properties for invoice totals
public decimal Subtotal => LineItems.Sum(item => item.Total);
public decimal TaxRate { get; set; } = 0.08m; // Default 8% tax rate
public decimal Tax => Subtotal * TaxRate;
public decimal Total => Subtotal + Tax;
}
// Generates PDF invoices from Invoice objects using HTML templates
public class InvoiceGenerator
{
public PdfDocument GenerateInvoice(Invoice invoice)
{
// Build HTML table rows dynamically from line items
string lineItemsHtml = string.Join("", invoice.LineItems.Select(item => $@"
<tr>
<td>{item.Description}</td>
<td>{item.Quantity}</td>
<td>${item.UnitPrice:F2}</td>
<td>${item.Total:F2}</td>
</tr>
"));
// Build the complete HTML invoice using string interpolation
// All invoice data is injected into the template dynamically
string invoiceHtml = $@"
<!DOCTYPE html>
<html>
<head>
<style>
body {{ font-family: Arial, sans-serif; padding: 40px; }}
.header {{ text-align: right; margin-bottom: 40px; }}
.company-name {{ font-size: 24px; font-weight: bold; color: #333; }}
.invoice-details {{ margin: 20px 0; }}
table {{ width: 100%; border-collapse: collapse; margin: 20px 0; }}
th {{ background-color: #2A95D5; color: white; padding: 10px; text-align: left; }}
td {{ padding: 10px; border-bottom: 1px solid #ddd; }}
.totals {{ text-align: right; margin-top: 20px; }}
.totals div {{ margin: 5px 0; }}
.grand-total {{ font-size: 20px; font-weight: bold; color: #2A95D5; }}
</style>
</head>
<body>
<div class='header'>
<div class='company-name'>Your Company Name</div>
</div>
<h1>INVOICE</h1>
<div class='invoice-details'>
<strong>Invoice Number:</strong> {invoice.InvoiceNumber}<br>
<strong>Date:</strong> {invoice.InvoiceDate:MMM dd, yyyy}<br>
<strong>Bill To:</strong> {invoice.CustomerName}<br>
{invoice.CustomerAddress}
</div>
<table>
<tr>
<th>Description</th>
<th>Quantity</th>
<th>Unit Price</th>
<th>Total</th>
</tr>
{lineItemsHtml}
</table>
<div class='totals'>
<div>Subtotal: ${invoice.Subtotal:F2}</div>
<div>Tax ({invoice.TaxRate:P0}): ${invoice.Tax:F2}</div>
<div class='grand-total'>Total: ${invoice.Total:F2}</div>
</div>
</body>
</html>";
// Render HTML to PDF and return the document
var renderer = new ChromePdfRenderer();
return renderer.RenderHtmlAsPdf(invoiceHtml);
}
}
using IronPdf;
using System;
using System.Collections.Generic;
using System.Linq;
// Represents a single line item on an invoice
public class InvoiceLineItem
{
public string Description { get; set; }
public decimal Quantity { get; set; }
public decimal UnitPrice { get; set; }
// Auto-calculates line total from quantity and unit price
public decimal Total => Quantity * UnitPrice;
}
// Represents a complete invoice with customer details and line items
public class Invoice
{
public string InvoiceNumber { get; set; }
public DateTime InvoiceDate { get; set; }
public string CustomerName { get; set; }
public string CustomerAddress { get; set; }
public List<InvoiceLineItem> LineItems { get; set; }
// Computed properties for invoice totals
public decimal Subtotal => LineItems.Sum(item => item.Total);
public decimal TaxRate { get; set; } = 0.08m; // Default 8% tax rate
public decimal Tax => Subtotal * TaxRate;
public decimal Total => Subtotal + Tax;
}
// Generates PDF invoices from Invoice objects using HTML templates
public class InvoiceGenerator
{
public PdfDocument GenerateInvoice(Invoice invoice)
{
// Build HTML table rows dynamically from line items
string lineItemsHtml = string.Join("", invoice.LineItems.Select(item => $@"
<tr>
<td>{item.Description}</td>
<td>{item.Quantity}</td>
<td>${item.UnitPrice:F2}</td>
<td>${item.Total:F2}</td>
</tr>
"));
// Build the complete HTML invoice using string interpolation
// All invoice data is injected into the template dynamically
string invoiceHtml = $@"
<!DOCTYPE html>
<html>
<head>
<style>
body {{ font-family: Arial, sans-serif; padding: 40px; }}
.header {{ text-align: right; margin-bottom: 40px; }}
.company-name {{ font-size: 24px; font-weight: bold; color: #333; }}
.invoice-details {{ margin: 20px 0; }}
table {{ width: 100%; border-collapse: collapse; margin: 20px 0; }}
th {{ background-color: #2A95D5; color: white; padding: 10px; text-align: left; }}
td {{ padding: 10px; border-bottom: 1px solid #ddd; }}
.totals {{ text-align: right; margin-top: 20px; }}
.totals div {{ margin: 5px 0; }}
.grand-total {{ font-size: 20px; font-weight: bold; color: #2A95D5; }}
</style>
</head>
<body>
<div class='header'>
<div class='company-name'>Your Company Name</div>
</div>
<h1>INVOICE</h1>
<div class='invoice-details'>
<strong>Invoice Number:</strong> {invoice.InvoiceNumber}<br>
<strong>Date:</strong> {invoice.InvoiceDate:MMM dd, yyyy}<br>
<strong>Bill To:</strong> {invoice.CustomerName}<br>
{invoice.CustomerAddress}
</div>
<table>
<tr>
<th>Description</th>
<th>Quantity</th>
<th>Unit Price</th>
<th>Total</th>
</tr>
{lineItemsHtml}
</table>
<div class='totals'>
<div>Subtotal: ${invoice.Subtotal:F2}</div>
<div>Tax ({invoice.TaxRate:P0}): ${invoice.Tax:F2}</div>
<div class='grand-total'>Total: ${invoice.Total:F2}</div>
</div>
</body>
</html>";
// Render HTML to PDF and return the document
var renderer = new ChromePdfRenderer();
return renderer.RenderHtmlAsPdf(invoiceHtml);
}
}
Imports IronPdf
Imports System
Imports System.Collections.Generic
Imports System.Linq
' Represents a single line item on an invoice
Public Class InvoiceLineItem
Public Property Description As String
Public Property Quantity As Decimal
Public Property UnitPrice As Decimal
' Auto-calculates line total from quantity and unit price
Public ReadOnly Property Total As Decimal
Get
Return Quantity * UnitPrice
End Get
End Property
End Class
' Represents a complete invoice with customer details and line items
Public Class Invoice
Public Property InvoiceNumber As String
Public Property InvoiceDate As DateTime
Public Property CustomerName As String
Public Property CustomerAddress As String
Public Property LineItems As List(Of InvoiceLineItem)
' Computed properties for invoice totals
Public ReadOnly Property Subtotal As Decimal
Get
Return LineItems.Sum(Function(item) item.Total)
End Get
End Property
Public Property TaxRate As Decimal = 0.08D ' Default 8% tax rate
Public ReadOnly Property Tax As Decimal
Get
Return Subtotal * TaxRate
End Get
End Property
Public ReadOnly Property Total As Decimal
Get
Return Subtotal + Tax
End Get
End Property
End Class
' Generates PDF invoices from Invoice objects using HTML templates
Public Class InvoiceGenerator
Public Function GenerateInvoice(invoice As Invoice) As PdfDocument
' Build HTML table rows dynamically from line items
Dim lineItemsHtml As String = String.Join("", invoice.LineItems.Select(Function(item) $"
<tr>
<td>{item.Description}</td>
<td>{item.Quantity}</td>
<td>${item.UnitPrice:F2}</td>
<td>${item.Total:F2}</td>
</tr>
"))
' Build the complete HTML invoice using string interpolation
' All invoice data is injected into the template dynamically
Dim invoiceHtml As String = $"
<!DOCTYPE html>
<html>
<head>
<style>
body {{ font-family: Arial, sans-serif; padding: 40px; }}
.header {{ text-align: right; margin-bottom: 40px; }}
.company-name {{ font-size: 24px; font-weight: bold; color: #333; }}
.invoice-details {{ margin: 20px 0; }}
table {{ width: 100%; border-collapse: collapse; margin: 20px 0; }}
th {{ background-color: #2A95D5; color: white; padding: 10px; text-align: left; }}
td {{ padding: 10px; border-bottom: 1px solid #ddd; }}
.totals {{ text-align: right; margin-top: 20px; }}
.totals div {{ margin: 5px 0; }}
.grand-total {{ font-size: 20px; font-weight: bold; color: #2A95D5; }}
</style>
</head>
<body>
<div class='header'>
<div class='company-name'>Your Company Name</div>
</div>
<h1>INVOICE</h1>
<div class='invoice-details'>
<strong>Invoice Number:</strong> {invoice.InvoiceNumber}<br>
<strong>Date:</strong> {invoice.InvoiceDate:MMM dd, yyyy}<br>
<strong>Bill To:</strong> {invoice.CustomerName}<br>
{invoice.CustomerAddress}
</div>
<table>
<tr>
<th>Description</th>
<th>Quantity</th>
<th>Unit Price</th>
<th>Total</th>
</tr>
{lineItemsHtml}
</table>
<div class='totals'>
<div>Subtotal: ${invoice.Subtotal:F2}</div>
<div>Tax ({invoice.TaxRate:P0}): ${invoice.Tax:F2}</div>
<div class='grand-total'>Total: ${invoice.Total:F2}</div>
</div>
</body>
</html>"
' Render HTML to PDF and return the document
Dim renderer As New ChromePdfRenderer()
Return renderer.RenderHtmlAsPdf(invoiceHtml)
End Function
End Class
Przykładowy Wynik
Klasa Invoice kapsułkuje wszystkie dane faktury z obliczanymi właściwościami dla sumy częściowej, podatku i sumy całkowitej. Generator przekształca te dane w HTML wykorzystując interpolację tekstu, a potem renderuje je do PDF. To rozdzielenie obowiązków sprawia, że kod jest utrzymywalny i testowalny.
Jak dodac elementy brandingowe firmy i znaki wodne do faktur
Profesjonalne faktury potrzebują elementów brandingowych, takich jak logo i, w niektórych przypadkach, znaków wodnych, aby wskazać stan płatności. IronPDF obsługuje zarówno osadzone obrazy w HTML, jak i programowe znakowanie wodne po renderowaniu.
:path=/static-assets/pdf/content-code-examples/tutorials/csharp-invoice-processing/branding-watermarks.cs
using IronPdf;
using IronPdf;
var renderer = new ChromePdfRenderer();
// Invoice HTML template with company logo embedded via URL
// Logo can also be Base64-encoded or a local file path
string htmlWithLogo = @"
E html>
le>
body { font-family: Arial, sans-serif; padding: 40px; }
.logo { width: 200px; margin-bottom: 20px; }
yle>
style='text-align: center;'>
<img src='https://yourcompany.com/logo.png' alt='Company Logo' class='logo' />
v>
INVOICE</h1>
strong>Invoice Number:</strong> INV-2024-001</p>
strong>Total:</strong> $1,250.00</p>
;
// Render the HTML to PDF
var pdf = renderer.RenderHtmlAsPdf(htmlWithLogo);
// Apply a diagonal "UNPAID" watermark to mark invoice status
// 30% opacity keeps the content readable while the watermark is visible
pdf.ApplyWatermark("<h1 style='color: red;'>UNPAID</h1>",
opacity: 30,
rotation: 45,
verticalAlignment: IronPdf.Editing.VerticalAlignment.Middle);
pdf.SaveAs("invoice-with-watermark.pdf");
using IronPdf;
Imports IronPdf
Dim renderer As New ChromePdfRenderer()
' Invoice HTML template with company logo embedded via URL
' Logo can also be Base64-encoded or a local file path
Dim htmlWithLogo As String = "
<!DOCTYPE html>
<html>
<head>
<style>
body { font-family: Arial, sans-serif; padding: 40px; }
.logo { width: 200px; margin-bottom: 20px; }
</style>
</head>
<body>
<div style='text-align: center;'>
<img src='https://yourcompany.com/logo.png' alt='Company Logo' class='logo' />
<h1>INVOICE</h1>
<p><strong>Invoice Number:</strong> INV-2024-001</p>
<p><strong>Total:</strong> $1,250.00</p>
</div>
</body>
</html>
"
' Render the HTML to PDF
Dim pdf = renderer.RenderHtmlAsPdf(htmlWithLogo)
' Apply a diagonal "UNPAID" watermark to mark invoice status
' 30% opacity keeps the content readable while the watermark is visible
pdf.ApplyWatermark("<h1 style='color: red;'>UNPAID</h1>",
opacity:=30,
rotation:=45,
verticalAlignment:=IronPdf.Editing.VerticalAlignment.Middle)
pdf.SaveAs("invoice-with-watermark.pdf")
Przykładowy Wynik
Metoda ApplyWatermark akceptuje zawartość HTML, dając ci pełną kontrolę nad wyglądem znaku wodnego. Możesz dostosować przezroczystość, obrót i pozycjonowanie, aby osiągnąć dokładnie taki wygląd, jaki potrzebujesz. Jest to szczególnie przydatne do oznaczania faktur jako "OPLACONE", "PROJEKT" lub "ANULOWANE" bez przetwarzania całego dokumentu na nowo.
Jak osadzic kody QR w linkach płatnosci
Nowoczesne faktury często zawierają kody QR, które klienci mogą skanować, aby szybko dokonywać płatności. Chociaż IronPDF koncentruje się na generowaniu PDF-ów, współpracuje bezproblemowo z IronQR do tworzenia kodów kreskowych:
:path=/static-assets/pdf/content-code-examples/tutorials/csharp-invoice-processing/qr-code-payment.cs
using IronPdf;
using IronQr;
using IronSoftware.Drawing;
string invoiceNumber = "INV-2026-002";
decimal amount = 1500.00m;
// Create a payment URL with invoice details as query parameters
string paymentUrl = $"https://yourcompany.com/pay?invoice={invoiceNumber}&amount={amount}";
// Generate QR code from the payment URL using IronQR
QrCode qrCode = QrWriter.Write(paymentUrl);
AnyBitmap qrImage = qrCode.Save();
qrImage.SaveAs("payment-qr.png", AnyBitmap.ImageFormat.Png);
// Build invoice HTML with the QR code image embedded
// Customers can scan the QR to pay directly from their phone
string invoiceHtml = $@"
E html>
le>
body {{ font-family: Arial, sans-serif; padding: 40px; }}
.payment-section {{ margin-top: 40px; text-align: center;
border-top: 2px solid #eee; padding-top: 20px; }}
.qr-code {{ width: 150px; height: 150px; }}
yle>
INVOICE {invoiceNumber}</h1>
strong>Amount Due:</strong> ${amount:F2}</p>
class='payment-section'>
<p><strong>Scan to Pay Instantly:</strong></p>
<img src='payment-qr.png' alt='Payment QR Code' class='qr-code' />
<p style='font-size: 12px; color: #666;'>
Or visit: {paymentUrl}
</p>
v>
;
// Convert HTML to PDF and save
var renderer = new ChromePdfRenderer();
var pdf = renderer.RenderHtmlAsPdf(invoiceHtml);
pdf.SaveAs($"invoice-{invoiceNumber}.pdf");
Imports IronPdf
Imports IronQr
Imports IronSoftware.Drawing
Dim invoiceNumber As String = "INV-2026-002"
Dim amount As Decimal = 1500.00D
' Create a payment URL with invoice details as query parameters
Dim paymentUrl As String = $"https://yourcompany.com/pay?invoice={invoiceNumber}&amount={amount}"
' Generate QR code from the payment URL using IronQR
Dim qrCode As QrCode = QrWriter.Write(paymentUrl)
Dim qrImage As AnyBitmap = qrCode.Save()
qrImage.SaveAs("payment-qr.png", AnyBitmap.ImageFormat.Png)
' Build invoice HTML with the QR code image embedded
' Customers can scan the QR to pay directly from their phone
Dim invoiceHtml As String = $"
<!DOCTYPE html>
<html>
<head>
<title>Invoice</title>
<style>
body {{ font-family: Arial, sans-serif; padding: 40px; }}
.payment-section {{ margin-top: 40px; text-align: center; border-top: 2px solid #eee; padding-top: 20px; }}
.qr-code {{ width: 150px; height: 150px; }}
</style>
</head>
<body>
<h1>INVOICE {invoiceNumber}</h1>
<p><strong>Amount Due:</strong> ${amount:F2}</p>
<div class='payment-section'>
<p><strong>Scan to Pay Instantly:</strong></p>
<img src='payment-qr.png' alt='Payment QR Code' class='qr-code' />
<p style='font-size: 12px; color: #666;'>
Or visit: {paymentUrl}
</p>
</div>
</body>
</html>"
' Convert HTML to PDF and save
Dim renderer As New ChromePdfRenderer()
Dim pdf = renderer.RenderHtmlAsPdf(invoiceHtml)
pdf.SaveAs($"invoice-{invoiceNumber}.pdf")
Przykładowy Wynik
Kod QR linkuje bezpośrednio do strony płatności, redukując opory klientów i przyspieszając twój przepływ gotówki. Ten wzorzec działa z każdym dostawcą płatności, który obsługuje inicjowanie płatności oparte na URL.
Jak zachować zgodność ze standardami fakturowania elektronicznego ZUGFeRD i Factur-X w C
Fakturowanie elektroniczne szybko staje się obowiązkowe w całej Europie. Niemcy prowadzily z ZUGFeRD, a Francja poszła ich śladem z Factur-X. Te standardy osadzają maszynowo-czytelne dane XML w fakturach PDF, umożliwiając automatyczne przetwarzanie jednocześnie z utrzymywaniem dokumentow czytelnych dla ludzi. Rozumienie i wdrażanie tych standardów jest coraz bardziej niezbędne dla firm działających na rynkach europejskich.
Czym jest ZUGFeRD i jak działa?
ZUGFeRD (Zentraler User Guide des Forums elektronische Rechnung Deutschland) to niemiecki standard e-fakturowania, który osadza dane faktury jako załącznik pliku XML w dokumencie zgodnym z PDF/A-3. Osadzony XML umożliwia automatyczny drenaż danych bez potrzeby OCR lub parsowania.
Standard definiuje trzy poziomy zgodności, z każdym oferującym coraz bardziej drobiazgowe dane:
- Basic: Zawiera podstawowe dane faktury odpowiednie do prostego zautomatyzowanego przetwarzania
- Comfort: Dodaje szczegółowe informacje umożliwiające w pełni zautomatyzowane przetwarzanie faktur
- Extended: Obejmuje kompleksowe dane dla złożonych scenariuszy biznesowych w różnych branżach
XML śledzi schemat UN/CEFACT Cross-Industry Invoice (CII), który stał się podstawą do standaryzacji europejskiego fakturowania elektronicznego.
Czym jest Factur-X i jak rózni sie od ZUGFeRD?
Factur-X to francuskie wdrożenie tego samego podłożowego standardu. ZUGFeRD 2.0 i Factur-X są technicznie identyczne. Dzielą ten sam schemat XML i profile zgodności bazujące na Europejskiej Normie EN 16931. Różnica sprowadza się do regionalnej nazwy: faktura utworzona według specyfikacji ZUGFeRD będzie ważna w ramach Factur-X, i odwrotnie.
Jak osadzić dane XML w fakturach PDF/A-3
IronPDF dostarcza funkcje załączania potrzebne do tworzenia zgodnych e-faktur. Procedura obejmuje wygenerowanie faktury PDF, stworzenie danych XML według schematu CII oraz osadzenie XML jako załącznika z odpowiednimi konwencjami nazewnictwa:
using System;
using System.Xml.Linq;
// Generates ZUGFeRD-compliant invoices by embedding structured XML data
// ZUGFeRD allows automated processing while keeping a human-readable PDF
public class ZUGFeRDInvoiceGenerator
{
public void GenerateZUGFeRDInvoice(Invoice invoice)
{
// First, create the visual PDF that humans will read
var renderer = new ChromePdfRenderer();
string invoiceHtml = BuildInvoiceHtml(invoice);
var pdf = renderer.RenderHtmlAsPdf(invoiceHtml);
// Define the UN/CEFACT namespaces required by the ZUGFeRD standard
// These are mandatory for compliance with European e-invoicing regulations
XNamespace rsm = "urn:un:unece:uncefact:data:standard:CrossIndustryInvoice:100";
XNamespace ram = "urn:un:unece:uncefact:data:standard:ReusableAggregateBusinessInformationEntity:100";
XNamespace udt = "urn:un:unece:uncefact:data:standard:UnqualifiedDataType:100";
// Build the ZUGFeRD XML structure following the Cross-Industry Invoice schema
var zugferdXml = new XDocument(
new XDeclaration("1.0", "UTF-8", null),
new XElement(rsm + "CrossIndustryInvoice",
new XAttribute(XNamespace.Xmlns + "rsm", rsm.NamespaceName),
new XAttribute(XNamespace.Xmlns + "ram", ram.NamespaceName),
new XAttribute(XNamespace.Xmlns + "udt", udt.NamespaceName),
// Document context identifies which e-invoicing guideline is being followed
new XElement(rsm + "ExchangedDocumentContext",
new XElement(ram + "GuidelineSpecifiedDocumentContextParameter",
new XElement(ram + "ID", "urn:cen.eu:en16931:2017")
)
),
// Core document identification: invoice number, type, and date
new XElement(rsm + "ExchangedDocument",
new XElement(ram + "ID", invoice.InvoiceNumber),
new XElement(ram + "TypeCode", "380"), // 380 = Commercial Invoice per UN/CEFACT
new XElement(ram + "IssueDateTime",
new XElement(udt + "DateTimeString",
new XAttribute("format", "102"),
invoice.InvoiceDate.ToString("yyyyMMdd")
)
)
),
// A complete implementation would include additional sections:
// - Seller information (ram:SellerTradeParty)
// - Buyer information (ram:BuyerTradeParty)
// - Line items (ram:IncludedSupplyChainTradeLineItem)
// - Payment terms (ram:SpecifiedTradePaymentTerms)
// - Tax summaries (ram:ApplicableTradeTax)
// Financial summary with all monetary totals
new XElement(rsm + "SupplyChainTradeTransaction",
new XElement(ram + "ApplicableHeaderTradeSettlement",
new XElement(ram + "InvoiceCurrencyCode", "EUR"),
new XElement(ram + "SpecifiedTradeSettlementHeaderMonetarySummation",
new XElement(ram + "TaxBasisTotalAmount", invoice.Subtotal),
new XElement(ram + "TaxTotalAmount",
new XAttribute("currencyID", "EUR"),
invoice.Tax),
new XElement(ram + "GrandTotalAmount", invoice.Total),
new XElement(ram + "DuePayableAmount", invoice.Total)
)
)
)
)
);
// Save the XML to a temp file for embedding
string xmlPath = $"zugferd-{invoice.InvoiceNumber}.xml";
zugferdXml.Save(xmlPath);
// Attach the XML to the PDF - filename must follow ZUGFeRD conventions
pdf.Attachments.AddFile(xmlPath, "zugferd-invoice.xml", "ZUGFeRD Invoice Data");
// Final PDF contains both visual invoice and machine-readable XML
pdf.SaveAs($"invoice-{invoice.InvoiceNumber}-zugferd.pdf");
}
// Generates simple HTML for the visual portion of the invoice
private string BuildInvoiceHtml(Invoice invoice)
{
return $@"
<!DOCTYPE html>
<html>
<head>
<style>
body {{ font-family: Arial, sans-serif; padding: 40px; }}
h1 {{ color: #333; }}
.zugferd-notice {{
margin-top: 30px; padding: 10px;
background: #f0f0f0; font-size: 11px;
}}
</style>
</head>
<body>
<h1>RECHNUNG / INVOICE</h1>
<p><strong>Rechnungsnummer:</strong> {invoice.InvoiceNumber}</p>
<p><strong>Datum:</strong> {invoice.InvoiceDate:dd.MM.yyyy}</p>
<p><strong>Betrag:</strong> €{invoice.Total:F2}</p>
<div class='zugferd-notice'>
This invoice contains embedded ZUGFeRD data for automated processing.
</div>
</body>
</html>";
}
}
using System;
using System.Xml.Linq;
// Generates ZUGFeRD-compliant invoices by embedding structured XML data
// ZUGFeRD allows automated processing while keeping a human-readable PDF
public class ZUGFeRDInvoiceGenerator
{
public void GenerateZUGFeRDInvoice(Invoice invoice)
{
// First, create the visual PDF that humans will read
var renderer = new ChromePdfRenderer();
string invoiceHtml = BuildInvoiceHtml(invoice);
var pdf = renderer.RenderHtmlAsPdf(invoiceHtml);
// Define the UN/CEFACT namespaces required by the ZUGFeRD standard
// These are mandatory for compliance with European e-invoicing regulations
XNamespace rsm = "urn:un:unece:uncefact:data:standard:CrossIndustryInvoice:100";
XNamespace ram = "urn:un:unece:uncefact:data:standard:ReusableAggregateBusinessInformationEntity:100";
XNamespace udt = "urn:un:unece:uncefact:data:standard:UnqualifiedDataType:100";
// Build the ZUGFeRD XML structure following the Cross-Industry Invoice schema
var zugferdXml = new XDocument(
new XDeclaration("1.0", "UTF-8", null),
new XElement(rsm + "CrossIndustryInvoice",
new XAttribute(XNamespace.Xmlns + "rsm", rsm.NamespaceName),
new XAttribute(XNamespace.Xmlns + "ram", ram.NamespaceName),
new XAttribute(XNamespace.Xmlns + "udt", udt.NamespaceName),
// Document context identifies which e-invoicing guideline is being followed
new XElement(rsm + "ExchangedDocumentContext",
new XElement(ram + "GuidelineSpecifiedDocumentContextParameter",
new XElement(ram + "ID", "urn:cen.eu:en16931:2017")
)
),
// Core document identification: invoice number, type, and date
new XElement(rsm + "ExchangedDocument",
new XElement(ram + "ID", invoice.InvoiceNumber),
new XElement(ram + "TypeCode", "380"), // 380 = Commercial Invoice per UN/CEFACT
new XElement(ram + "IssueDateTime",
new XElement(udt + "DateTimeString",
new XAttribute("format", "102"),
invoice.InvoiceDate.ToString("yyyyMMdd")
)
)
),
// A complete implementation would include additional sections:
// - Seller information (ram:SellerTradeParty)
// - Buyer information (ram:BuyerTradeParty)
// - Line items (ram:IncludedSupplyChainTradeLineItem)
// - Payment terms (ram:SpecifiedTradePaymentTerms)
// - Tax summaries (ram:ApplicableTradeTax)
// Financial summary with all monetary totals
new XElement(rsm + "SupplyChainTradeTransaction",
new XElement(ram + "ApplicableHeaderTradeSettlement",
new XElement(ram + "InvoiceCurrencyCode", "EUR"),
new XElement(ram + "SpecifiedTradeSettlementHeaderMonetarySummation",
new XElement(ram + "TaxBasisTotalAmount", invoice.Subtotal),
new XElement(ram + "TaxTotalAmount",
new XAttribute("currencyID", "EUR"),
invoice.Tax),
new XElement(ram + "GrandTotalAmount", invoice.Total),
new XElement(ram + "DuePayableAmount", invoice.Total)
)
)
)
)
);
// Save the XML to a temp file for embedding
string xmlPath = $"zugferd-{invoice.InvoiceNumber}.xml";
zugferdXml.Save(xmlPath);
// Attach the XML to the PDF - filename must follow ZUGFeRD conventions
pdf.Attachments.AddFile(xmlPath, "zugferd-invoice.xml", "ZUGFeRD Invoice Data");
// Final PDF contains both visual invoice and machine-readable XML
pdf.SaveAs($"invoice-{invoice.InvoiceNumber}-zugferd.pdf");
}
// Generates simple HTML for the visual portion of the invoice
private string BuildInvoiceHtml(Invoice invoice)
{
return $@"
<!DOCTYPE html>
<html>
<head>
<style>
body {{ font-family: Arial, sans-serif; padding: 40px; }}
h1 {{ color: #333; }}
.zugferd-notice {{
margin-top: 30px; padding: 10px;
background: #f0f0f0; font-size: 11px;
}}
</style>
</head>
<body>
<h1>RECHNUNG / INVOICE</h1>
<p><strong>Rechnungsnummer:</strong> {invoice.InvoiceNumber}</p>
<p><strong>Datum:</strong> {invoice.InvoiceDate:dd.MM.yyyy}</p>
<p><strong>Betrag:</strong> €{invoice.Total:F2}</p>
<div class='zugferd-notice'>
This invoice contains embedded ZUGFeRD data for automated processing.
</div>
</body>
</html>";
}
}
Imports System
Imports System.Xml.Linq
' Generates ZUGFeRD-compliant invoices by embedding structured XML data
' ZUGFeRD allows automated processing while keeping a human-readable PDF
Public Class ZUGFeRDInvoiceGenerator
Public Sub GenerateZUGFeRDInvoice(invoice As Invoice)
' First, create the visual PDF that humans will read
Dim renderer = New ChromePdfRenderer()
Dim invoiceHtml As String = BuildInvoiceHtml(invoice)
Dim pdf = renderer.RenderHtmlAsPdf(invoiceHtml)
' Define the UN/CEFACT namespaces required by the ZUGFeRD standard
' These are mandatory for compliance with European e-invoicing regulations
Dim rsm As XNamespace = "urn:un:unece:uncefact:data:standard:CrossIndustryInvoice:100"
Dim ram As XNamespace = "urn:un:unece:uncefact:data:standard:ReusableAggregateBusinessInformationEntity:100"
Dim udt As XNamespace = "urn:un:unece:uncefact:data:standard:UnqualifiedDataType:100"
' Build the ZUGFeRD XML structure following the Cross-Industry Invoice schema
Dim zugferdXml = New XDocument(
New XDeclaration("1.0", "UTF-8", Nothing),
New XElement(rsm + "CrossIndustryInvoice",
New XAttribute(XNamespace.Xmlns + "rsm", rsm.NamespaceName),
New XAttribute(XNamespace.Xmlns + "ram", ram.NamespaceName),
New XAttribute(XNamespace.Xmlns + "udt", udt.NamespaceName),
' Document context identifies which e-invoicing guideline is being followed
New XElement(rsm + "ExchangedDocumentContext",
New XElement(ram + "GuidelineSpecifiedDocumentContextParameter",
New XElement(ram + "ID", "urn:cen.eu:en16931:2017")
)
),
' Core document identification: invoice number, type, and date
New XElement(rsm + "ExchangedDocument",
New XElement(ram + "ID", invoice.InvoiceNumber),
New XElement(ram + "TypeCode", "380"), ' 380 = Commercial Invoice per UN/CEFACT
New XElement(ram + "IssueDateTime",
New XElement(udt + "DateTimeString",
New XAttribute("format", "102"),
invoice.InvoiceDate.ToString("yyyyMMdd")
)
)
),
' A complete implementation would include additional sections:
' - Seller information (ram:SellerTradeParty)
' - Buyer information (ram:BuyerTradeParty)
' - Line items (ram:IncludedSupplyChainTradeLineItem)
' - Payment terms (ram:SpecifiedTradePaymentTerms)
' - Tax summaries (ram:ApplicableTradeTax)
' Financial summary with all monetary totals
New XElement(rsm + "SupplyChainTradeTransaction",
New XElement(ram + "ApplicableHeaderTradeSettlement",
New XElement(ram + "InvoiceCurrencyCode", "EUR"),
New XElement(ram + "SpecifiedTradeSettlementHeaderMonetarySummation",
New XElement(ram + "TaxBasisTotalAmount", invoice.Subtotal),
New XElement(ram + "TaxTotalAmount",
New XAttribute("currencyID", "EUR"),
invoice.Tax),
New XElement(ram + "GrandTotalAmount", invoice.Total),
New XElement(ram + "DuePayableAmount", invoice.Total)
)
)
)
)
)
' Save the XML to a temp file for embedding
Dim xmlPath As String = $"zugferd-{invoice.InvoiceNumber}.xml"
zugferdXml.Save(xmlPath)
' Attach the XML to the PDF - filename must follow ZUGFeRD conventions
pdf.Attachments.AddFile(xmlPath, "zugferd-invoice.xml", "ZUGFeRD Invoice Data")
' Final PDF contains both visual invoice and machine-readable XML
pdf.SaveAs($"invoice-{invoice.InvoiceNumber}-zugferd.pdf")
End Sub
' Generates simple HTML for the visual portion of the invoice
Private Function BuildInvoiceHtml(invoice As Invoice) As String
Return $"
<!DOCTYPE html>
<html>
<head>
<style>
body {{ font-family: Arial, sans-serif; padding: 40px; }}
h1 {{ color: #333; }}
.zugferd-notice {{
margin-top: 30px; padding: 10px;
background: #f0f0f0; font-size: 11px;
}}
</style>
</head>
<body>
<h1>RECHNUNG / INVOICE</h1>
<p><strong>Rechnungsnummer:</strong> {invoice.InvoiceNumber}</p>
<p><strong>Datum:</strong> {invoice.InvoiceDate:dd.MM.yyyy}</p>
<p><strong>Betrag:</strong> €{invoice.Total:F2}</p>
<div class='zugferd-notice'>
This invoice contains embedded ZUGFeRD data for automated processing.
</div>
</body>
</html>"
End Function
End Class
Przykładowy Wynik
Kluczowe aspekty zgodności to użycie poprawnych przestrzeni nazw XML, podążanie za strukturą schematu CII oraz osadzenie XML z odpowiednią nazwą pliku. Kod Typu "380" specyficznie oznacza dokument jako fakturę handlową w standardzie UN/CEFACT.
Jak przygotować faktury na przyszłość w obliczu wymogow UE
Unia Europejska stopniowo mandatować e-faktury w państwach członkowskich. Włochy już wymagają tego dla transakcji B2B, Francja wdraża wymagania do roku 2026, a Niemcy ogłosiły obowiązkowe e-fakturowanie B2B począwszy od 2025. Wdrożenie wsparcia ZUGFeRD/Factur-X teraz przygotowuje system na te wymogi regulacyjne.
Oto wzorzec dla generatora faktur uwzgledniającego zgodność, który można skierować na różne standardy:
using IronPdf;
using System;
// Enum representing supported European e-invoicing standards
public enum InvoiceStandard
{
None,
ZUGFeRD, // German standard - uses CII XML format
FacturX, // French standard - technically identical to ZUGFeRD 2.0
Peppol // Pan-European standard - uses UBL XML format
}
// Factory class that generates invoices compliant with different e-invoicing standards
// Allows switching between standards without changing core invoice generation logic
public class CompliantInvoiceGenerator
{
public PdfDocument GenerateCompliantInvoice(Invoice invoice, InvoiceStandard standard)
{
// Generate the base PDF from HTML
var renderer = new ChromePdfRenderer();
string html = BuildInvoiceHtml(invoice);
var pdf = renderer.RenderHtmlAsPdf(html);
// Attach the appropriate XML format based on target market/regulation
switch (standard)
{
case InvoiceStandard.ZUGFeRD:
case InvoiceStandard.FacturX:
// Both use Cross-Industry Invoice format, just different filenames
EmbedCIIXmlData(pdf, invoice, standard);
break;
case InvoiceStandard.Peppol:
// Peppol uses Universal Business Language format
EmbedUBLXmlData(pdf, invoice);
break;
}
return pdf;
}
// Creates and embeds CII-format XML (used by ZUGFeRD and Factur-X)
private void EmbedCIIXmlData(PdfDocument pdf, Invoice invoice, InvoiceStandard standard)
{
string xml = GenerateCIIXml(invoice);
// Filename convention differs between German and French standards
string filename = standard == InvoiceStandard.ZUGFeRD
? "zugferd-invoice.xml"
: "factur-x.xml";
System.IO.File.WriteAllText("temp-invoice.xml", xml);
pdf.Attachments.AddFile("temp-invoice.xml", filename, $"{standard} Invoice Data");
}
// Creates and embeds UBL-format XML for Peppol network compliance
private void EmbedUBLXmlData(PdfDocument pdf, Invoice invoice)
{
// UBL (Universal Business Language) is the Peppol standard format
string xml = $@"<?xml version='1.0' encoding='UTF-8'?>
<Invoice xmlns='urn:oasis:names:specification:ubl:schema:xsd:Invoice-2'>
<id>{invoice.InvoiceNumber}</id>
<IssueDate>{invoice.InvoiceDate:yyyy-MM-dd}</IssueDate>
<DocumentCurrencyCode>EUR</DocumentCurrencyCode>
<LegalMonetaryTotal>
<PayableAmount currencyID='EUR'>{invoice.Total}</PayableAmount>
</LegalMonetaryTotal>
</Invoice>";
System.IO.File.WriteAllText("peppol-invoice.xml", xml);
pdf.Attachments.AddFile("peppol-invoice.xml", "invoice.xml", "Peppol UBL Invoice");
}
// Generates minimal CII XML structure for demonstration
private string GenerateCIIXml(Invoice invoice)
{
return $@"<?xml version='1.0' encoding='UTF-8'?>
<rsm:CrossIndustryInvoice
xmlns:rsm='urn:un:unece:uncefact:data:standard:CrossIndustryInvoice:100'
xmlns:ram='urn:un:unece:uncefact:data:standard:ReusableAggregateBusinessInformationEntity:100'>
<rsm:ExchangedDocument>
<ram:ID>{invoice.InvoiceNumber}</ram:ID>
<ram:TypeCode>380</ram:TypeCode>
</rsm:ExchangedDocument>
</rsm:CrossIndustryInvoice>";
}
private string BuildInvoiceHtml(Invoice invoice)
{
return $"<html><body><h1>Invoice {invoice.InvoiceNumber}</h1></body></html>";
}
}
using IronPdf;
using System;
// Enum representing supported European e-invoicing standards
public enum InvoiceStandard
{
None,
ZUGFeRD, // German standard - uses CII XML format
FacturX, // French standard - technically identical to ZUGFeRD 2.0
Peppol // Pan-European standard - uses UBL XML format
}
// Factory class that generates invoices compliant with different e-invoicing standards
// Allows switching between standards without changing core invoice generation logic
public class CompliantInvoiceGenerator
{
public PdfDocument GenerateCompliantInvoice(Invoice invoice, InvoiceStandard standard)
{
// Generate the base PDF from HTML
var renderer = new ChromePdfRenderer();
string html = BuildInvoiceHtml(invoice);
var pdf = renderer.RenderHtmlAsPdf(html);
// Attach the appropriate XML format based on target market/regulation
switch (standard)
{
case InvoiceStandard.ZUGFeRD:
case InvoiceStandard.FacturX:
// Both use Cross-Industry Invoice format, just different filenames
EmbedCIIXmlData(pdf, invoice, standard);
break;
case InvoiceStandard.Peppol:
// Peppol uses Universal Business Language format
EmbedUBLXmlData(pdf, invoice);
break;
}
return pdf;
}
// Creates and embeds CII-format XML (used by ZUGFeRD and Factur-X)
private void EmbedCIIXmlData(PdfDocument pdf, Invoice invoice, InvoiceStandard standard)
{
string xml = GenerateCIIXml(invoice);
// Filename convention differs between German and French standards
string filename = standard == InvoiceStandard.ZUGFeRD
? "zugferd-invoice.xml"
: "factur-x.xml";
System.IO.File.WriteAllText("temp-invoice.xml", xml);
pdf.Attachments.AddFile("temp-invoice.xml", filename, $"{standard} Invoice Data");
}
// Creates and embeds UBL-format XML for Peppol network compliance
private void EmbedUBLXmlData(PdfDocument pdf, Invoice invoice)
{
// UBL (Universal Business Language) is the Peppol standard format
string xml = $@"<?xml version='1.0' encoding='UTF-8'?>
<Invoice xmlns='urn:oasis:names:specification:ubl:schema:xsd:Invoice-2'>
<id>{invoice.InvoiceNumber}</id>
<IssueDate>{invoice.InvoiceDate:yyyy-MM-dd}</IssueDate>
<DocumentCurrencyCode>EUR</DocumentCurrencyCode>
<LegalMonetaryTotal>
<PayableAmount currencyID='EUR'>{invoice.Total}</PayableAmount>
</LegalMonetaryTotal>
</Invoice>";
System.IO.File.WriteAllText("peppol-invoice.xml", xml);
pdf.Attachments.AddFile("peppol-invoice.xml", "invoice.xml", "Peppol UBL Invoice");
}
// Generates minimal CII XML structure for demonstration
private string GenerateCIIXml(Invoice invoice)
{
return $@"<?xml version='1.0' encoding='UTF-8'?>
<rsm:CrossIndustryInvoice
xmlns:rsm='urn:un:unece:uncefact:data:standard:CrossIndustryInvoice:100'
xmlns:ram='urn:un:unece:uncefact:data:standard:ReusableAggregateBusinessInformationEntity:100'>
<rsm:ExchangedDocument>
<ram:ID>{invoice.InvoiceNumber}</ram:ID>
<ram:TypeCode>380</ram:TypeCode>
</rsm:ExchangedDocument>
</rsm:CrossIndustryInvoice>";
}
private string BuildInvoiceHtml(Invoice invoice)
{
return $"<html><body><h1>Invoice {invoice.InvoiceNumber}</h1></body></html>";
}
}
Imports IronPdf
Imports System
' Enum representing supported European e-invoicing standards
Public Enum InvoiceStandard
None
ZUGFeRD ' German standard - uses CII XML format
FacturX ' French standard - technically identical to ZUGFeRD 2.0
Peppol ' Pan-European standard - uses UBL XML format
End Enum
' Factory class that generates invoices compliant with different e-invoicing standards
' Allows switching between standards without changing core invoice generation logic
Public Class CompliantInvoiceGenerator
Public Function GenerateCompliantInvoice(invoice As Invoice, standard As InvoiceStandard) As PdfDocument
' Generate the base PDF from HTML
Dim renderer As New ChromePdfRenderer()
Dim html As String = BuildInvoiceHtml(invoice)
Dim pdf As PdfDocument = renderer.RenderHtmlAsPdf(html)
' Attach the appropriate XML format based on target market/regulation
Select Case standard
Case InvoiceStandard.ZUGFeRD, InvoiceStandard.FacturX
' Both use Cross-Industry Invoice format, just different filenames
EmbedCIIXmlData(pdf, invoice, standard)
Case InvoiceStandard.Peppol
' Peppol uses Universal Business Language format
EmbedUBLXmlData(pdf, invoice)
End Select
Return pdf
End Function
' Creates and embeds CII-format XML (used by ZUGFeRD and Factur-X)
Private Sub EmbedCIIXmlData(pdf As PdfDocument, invoice As Invoice, standard As InvoiceStandard)
Dim xml As String = GenerateCIIXml(invoice)
' Filename convention differs between German and French standards
Dim filename As String = If(standard = InvoiceStandard.ZUGFeRD, "zugferd-invoice.xml", "factur_x.xml")
System.IO.File.WriteAllText("temp-invoice.xml", xml)
pdf.Attachments.AddFile("temp-invoice.xml", filename, $"{standard} Invoice Data")
End Sub
' Creates and embeds UBL-format XML for Peppol network compliance
Private Sub EmbedUBLXmlData(pdf As PdfDocument, invoice As Invoice)
' UBL (Universal Business Language) is the Peppol standard format
Dim xml As String = $"<?xml version='1.0' encoding='UTF-8'?>
<Invoice xmlns='urn:oasis:names:specification:ubl:schema:xsd:Invoice-2'>
<id>{invoice.InvoiceNumber}</id>
<IssueDate>{invoice.InvoiceDate:yyyy-MM-dd}</IssueDate>
<DocumentCurrencyCode>EUR</DocumentCurrencyCode>
<LegalMonetaryTotal>
<PayableAmount currencyID='EUR'>{invoice.Total}</PayableAmount>
</LegalMonetaryTotal>
</Invoice>"
System.IO.File.WriteAllText("peppol-invoice.xml", xml)
pdf.Attachments.AddFile("peppol-invoice.xml", "invoice.xml", "Peppol UBL Invoice")
End Sub
' Generates minimal CII XML structure for demonstration
Private Function GenerateCIIXml(invoice As Invoice) As String
Return $"<?xml version='1.0' encoding='UTF-8'?>
<rsm:CrossIndustryInvoice
xmlns:rsm='urn:un:unece:uncefact:data:standard:CrossIndustryInvoice:100'
xmlns:ram='urn:un:unece:uncefact:data:standard:ReusableAggregateBusinessInformationEntity:100'>
<rsm:ExchangedDocument>
<ram:ID>{invoice.InvoiceNumber}</ram:ID>
<ram:TypeCode>380</ram:TypeCode>
</rsm:ExchangedDocument>
</rsm:CrossIndustryInvoice>"
End Function
Private Function BuildInvoiceHtml(invoice As Invoice) As String
Return $"<html><body><h1>Invoice {invoice.InvoiceNumber}</h1></body></html>"
End Function
End Class
Ta architektura pozwala na dodanie nowych standardów bez restrukturyzacji podstawowej logiki generowania faktur. Podejście oparte na enumach ułatwia pozwolenie użytkownikom lub konfiguracji, aby decydowali, który tryb zgodności użyć.
Jak wyodrębniać dane z faktur PDF w C
Generowanie faktur to tylko połowa równania. Większość firm otrzymuje również faktury od dostawców i potrzebuje wyodrębnić dane do przetworzenia. IronPDF oferuje potężne możliwości ekstrakcji tekstu, które stanowią podstawę przechwytywania danych faktur.
Jak wyodrębnić tekst z faktury PDF
Podstawowa operacja ekstrakcji pobiera całą zawartość tekstu z PDF. Metoda ExtractAllText IronPDF radzi sobie ze złożonością kodowania i pozycjonowania tekstu PDF:
using IronPdf;
using System;
// Extracts raw text content from PDF invoices for further processing
public class InvoiceTextExtractor
{
// Extracts all text from a PDF in one operation
// Best for single-page invoices or when you need the complete content
public string ExtractInvoiceText(string pdfPath)
{
var pdf = PdfDocument.FromFile(pdfPath);
// IronPDF handles the complexity of PDF text encoding and positioning
string allText = pdf.ExtractAllText();
Console.WriteLine("Full invoice text:");
Console.WriteLine(allText);
return allText;
}
// Extracts text page by page - useful for multi-page invoices
// Allows you to process header info separately from line items
public void ExtractTextByPage(string pdfPath)
{
var pdf = PdfDocument.FromFile(pdfPath);
// Iterate through each page (0-indexed)
for (int i = 0; i < pdf.PageCount; i++)
{
string pageText = pdf.ExtractTextFromPage(i);
Console.WriteLine($"\n--- Page {i + 1} ---");
Console.WriteLine(pageText);
}
}
}
using IronPdf;
using System;
// Extracts raw text content from PDF invoices for further processing
public class InvoiceTextExtractor
{
// Extracts all text from a PDF in one operation
// Best for single-page invoices or when you need the complete content
public string ExtractInvoiceText(string pdfPath)
{
var pdf = PdfDocument.FromFile(pdfPath);
// IronPDF handles the complexity of PDF text encoding and positioning
string allText = pdf.ExtractAllText();
Console.WriteLine("Full invoice text:");
Console.WriteLine(allText);
return allText;
}
// Extracts text page by page - useful for multi-page invoices
// Allows you to process header info separately from line items
public void ExtractTextByPage(string pdfPath)
{
var pdf = PdfDocument.FromFile(pdfPath);
// Iterate through each page (0-indexed)
for (int i = 0; i < pdf.PageCount; i++)
{
string pageText = pdf.ExtractTextFromPage(i);
Console.WriteLine($"\n--- Page {i + 1} ---");
Console.WriteLine(pageText);
}
}
}
Imports IronPdf
Imports System
' Extracts raw text content from PDF invoices for further processing
Public Class InvoiceTextExtractor
' Extracts all text from a PDF in one operation
' Best for single-page invoices or when you need the complete content
Public Function ExtractInvoiceText(pdfPath As String) As String
Dim pdf = PdfDocument.FromFile(pdfPath)
' IronPDF handles the complexity of PDF text encoding and positioning
Dim allText As String = pdf.ExtractAllText()
Console.WriteLine("Full invoice text:")
Console.WriteLine(allText)
Return allText
End Function
' Extracts text page by page - useful for multi-page invoices
' Allows you to process header info separately from line items
Public Sub ExtractTextByPage(pdfPath As String)
Dim pdf = PdfDocument.FromFile(pdfPath)
' Iterate through each page (0-indexed)
For i As Integer = 0 To pdf.PageCount - 1
Dim pageText As String = pdf.ExtractTextFromPage(i)
Console.WriteLine(vbCrLf & "--- Page " & (i + 1).ToString() & " ---")
Console.WriteLine(pageText)
Next
End Sub
End Class
Ekstrakcja strona po stronie jest szczególnie użyteczna dla wielostronicowych faktur, gdzie trzeba znaleźć konkretne sekcje, takie jak wyszukiwanie pozycji, które rozciągają się na kilka stron, podczas gdy informacje nagłówkowe pojawiają się tylko na pierwszej stronie.
Jak wyodrębnić dane tabel dla pozycji
Pozycje faktur zazwyczaj pojawiają się w formacie tabelarycznym. PDF-y nie mają natywnej struktury tabel, ale można wyodrębnić tekst i przeparsować go, aby odbudować dane tabel:
using IronPdf;
using System;
using System.Collections.Generic;
// Data model for a single invoice line item
public class InvoiceLineItem
{
public string Description { get; set; }
public decimal Quantity { get; set; }
public decimal UnitPrice { get; set; }
public decimal Total { get; set; }
}
// Extracts tabular line item data from PDF invoices
// Note: PDFs don't have native table structure, so this uses text parsing
public class InvoiceTableExtractor
{
public List<InvoiceLineItem> ExtractLineItems(string pdfPath)
{
var pdf = PdfDocument.FromFile(pdfPath);
string text = pdf.ExtractAllText();
var lineItems = new List<InvoiceLineItem>();
string[] lines = text.Split('\n');
foreach (string line in lines)
{
// Currency symbols indicate potential line items with amounts
if (line.Contains("$") || line.Contains("€"))
{
Console.WriteLine($"Potential line item: {line.Trim()}");
// Split on whitespace to separate columns
// Actual parsing logic depends on your invoice format
string[] parts = line.Split(new[] { '\t', ' ' },
StringSplitOptions.RemoveEmptyEntries);
// Try to find numeric values that could be amounts
foreach (string part in parts)
{
string cleaned = part.Replace("$", "").Replace("€", "").Replace(",", "");
if (decimal.TryParse(cleaned, out decimal amount))
{
Console.WriteLine($" Found amount: {amount:C}");
}
}
}
}
return lineItems;
}
}
using IronPdf;
using System;
using System.Collections.Generic;
// Data model for a single invoice line item
public class InvoiceLineItem
{
public string Description { get; set; }
public decimal Quantity { get; set; }
public decimal UnitPrice { get; set; }
public decimal Total { get; set; }
}
// Extracts tabular line item data from PDF invoices
// Note: PDFs don't have native table structure, so this uses text parsing
public class InvoiceTableExtractor
{
public List<InvoiceLineItem> ExtractLineItems(string pdfPath)
{
var pdf = PdfDocument.FromFile(pdfPath);
string text = pdf.ExtractAllText();
var lineItems = new List<InvoiceLineItem>();
string[] lines = text.Split('\n');
foreach (string line in lines)
{
// Currency symbols indicate potential line items with amounts
if (line.Contains("$") || line.Contains("€"))
{
Console.WriteLine($"Potential line item: {line.Trim()}");
// Split on whitespace to separate columns
// Actual parsing logic depends on your invoice format
string[] parts = line.Split(new[] { '\t', ' ' },
StringSplitOptions.RemoveEmptyEntries);
// Try to find numeric values that could be amounts
foreach (string part in parts)
{
string cleaned = part.Replace("$", "").Replace("€", "").Replace(",", "");
if (decimal.TryParse(cleaned, out decimal amount))
{
Console.WriteLine($" Found amount: {amount:C}");
}
}
}
}
return lineItems;
}
}
Imports IronPdf
Imports System
Imports System.Collections.Generic
' Data model for a single invoice line item
Public Class InvoiceLineItem
Public Property Description As String
Public Property Quantity As Decimal
Public Property UnitPrice As Decimal
Public Property Total As Decimal
End Class
' Extracts tabular line item data from PDF invoices
' Note: PDFs don't have native table structure, so this uses text parsing
Public Class InvoiceTableExtractor
Public Function ExtractLineItems(pdfPath As String) As List(Of InvoiceLineItem)
Dim pdf = PdfDocument.FromFile(pdfPath)
Dim text As String = pdf.ExtractAllText()
Dim lineItems As New List(Of InvoiceLineItem)()
Dim lines() As String = text.Split(ControlChars.Lf)
For Each line As String In lines
' Currency symbols indicate potential line items with amounts
If line.Contains("$") OrElse line.Contains("€") Then
Console.WriteLine($"Potential line item: {line.Trim()}")
' Split on whitespace to separate columns
' Actual parsing logic depends on your invoice format
Dim parts() As String = line.Split(New Char() {ControlChars.Tab, " "c}, StringSplitOptions.RemoveEmptyEntries)
' Try to find numeric values that could be amounts
For Each part As String In parts
Dim cleaned As String = part.Replace("$", "").Replace("€", "").Replace(",", "")
Dim amount As Decimal
If Decimal.TryParse(cleaned, amount) Then
Console.WriteLine($" Found amount: {amount:C}")
End If
Next
End If
Next
Return lineItems
End Function
End Class
Logika parsowania będzie się różnić w zależności od formatów faktur. Dla faktur o spójnych układach od znanych dostawców, można zbudować parsery specyficzne dla formatu. Dla zróżnicowanych formatów, rozważ ekstrakcję wspomaganą przez AI omówioną później w tym artykule.
Jak używać dopasowania wzorców dla numerów faktur, dat i sum
Wyrażenia regularne są nieocenione przy wyodrębnianiu konkretnych punktów danych z tekstu faktur. Kluczowe pola, jak numery faktur, daty i sumy, często podążają za rozpoznawalnymi wzorcami:
using IronPdf;
using System;
using System.Text.RegularExpressions;
// Data model for extracted invoice information
public class InvoiceData
{
public string InvoiceNumber { get; set; }
public string InvoiceDate { get; set; }
public decimal TotalAmount { get; set; }
public string VendorName { get; set; }
}
// Extracts key invoice fields using regex pattern matching
// Multiple patterns handle variations across different vendors
public class InvoiceParser
{
public InvoiceData ParseInvoice(string pdfPath)
{
var pdf = PdfDocument.FromFile(pdfPath);
string text = pdf.ExtractAllText();
var invoiceData = new InvoiceData();
// Try multiple patterns to find invoice number
// Handles: "Invoice #123", "INV-123", "Invoice Number: 123", German "Rechnungsnummer"
string[] invoiceNumberPatterns = new[]
{
@"Invoice\s*#?\s*:?\s*([A-Z0-9-]+)",
@"INV[-\s]?(\d+)",
@"Invoice\s+Number\s*:?\s*([A-Z0-9-]+)",
@"Rechnungsnummer\s*:?\s*([A-Z0-9-]+)"
};
foreach (string pattern in invoiceNumberPatterns)
{
var match = Regex.Match(text, pattern, RegexOptions.IgnoreCase);
if (match.Success)
{
invoiceData.InvoiceNumber = match.Groups[1].Value;
Console.WriteLine($"Found Invoice Number: {invoiceData.InvoiceNumber}");
break;
}
}
// Date patterns for US, European, and written formats
string[] datePatterns = new[]
{
@"Date\s*:?\s*(\d{1,2}[/-]\d{1,2}[/-]\d{2,4})",
@"Invoice\s+Date\s*:?\s*(\d{1,2}[/-]\d{1,2}[/-]\d{2,4})",
@"(\d{1,2}\.\d{1,2}\.\d{4})", // European: DD.MM.YYYY
@"(\w+\s+\d{1,2},?\s+\d{4})" // Written: January 15, 2024
};
foreach (string pattern in datePatterns)
{
var match = Regex.Match(text, pattern, RegexOptions.IgnoreCase);
if (match.Success)
{
invoiceData.InvoiceDate = match.Groups[1].Value;
Console.WriteLine($"Found Date: {invoiceData.InvoiceDate}");
break;
}
}
// Look for total amount with various labels
string[] totalPatterns = new[]
{
@"Total\s*:?\s*[\$€]?\s*([\d,]+\.\d{2})",
@"Amount\s+Due\s*:?\s*[\$€]?\s*([\d,]+\.\d{2})",
@"Grand\s+Total\s*:?\s*[\$€]?\s*([\d,]+\.\d{2})",
@"Balance\s+Due\s*:?\s*[\$€]?\s*([\d,]+\.\d{2})"
};
foreach (string pattern in totalPatterns)
{
var match = Regex.Match(text, pattern, RegexOptions.IgnoreCase);
if (match.Success)
{
// Remove commas before parsing
string amountStr = match.Groups[1].Value.Replace(",", "");
if (decimal.TryParse(amountStr, out decimal amount))
{
invoiceData.TotalAmount = amount;
Console.WriteLine($"Found Total: ${invoiceData.TotalAmount:F2}");
break;
}
}
}
return invoiceData;
}
}
using IronPdf;
using System;
using System.Text.RegularExpressions;
// Data model for extracted invoice information
public class InvoiceData
{
public string InvoiceNumber { get; set; }
public string InvoiceDate { get; set; }
public decimal TotalAmount { get; set; }
public string VendorName { get; set; }
}
// Extracts key invoice fields using regex pattern matching
// Multiple patterns handle variations across different vendors
public class InvoiceParser
{
public InvoiceData ParseInvoice(string pdfPath)
{
var pdf = PdfDocument.FromFile(pdfPath);
string text = pdf.ExtractAllText();
var invoiceData = new InvoiceData();
// Try multiple patterns to find invoice number
// Handles: "Invoice #123", "INV-123", "Invoice Number: 123", German "Rechnungsnummer"
string[] invoiceNumberPatterns = new[]
{
@"Invoice\s*#?\s*:?\s*([A-Z0-9-]+)",
@"INV[-\s]?(\d+)",
@"Invoice\s+Number\s*:?\s*([A-Z0-9-]+)",
@"Rechnungsnummer\s*:?\s*([A-Z0-9-]+)"
};
foreach (string pattern in invoiceNumberPatterns)
{
var match = Regex.Match(text, pattern, RegexOptions.IgnoreCase);
if (match.Success)
{
invoiceData.InvoiceNumber = match.Groups[1].Value;
Console.WriteLine($"Found Invoice Number: {invoiceData.InvoiceNumber}");
break;
}
}
// Date patterns for US, European, and written formats
string[] datePatterns = new[]
{
@"Date\s*:?\s*(\d{1,2}[/-]\d{1,2}[/-]\d{2,4})",
@"Invoice\s+Date\s*:?\s*(\d{1,2}[/-]\d{1,2}[/-]\d{2,4})",
@"(\d{1,2}\.\d{1,2}\.\d{4})", // European: DD.MM.YYYY
@"(\w+\s+\d{1,2},?\s+\d{4})" // Written: January 15, 2024
};
foreach (string pattern in datePatterns)
{
var match = Regex.Match(text, pattern, RegexOptions.IgnoreCase);
if (match.Success)
{
invoiceData.InvoiceDate = match.Groups[1].Value;
Console.WriteLine($"Found Date: {invoiceData.InvoiceDate}");
break;
}
}
// Look for total amount with various labels
string[] totalPatterns = new[]
{
@"Total\s*:?\s*[\$€]?\s*([\d,]+\.\d{2})",
@"Amount\s+Due\s*:?\s*[\$€]?\s*([\d,]+\.\d{2})",
@"Grand\s+Total\s*:?\s*[\$€]?\s*([\d,]+\.\d{2})",
@"Balance\s+Due\s*:?\s*[\$€]?\s*([\d,]+\.\d{2})"
};
foreach (string pattern in totalPatterns)
{
var match = Regex.Match(text, pattern, RegexOptions.IgnoreCase);
if (match.Success)
{
// Remove commas before parsing
string amountStr = match.Groups[1].Value.Replace(",", "");
if (decimal.TryParse(amountStr, out decimal amount))
{
invoiceData.TotalAmount = amount;
Console.WriteLine($"Found Total: ${invoiceData.TotalAmount:F2}");
break;
}
}
}
return invoiceData;
}
}
Imports IronPdf
Imports System
Imports System.Text.RegularExpressions
' Data model for extracted invoice information
Public Class InvoiceData
Public Property InvoiceNumber As String
Public Property InvoiceDate As String
Public Property TotalAmount As Decimal
Public Property VendorName As String
End Class
' Extracts key invoice fields using regex pattern matching
' Multiple patterns handle variations across different vendors
Public Class InvoiceParser
Public Function ParseInvoice(pdfPath As String) As InvoiceData
Dim pdf = PdfDocument.FromFile(pdfPath)
Dim text As String = pdf.ExtractAllText()
Dim invoiceData As New InvoiceData()
' Try multiple patterns to find invoice number
' Handles: "Invoice #123", "INV-123", "Invoice Number: 123", German "Rechnungsnummer"
Dim invoiceNumberPatterns As String() = {
"Invoice\s*#?\s*:?\s*([A-Z0-9-]+)",
"INV[-\s]?(\d+)",
"Invoice\s+Number\s*:?\s*([A-Z0-9-]+)",
"Rechnungsnummer\s*:?\s*([A-Z0-9-]+)"
}
For Each pattern As String In invoiceNumberPatterns
Dim match = Regex.Match(text, pattern, RegexOptions.IgnoreCase)
If match.Success Then
invoiceData.InvoiceNumber = match.Groups(1).Value
Console.WriteLine($"Found Invoice Number: {invoiceData.InvoiceNumber}")
Exit For
End If
Next
' Date patterns for US, European, and written formats
Dim datePatterns As String() = {
"Date\s*:?\s*(\d{1,2}[/-]\d{1,2}[/-]\d{2,4})",
"Invoice\s+Date\s*:?\s*(\d{1,2}[/-]\d{1,2}[/-]\d{2,4})",
"(\d{1,2}\.\d{1,2}\.\d{4})", ' European: DD.MM.YYYY
"(\w+\s+\d{1,2},?\s+\d{4})" ' Written: January 15, 2024
}
For Each pattern As String In datePatterns
Dim match = Regex.Match(text, pattern, RegexOptions.IgnoreCase)
If match.Success Then
invoiceData.InvoiceDate = match.Groups(1).Value
Console.WriteLine($"Found Date: {invoiceData.InvoiceDate}")
Exit For
End If
Next
' Look for total amount with various labels
Dim totalPatterns As String() = {
"Total\s*:?\s*[\$€]?\s*([\d,]+\.\d{2})",
"Amount\s+Due\s*:?\s*[\$€]?\s*([\d,]+\.\d{2})",
"Grand\s+Total\s*:?\s*[\$€]?\s*([\d,]+\.\d{2})",
"Balance\s+Due\s*:?\s*[\$€]?\s*([\d,]+\.\d{2})"
}
For Each pattern As String In totalPatterns
Dim match = Regex.Match(text, pattern, RegexOptions.IgnoreCase)
If match.Success Then
' Remove commas before parsing
Dim amountStr As String = match.Groups(1).Value.Replace(",", "")
Dim amount As Decimal
If Decimal.TryParse(amountStr, amount) Then
invoiceData.TotalAmount = amount
Console.WriteLine($"Found Total: ${invoiceData.TotalAmount:F2}")
Exit For
End If
End If
Next
Return invoiceData
End Function
End Class
To podejście oparte na wzorcach działa dobrze dla faktur o przewidywalnych formatach. Rozmaite warianty wzorców obsługują częste różnice formatowania wśród dostawcow, takie jak "Invoice #" i "Invoice Number:"
Co z fakturami zeskanowanymi lub w formie obrazów?
Metody ekstrakcji tekstu pokazane powyżej działają z PDF-ami, które zawierają osadzony tekst. Jednak zeskanowane dokumenty i PDF-y obrazowe nie mają wyodrębniającego się tekstu. Są one zasadniczo obrazami faktur.
Jak użyć AI do przetwarzania faktur w .NET
Tradycyjne dopasowanie wzorców działa dobrze dla zstandardyzowanych faktur, ale działy rozrachunków do zapłacenia w rzeczywistym świecie otrzymują dokumenty w niezliczonej liczbie formatów. Tu błyszczy ekstrakcja wspomagana przez AI. Duże modele języków mogą rozumieć semantykę faktur i wyciągać strukturalne dane nawet z nieznanych układów.
Jak zintegrować AI do parsowania faktur
Wzorzec dla przetwarzania faktur wspomaganego przez AI łączy ekstrakcję tekstu z IronPDF z wywołaniami API LLM. Oto generyczna implementacja, która działa z każdą zgodną z OpenAI API:
using IronPdf;
using System;
using System.Net.Http;
using System.Text;
using System.Text.Json;
using System.Threading.Tasks;
// Data model for extracted invoice information
public class InvoiceData
{
public string InvoiceNumber { get; set; }
public string InvoiceDate { get; set; }
public string VendorName { get; set; }
public decimal TotalAmount { get; set; }
}
// Leverages AI/LLM APIs to extract structured data from any invoice format
// Works with OpenAI or any compatible API endpoint
public class AIInvoiceParser
{
private readonly HttpClient _httpClient;
private readonly string _apiKey;
private readonly string _apiUrl;
public AIInvoiceParser(string apiKey, string apiUrl = "https://api.openai.com/v1/chat/completions")
{
_apiKey = apiKey;
_apiUrl = apiUrl;
_httpClient = new HttpClient();
_httpClient.DefaultRequestHeaders.Add("Authorization", $"Bearer {_apiKey}");
}
public async Task<InvoiceData> ParseInvoiceWithAI(string pdfPath)
{
// First extract raw text from the PDF using IronPDF
var pdf = PdfDocument.FromFile(pdfPath);
string invoiceText = pdf.ExtractAllText();
// Construct a prompt that instructs the AI to return structured JSON
// Being explicit about the format reduces parsing errors
string prompt = $@"Extract the following information from this invoice text.
Return ONLY valid JSON with no additional text or markdown formatting.
Required fields:
- InvoiceNumber: The invoice or document number
- InvoiceDate: The invoice date in YYYY-MM-DD format
- VendorName: The company or person who sent the invoice
- TotalAmount: The total amount due as a number (no currency symbols)
Invoice text:
{invoiceText}
JSON response:";
// Build the API request with a system prompt for context
var requestBody = new
{
model = "gpt-4",
messages = new[]
{
new {
role = "system",
content = "You are an invoice data extraction assistant. Extract structured data from invoices and return valid JSON only."
},
new { role = "user", content = prompt }
},
temperature = 0.1 // Low temperature ensures consistent, deterministic results
};
var json = JsonSerializer.Serialize(requestBody);
var content = new StringContent(json, Encoding.UTF8, "application/json");
var response = await _httpClient.PostAsync(_apiUrl, content);
var responseJson = await response.Content.ReadAsStringAsync();
// Navigate the API response structure to get the extracted content
using var doc = JsonDocument.Parse(responseJson);
var messageContent = doc.RootElement
.GetProperty("choices")[0]
.GetProperty("message")
.GetProperty("content")
.GetString();
Console.WriteLine("AI Extracted Data:");
Console.WriteLine(messageContent);
// Deserialize the AI's JSON response into our data class
var invoiceData = JsonSerializer.Deserialize<InvoiceData>(messageContent,
new JsonSerializerOptions { PropertyNameCaseInsensitive = true });
return invoiceData;
}
}
using IronPdf;
using System;
using System.Net.Http;
using System.Text;
using System.Text.Json;
using System.Threading.Tasks;
// Data model for extracted invoice information
public class InvoiceData
{
public string InvoiceNumber { get; set; }
public string InvoiceDate { get; set; }
public string VendorName { get; set; }
public decimal TotalAmount { get; set; }
}
// Leverages AI/LLM APIs to extract structured data from any invoice format
// Works with OpenAI or any compatible API endpoint
public class AIInvoiceParser
{
private readonly HttpClient _httpClient;
private readonly string _apiKey;
private readonly string _apiUrl;
public AIInvoiceParser(string apiKey, string apiUrl = "https://api.openai.com/v1/chat/completions")
{
_apiKey = apiKey;
_apiUrl = apiUrl;
_httpClient = new HttpClient();
_httpClient.DefaultRequestHeaders.Add("Authorization", $"Bearer {_apiKey}");
}
public async Task<InvoiceData> ParseInvoiceWithAI(string pdfPath)
{
// First extract raw text from the PDF using IronPDF
var pdf = PdfDocument.FromFile(pdfPath);
string invoiceText = pdf.ExtractAllText();
// Construct a prompt that instructs the AI to return structured JSON
// Being explicit about the format reduces parsing errors
string prompt = $@"Extract the following information from this invoice text.
Return ONLY valid JSON with no additional text or markdown formatting.
Required fields:
- InvoiceNumber: The invoice or document number
- InvoiceDate: The invoice date in YYYY-MM-DD format
- VendorName: The company or person who sent the invoice
- TotalAmount: The total amount due as a number (no currency symbols)
Invoice text:
{invoiceText}
JSON response:";
// Build the API request with a system prompt for context
var requestBody = new
{
model = "gpt-4",
messages = new[]
{
new {
role = "system",
content = "You are an invoice data extraction assistant. Extract structured data from invoices and return valid JSON only."
},
new { role = "user", content = prompt }
},
temperature = 0.1 // Low temperature ensures consistent, deterministic results
};
var json = JsonSerializer.Serialize(requestBody);
var content = new StringContent(json, Encoding.UTF8, "application/json");
var response = await _httpClient.PostAsync(_apiUrl, content);
var responseJson = await response.Content.ReadAsStringAsync();
// Navigate the API response structure to get the extracted content
using var doc = JsonDocument.Parse(responseJson);
var messageContent = doc.RootElement
.GetProperty("choices")[0]
.GetProperty("message")
.GetProperty("content")
.GetString();
Console.WriteLine("AI Extracted Data:");
Console.WriteLine(messageContent);
// Deserialize the AI's JSON response into our data class
var invoiceData = JsonSerializer.Deserialize<InvoiceData>(messageContent,
new JsonSerializerOptions { PropertyNameCaseInsensitive = true });
return invoiceData;
}
}
Imports IronPdf
Imports System
Imports System.Net.Http
Imports System.Text
Imports System.Text.Json
Imports System.Threading.Tasks
' Data model for extracted invoice information
Public Class InvoiceData
Public Property InvoiceNumber As String
Public Property InvoiceDate As String
Public Property VendorName As String
Public Property TotalAmount As Decimal
End Class
' Leverages AI/LLM APIs to extract structured data from any invoice format
' Works with OpenAI or any compatible API endpoint
Public Class AIInvoiceParser
Private ReadOnly _httpClient As HttpClient
Private ReadOnly _apiKey As String
Private ReadOnly _apiUrl As String
Public Sub New(apiKey As String, Optional apiUrl As String = "https://api.openai.com/v1/chat/completions")
_apiKey = apiKey
_apiUrl = apiUrl
_httpClient = New HttpClient()
_httpClient.DefaultRequestHeaders.Add("Authorization", $"Bearer {_apiKey}")
End Sub
Public Async Function ParseInvoiceWithAI(pdfPath As String) As Task(Of InvoiceData)
' First extract raw text from the PDF using IronPDF
Dim pdf = PdfDocument.FromFile(pdfPath)
Dim invoiceText As String = pdf.ExtractAllText()
' Construct a prompt that instructs the AI to return structured JSON
' Being explicit about the format reduces parsing errors
Dim prompt As String = $"Extract the following information from this invoice text.
Return ONLY valid JSON with no additional text or markdown formatting.
Required fields:
- InvoiceNumber: The invoice or document number
- InvoiceDate: The invoice date in YYYY-MM-DD format
- VendorName: The company or person who sent the invoice
- TotalAmount: The total amount due as a number (no currency symbols)
Invoice text:
{invoiceText}
JSON response:"
' Build the API request with a system prompt for context
Dim requestBody = New With {
.model = "gpt-4",
.messages = New Object() {
New With {
.role = "system",
.content = "You are an invoice data extraction assistant. Extract structured data from invoices and return valid JSON only."
},
New With {
.role = "user",
.content = prompt
}
},
.temperature = 0.1 ' Low temperature ensures consistent, deterministic results
}
Dim json As String = JsonSerializer.Serialize(requestBody)
Dim content As New StringContent(json, Encoding.UTF8, "application/json")
Dim response = Await _httpClient.PostAsync(_apiUrl, content)
Dim responseJson As String = Await response.Content.ReadAsStringAsync()
' Navigate the API response structure to get the extracted content
Using doc = JsonDocument.Parse(responseJson)
Dim messageContent As String = doc.RootElement _
.GetProperty("choices")(0) _
.GetProperty("message") _
.GetProperty("content") _
.GetString()
Console.WriteLine("AI Extracted Data:")
Console.WriteLine(messageContent)
' Deserialize the AI's JSON response into our data class
Dim invoiceData As InvoiceData = JsonSerializer.Deserialize(Of InvoiceData)(messageContent, New JsonSerializerOptions With {.PropertyNameCaseInsensitive = True})
Return invoiceData
End Using
End Function
End Class
Ustawienie niskiej temperatury (0.1) zachęca do deterministycznych wyników, co jest ważne w zadaniach związanych z ekstrakcją danych, gdzie chcesz uzyskać spójne wyniki dla tych samych wejść.
Jak wyodrębnić strukturalne dane JSON z faktur
Dla bardziej złożonych faktur z pozycjami, danymi dostawcow i informacji o klientach, możesz poprosić o bogatszą strukturę JSON:
using IronPdf;
using System;
using System.Collections.Generic;
using System.Text.Json;
using System.Threading.Tasks;
// Comprehensive invoice data model with all details
public class DetailedInvoiceData
{
public string InvoiceNumber { get; set; }
public DateTime InvoiceDate { get; set; }
public DateTime DueDate { get; set; }
public VendorInfo Vendor { get; set; }
public CustomerInfo Customer { get; set; }
public List<LineItem> LineItems { get; set; }
public decimal Subtotal { get; set; }
public decimal Tax { get; set; }
public decimal Total { get; set; }
}
public class VendorInfo
{
public string Name { get; set; }
public string Address { get; set; }
public string TaxId { get; set; }
}
public class CustomerInfo
{
public string Name { get; set; }
public string Address { get; set; }
}
public class LineItem
{
public string Description { get; set; }
public decimal Quantity { get; set; }
public decimal UnitPrice { get; set; }
public decimal Total { get; set; }
}
// Extracts comprehensive invoice data including line items and party details
public class StructuredInvoiceExtractor
{
private readonly AIInvoiceParser _aiParser;
public StructuredInvoiceExtractor(string apiKey)
{
_aiParser = new AIInvoiceParser(apiKey);
}
public async Task<DetailedInvoiceData> ExtractDetailedData(string pdfPath)
{
var pdf = PdfDocument.FromFile(pdfPath);
string text = pdf.ExtractAllText();
// Define the exact JSON structure we want the AI to return
// This schema guides the AI to extract all relevant fields
string jsonSchema = @"{
""InvoiceNumber"": ""string"",
""InvoiceDate"": ""YYYY-MM-DD"",
""DueDate"": ""YYYY-MM-DD"",
""Vendor"": {
""Name"": ""string"",
""Address"": ""string"",
""TaxId"": ""string or null""
},
""Customer"": {
""Name"": ""string"",
""Address"": ""string""
},
""LineItems"": [
{
""Description"": ""string"",
""Quantity"": 0.0,
""UnitPrice"": 0.00,
""Total"": 0.00
}
],
""Subtotal"": 0.00,
""Tax"": 0.00,
""Total"": 0.00
}";
// Prompt includes both the schema and the extracted text
string prompt = $@"Extract all invoice data and return it in this exact JSON structure:
{jsonSchema}
Invoice text:
{text}
Return only valid JSON, no markdown formatting or additional text.";
// Call AI API and parse response (implementation as shown above)
// Return deserialized DetailedInvoiceData
return new DetailedInvoiceData(); // Placeholder
}
}
using IronPdf;
using System;
using System.Collections.Generic;
using System.Text.Json;
using System.Threading.Tasks;
// Comprehensive invoice data model with all details
public class DetailedInvoiceData
{
public string InvoiceNumber { get; set; }
public DateTime InvoiceDate { get; set; }
public DateTime DueDate { get; set; }
public VendorInfo Vendor { get; set; }
public CustomerInfo Customer { get; set; }
public List<LineItem> LineItems { get; set; }
public decimal Subtotal { get; set; }
public decimal Tax { get; set; }
public decimal Total { get; set; }
}
public class VendorInfo
{
public string Name { get; set; }
public string Address { get; set; }
public string TaxId { get; set; }
}
public class CustomerInfo
{
public string Name { get; set; }
public string Address { get; set; }
}
public class LineItem
{
public string Description { get; set; }
public decimal Quantity { get; set; }
public decimal UnitPrice { get; set; }
public decimal Total { get; set; }
}
// Extracts comprehensive invoice data including line items and party details
public class StructuredInvoiceExtractor
{
private readonly AIInvoiceParser _aiParser;
public StructuredInvoiceExtractor(string apiKey)
{
_aiParser = new AIInvoiceParser(apiKey);
}
public async Task<DetailedInvoiceData> ExtractDetailedData(string pdfPath)
{
var pdf = PdfDocument.FromFile(pdfPath);
string text = pdf.ExtractAllText();
// Define the exact JSON structure we want the AI to return
// This schema guides the AI to extract all relevant fields
string jsonSchema = @"{
""InvoiceNumber"": ""string"",
""InvoiceDate"": ""YYYY-MM-DD"",
""DueDate"": ""YYYY-MM-DD"",
""Vendor"": {
""Name"": ""string"",
""Address"": ""string"",
""TaxId"": ""string or null""
},
""Customer"": {
""Name"": ""string"",
""Address"": ""string""
},
""LineItems"": [
{
""Description"": ""string"",
""Quantity"": 0.0,
""UnitPrice"": 0.00,
""Total"": 0.00
}
],
""Subtotal"": 0.00,
""Tax"": 0.00,
""Total"": 0.00
}";
// Prompt includes both the schema and the extracted text
string prompt = $@"Extract all invoice data and return it in this exact JSON structure:
{jsonSchema}
Invoice text:
{text}
Return only valid JSON, no markdown formatting or additional text.";
// Call AI API and parse response (implementation as shown above)
// Return deserialized DetailedInvoiceData
return new DetailedInvoiceData(); // Placeholder
}
}
Imports IronPdf
Imports System
Imports System.Collections.Generic
Imports System.Text.Json
Imports System.Threading.Tasks
' Comprehensive invoice data model with all details
Public Class DetailedInvoiceData
Public Property InvoiceNumber As String
Public Property InvoiceDate As DateTime
Public Property DueDate As DateTime
Public Property Vendor As VendorInfo
Public Property Customer As CustomerInfo
Public Property LineItems As List(Of LineItem)
Public Property Subtotal As Decimal
Public Property Tax As Decimal
Public Property Total As Decimal
End Class
Public Class VendorInfo
Public Property Name As String
Public Property Address As String
Public Property TaxId As String
End Class
Public Class CustomerInfo
Public Property Name As String
Public Property Address As String
End Class
Public Class LineItem
Public Property Description As String
Public Property Quantity As Decimal
Public Property UnitPrice As Decimal
Public Property Total As Decimal
End Class
' Extracts comprehensive invoice data including line items and party details
Public Class StructuredInvoiceExtractor
Private ReadOnly _aiParser As AIInvoiceParser
Public Sub New(apiKey As String)
_aiParser = New AIInvoiceParser(apiKey)
End Sub
Public Async Function ExtractDetailedData(pdfPath As String) As Task(Of DetailedInvoiceData)
Dim pdf = PdfDocument.FromFile(pdfPath)
Dim text As String = pdf.ExtractAllText()
' Define the exact JSON structure we want the AI to return
' This schema guides the AI to extract all relevant fields
Dim jsonSchema As String = "{
""InvoiceNumber"": ""string"",
""InvoiceDate"": ""YYYY-MM-DD"",
""DueDate"": ""YYYY-MM-DD"",
""Vendor"": {
""Name"": ""string"",
""Address"": ""string"",
""TaxId"": ""string or null""
},
""Customer"": {
""Name"": ""string"",
""Address"": ""string""
},
""LineItems"": [
{
""Description"": ""string"",
""Quantity"": 0.0,
""UnitPrice"": 0.00,
""Total"": 0.00
}
],
""Subtotal"": 0.00,
""Tax"": 0.00,
""Total"": 0.00
}"
' Prompt includes both the schema and the extracted text
Dim prompt As String = $"
Extract all invoice data and return it in this exact JSON structure:
{jsonSchema}
Invoice text:
{text}
Return only valid JSON, no markdown formatting or additional text."
' Call AI API and parse response (implementation as shown above)
' Return deserialized DetailedInvoiceData
Return New DetailedInvoiceData() ' Placeholder
End Function
End Class
Jak obsługiwać niespójne formaty faktur
Prawdziwa moc ekstrakcji AI ujawnia się przy przetwarzaniu faktur od wielu dostawców, z których każdy posiada unikalne formaty. Inteligentny procesor może najpierw próbować ekstrakcji opartej na wzorcach (szybciej i taniej) i sięgać po AI tylko w razie potrzeby:
using IronPdf;
using System.Threading.Tasks;
// Hybrid processor that optimizes for cost and capability
// Tries fast regex patterns first, uses AI only when patterns fail
public class SmartInvoiceProcessor
{
private readonly AIInvoiceParser _aiParser;
public SmartInvoiceProcessor(string aiApiKey)
{
_aiParser = new AIInvoiceParser(aiApiKey);
}
public async Task<InvoiceData> ProcessAnyInvoice(string pdfPath)
{
var pdf = PdfDocument.FromFile(pdfPath);
string text = pdf.ExtractAllText();
// First attempt: regex patterns (fast and free)
var patternParser = new InvoiceParser();
var standardResult = patternParser.ParseInvoiceFromText(text);
// If pattern matching found all required fields, use that result
if (IsComplete(standardResult))
{
Console.WriteLine("Pattern extraction successful");
return standardResult;
}
// Fallback: use AI for complex or unusual invoice formats
// This costs money but handles any layout
Console.WriteLine("Using AI extraction for complex invoice format");
var aiResult = await _aiParser.ParseInvoiceWithAI(pdfPath);
return aiResult;
}
// Validates that we have the minimum required fields
private bool IsComplete(InvoiceData data)
{
return !string.IsNullOrEmpty(data.InvoiceNumber) &&
!string.IsNullOrEmpty(data.InvoiceDate) &&
data.TotalAmount > 0;
}
}
using IronPdf;
using System.Threading.Tasks;
// Hybrid processor that optimizes for cost and capability
// Tries fast regex patterns first, uses AI only when patterns fail
public class SmartInvoiceProcessor
{
private readonly AIInvoiceParser _aiParser;
public SmartInvoiceProcessor(string aiApiKey)
{
_aiParser = new AIInvoiceParser(aiApiKey);
}
public async Task<InvoiceData> ProcessAnyInvoice(string pdfPath)
{
var pdf = PdfDocument.FromFile(pdfPath);
string text = pdf.ExtractAllText();
// First attempt: regex patterns (fast and free)
var patternParser = new InvoiceParser();
var standardResult = patternParser.ParseInvoiceFromText(text);
// If pattern matching found all required fields, use that result
if (IsComplete(standardResult))
{
Console.WriteLine("Pattern extraction successful");
return standardResult;
}
// Fallback: use AI for complex or unusual invoice formats
// This costs money but handles any layout
Console.WriteLine("Using AI extraction for complex invoice format");
var aiResult = await _aiParser.ParseInvoiceWithAI(pdfPath);
return aiResult;
}
// Validates that we have the minimum required fields
private bool IsComplete(InvoiceData data)
{
return !string.IsNullOrEmpty(data.InvoiceNumber) &&
!string.IsNullOrEmpty(data.InvoiceDate) &&
data.TotalAmount > 0;
}
}
Imports IronPdf
Imports System.Threading.Tasks
' Hybrid processor that optimizes for cost and capability
' Tries fast regex patterns first, uses AI only when patterns fail
Public Class SmartInvoiceProcessor
Private ReadOnly _aiParser As AIInvoiceParser
Public Sub New(aiApiKey As String)
_aiParser = New AIInvoiceParser(aiApiKey)
End Sub
Public Async Function ProcessAnyInvoice(pdfPath As String) As Task(Of InvoiceData)
Dim pdf = PdfDocument.FromFile(pdfPath)
Dim text As String = pdf.ExtractAllText()
' First attempt: regex patterns (fast and free)
Dim patternParser = New InvoiceParser()
Dim standardResult = patternParser.ParseInvoiceFromText(text)
' If pattern matching found all required fields, use that result
If IsComplete(standardResult) Then
Console.WriteLine("Pattern extraction successful")
Return standardResult
End If
' Fallback: use AI for complex or unusual invoice formats
' This costs money but handles any layout
Console.WriteLine("Using AI extraction for complex invoice format")
Dim aiResult = Await _aiParser.ParseInvoiceWithAI(pdfPath)
Return aiResult
End Function
' Validates that we have the minimum required fields
Private Function IsComplete(data As InvoiceData) As Boolean
Return Not String.IsNullOrEmpty(data.InvoiceNumber) AndAlso
Not String.IsNullOrEmpty(data.InvoiceDate) AndAlso
data.TotalAmount > 0
End Function
End Class
Jak zbudować potok automatyzacji rachunkow do zapłacenia
Łącząc wszystkie te elementy, oto kompletny potok automatyzacji, który przetwarza przychodzące faktury, wyodrębnia dane, waliduje je i przygotowuje do systemu księgowego:
using IronPdf;
using System;
using System.IO;
using System.Threading.Tasks;
using System.Collections.Generic;
using System.Linq;
// Tracks the outcome of processing each invoice
public class ProcessingResult
{
public string FileName { get; set; }
public bool Success { get; set; }
public string InvoiceNumber { get; set; }
public string ErrorMessage { get; set; }
}
// Complete automation pipeline for accounts payable
// Watches a folder, extracts data, validates, and routes to accounting system
public class InvoiceAutomationPipeline
{
private readonly SmartInvoiceProcessor _processor;
private readonly string _inputFolder;
private readonly string _processedFolder;
private readonly string _errorFolder;
public InvoiceAutomationPipeline(string apiKey, string inputFolder)
{
_processor = new SmartInvoiceProcessor(apiKey);
_inputFolder = inputFolder;
_processedFolder = Path.Combine(inputFolder, "processed");
_errorFolder = Path.Combine(inputFolder, "errors");
// Create output directories if they don't exist
Directory.CreateDirectory(_processedFolder);
Directory.CreateDirectory(_errorFolder);
}
// Main entry point - processes all PDFs in the input folder
public async Task<List<ProcessingResult>> ProcessInvoiceBatch()
{
string[] invoiceFiles = Directory.GetFiles(_inputFolder, "*.pdf");
Console.WriteLine($"Found {invoiceFiles.Length} invoices to process");
var results = new List<ProcessingResult>();
foreach (string invoicePath in invoiceFiles)
{
string fileName = Path.GetFileName(invoicePath);
try
{
Console.WriteLine($"Processing: {fileName}");
// Extract data using smart processor (patterns first, then AI)
var invoiceData = await _processor.ProcessAnyInvoice(invoicePath);
// Ensure we have minimum required fields before proceeding
if (ValidateInvoiceData(invoiceData))
{
// Send to accounting system (QuickBooks, Xero, etc.)
await SaveToAccountingSystem(invoiceData);
// Archive successful invoices
string destPath = Path.Combine(_processedFolder, fileName);
File.Move(invoicePath, destPath, overwrite: true);
results.Add(new ProcessingResult
{
FileName = fileName,
Success = true,
InvoiceNumber = invoiceData.InvoiceNumber
});
Console.WriteLine($"✓ Processed: {invoiceData.InvoiceNumber}");
}
else
{
throw new Exception("Validation failed - missing required fields");
}
}
catch (Exception ex)
{
Console.WriteLine($"✗ Failed: {fileName} - {ex.Message}");
// Quarantine failed invoices for manual review
string destPath = Path.Combine(_errorFolder, fileName);
File.Move(invoicePath, destPath, overwrite: true);
results.Add(new ProcessingResult
{
FileName = fileName,
Success = false,
ErrorMessage = ex.Message
});
}
}
GenerateReport(results);
return results;
}
// Checks for minimum required fields
private bool ValidateInvoiceData(InvoiceData data)
{
return !string.IsNullOrEmpty(data.InvoiceNumber) &&
!string.IsNullOrEmpty(data.VendorName) &&
data.TotalAmount > 0;
}
// Placeholder for accounting system integration
private async Task SaveToAccountingSystem(InvoiceData data)
{
// Integrate with your accounting system here
// Examples: QuickBooks API, Xero API, SAP, or database storage
Console.WriteLine($" Saved invoice {data.InvoiceNumber} to accounting system");
await Task.CompletedTask;
}
// Outputs a summary of the batch processing results
private void GenerateReport(List<ProcessingResult> results)
{
int successful = results.Count(r => r.Success);
int failed = results.Count(r => !r.Success);
Console.WriteLine($"\n========== Processing Complete ==========");
Console.WriteLine($"Total Processed: {results.Count}");
Console.WriteLine($"Successful: {successful}");
Console.WriteLine($"Failed: {failed}");
if (failed > 0)
{
Console.WriteLine("\nFailed invoices requiring review:");
foreach (var failure in results.Where(r => !r.Success))
{
Console.WriteLine($" • {failure.FileName}: {failure.ErrorMessage}");
}
}
}
}
using IronPdf;
using System;
using System.IO;
using System.Threading.Tasks;
using System.Collections.Generic;
using System.Linq;
// Tracks the outcome of processing each invoice
public class ProcessingResult
{
public string FileName { get; set; }
public bool Success { get; set; }
public string InvoiceNumber { get; set; }
public string ErrorMessage { get; set; }
}
// Complete automation pipeline for accounts payable
// Watches a folder, extracts data, validates, and routes to accounting system
public class InvoiceAutomationPipeline
{
private readonly SmartInvoiceProcessor _processor;
private readonly string _inputFolder;
private readonly string _processedFolder;
private readonly string _errorFolder;
public InvoiceAutomationPipeline(string apiKey, string inputFolder)
{
_processor = new SmartInvoiceProcessor(apiKey);
_inputFolder = inputFolder;
_processedFolder = Path.Combine(inputFolder, "processed");
_errorFolder = Path.Combine(inputFolder, "errors");
// Create output directories if they don't exist
Directory.CreateDirectory(_processedFolder);
Directory.CreateDirectory(_errorFolder);
}
// Main entry point - processes all PDFs in the input folder
public async Task<List<ProcessingResult>> ProcessInvoiceBatch()
{
string[] invoiceFiles = Directory.GetFiles(_inputFolder, "*.pdf");
Console.WriteLine($"Found {invoiceFiles.Length} invoices to process");
var results = new List<ProcessingResult>();
foreach (string invoicePath in invoiceFiles)
{
string fileName = Path.GetFileName(invoicePath);
try
{
Console.WriteLine($"Processing: {fileName}");
// Extract data using smart processor (patterns first, then AI)
var invoiceData = await _processor.ProcessAnyInvoice(invoicePath);
// Ensure we have minimum required fields before proceeding
if (ValidateInvoiceData(invoiceData))
{
// Send to accounting system (QuickBooks, Xero, etc.)
await SaveToAccountingSystem(invoiceData);
// Archive successful invoices
string destPath = Path.Combine(_processedFolder, fileName);
File.Move(invoicePath, destPath, overwrite: true);
results.Add(new ProcessingResult
{
FileName = fileName,
Success = true,
InvoiceNumber = invoiceData.InvoiceNumber
});
Console.WriteLine($"✓ Processed: {invoiceData.InvoiceNumber}");
}
else
{
throw new Exception("Validation failed - missing required fields");
}
}
catch (Exception ex)
{
Console.WriteLine($"✗ Failed: {fileName} - {ex.Message}");
// Quarantine failed invoices for manual review
string destPath = Path.Combine(_errorFolder, fileName);
File.Move(invoicePath, destPath, overwrite: true);
results.Add(new ProcessingResult
{
FileName = fileName,
Success = false,
ErrorMessage = ex.Message
});
}
}
GenerateReport(results);
return results;
}
// Checks for minimum required fields
private bool ValidateInvoiceData(InvoiceData data)
{
return !string.IsNullOrEmpty(data.InvoiceNumber) &&
!string.IsNullOrEmpty(data.VendorName) &&
data.TotalAmount > 0;
}
// Placeholder for accounting system integration
private async Task SaveToAccountingSystem(InvoiceData data)
{
// Integrate with your accounting system here
// Examples: QuickBooks API, Xero API, SAP, or database storage
Console.WriteLine($" Saved invoice {data.InvoiceNumber} to accounting system");
await Task.CompletedTask;
}
// Outputs a summary of the batch processing results
private void GenerateReport(List<ProcessingResult> results)
{
int successful = results.Count(r => r.Success);
int failed = results.Count(r => !r.Success);
Console.WriteLine($"\n========== Processing Complete ==========");
Console.WriteLine($"Total Processed: {results.Count}");
Console.WriteLine($"Successful: {successful}");
Console.WriteLine($"Failed: {failed}");
if (failed > 0)
{
Console.WriteLine("\nFailed invoices requiring review:");
foreach (var failure in results.Where(r => !r.Success))
{
Console.WriteLine($" • {failure.FileName}: {failure.ErrorMessage}");
}
}
}
}
Imports IronPdf
Imports System
Imports System.IO
Imports System.Threading.Tasks
Imports System.Collections.Generic
Imports System.Linq
' Tracks the outcome of processing each invoice
Public Class ProcessingResult
Public Property FileName As String
Public Property Success As Boolean
Public Property InvoiceNumber As String
Public Property ErrorMessage As String
End Class
' Complete automation pipeline for accounts payable
' Watches a folder, extracts data, validates, and routes to accounting system
Public Class InvoiceAutomationPipeline
Private ReadOnly _processor As SmartInvoiceProcessor
Private ReadOnly _inputFolder As String
Private ReadOnly _processedFolder As String
Private ReadOnly _errorFolder As String
Public Sub New(apiKey As String, inputFolder As String)
_processor = New SmartInvoiceProcessor(apiKey)
_inputFolder = inputFolder
_processedFolder = Path.Combine(inputFolder, "processed")
_errorFolder = Path.Combine(inputFolder, "errors")
' Create output directories if they don't exist
Directory.CreateDirectory(_processedFolder)
Directory.CreateDirectory(_errorFolder)
End Sub
' Main entry point - processes all PDFs in the input folder
Public Async Function ProcessInvoiceBatch() As Task(Of List(Of ProcessingResult))
Dim invoiceFiles As String() = Directory.GetFiles(_inputFolder, "*.pdf")
Console.WriteLine($"Found {invoiceFiles.Length} invoices to process")
Dim results As New List(Of ProcessingResult)()
For Each invoicePath As String In invoiceFiles
Dim fileName As String = Path.GetFileName(invoicePath)
Try
Console.WriteLine($"Processing: {fileName}")
' Extract data using smart processor (patterns first, then AI)
Dim invoiceData = Await _processor.ProcessAnyInvoice(invoicePath)
' Ensure we have minimum required fields before proceeding
If ValidateInvoiceData(invoiceData) Then
' Send to accounting system (QuickBooks, Xero, etc.)
Await SaveToAccountingSystem(invoiceData)
' Archive successful invoices
Dim destPath As String = Path.Combine(_processedFolder, fileName)
File.Move(invoicePath, destPath, overwrite:=True)
results.Add(New ProcessingResult With {
.FileName = fileName,
.Success = True,
.InvoiceNumber = invoiceData.InvoiceNumber
})
Console.WriteLine($"✓ Processed: {invoiceData.InvoiceNumber}")
Else
Throw New Exception("Validation failed - missing required fields")
End If
Catch ex As Exception
Console.WriteLine($"✗ Failed: {fileName} - {ex.Message}")
' Quarantine failed invoices for manual review
Dim destPath As String = Path.Combine(_errorFolder, fileName)
File.Move(invoicePath, destPath, overwrite:=True)
results.Add(New ProcessingResult With {
.FileName = fileName,
.Success = False,
.ErrorMessage = ex.Message
})
End Try
Next
GenerateReport(results)
Return results
End Function
' Checks for minimum required fields
Private Function ValidateInvoiceData(data As InvoiceData) As Boolean
Return Not String.IsNullOrEmpty(data.InvoiceNumber) AndAlso
Not String.IsNullOrEmpty(data.VendorName) AndAlso
data.TotalAmount > 0
End Function
' Placeholder for accounting system integration
Private Async Function SaveToAccountingSystem(data As InvoiceData) As Task
' Integrate with your accounting system here
' Examples: QuickBooks API, Xero API, SAP, or database storage
Console.WriteLine($" Saved invoice {data.InvoiceNumber} to accounting system")
Await Task.CompletedTask
End Function
' Outputs a summary of the batch processing results
Private Sub GenerateReport(results As List(Of ProcessingResult))
Dim successful As Integer = results.Count(Function(r) r.Success)
Dim failed As Integer = results.Count(Function(r) Not r.Success)
Console.WriteLine(vbCrLf & "========== Processing Complete ==========")
Console.WriteLine($"Total Processed: {results.Count}")
Console.WriteLine($"Successful: {successful}")
Console.WriteLine($"Failed: {failed}")
If failed > 0 Then
Console.WriteLine(vbCrLf & "Failed invoices requiring review:")
For Each failure In results.Where(Function(r) Not r.Success)
Console.WriteLine($" • {failure.FileName}: {failure.ErrorMessage}")
Next
End If
End Sub
End Class
Potok ten wdraża pełny przepływ pracy: skanuje folder w poszukiwaniu przychodzących plików PDF, przetwarza każdy z nich, waliduje wyodrębnione dane, przenosi udane ekstrakcje do systemu księgowego i poddaje kwarantannie niepowodzenia do ręcznego przeglądu. Raport podsumowujący daje wgląd w wyniki przetwarzania.
Jak zintegrować przetwarzanie faktur w C# z systemami księgowymi
Wyodrębnione dane faktur w końcu muszą przepłynąć do systemów księgowych w celu dokonania płatności i prowadzenia ewidencji. Specyfika zalezy od platformy, ale wzorce integracji są stałe.
Jakie sa powszechne wzorce integracji dla QuickBooks, Xero i SAP?
Większość platform księgowych oferuje REST API do programowego tworzenia rachunków lub faktur. Oto ogolny wzorzec, który możesz zaadaptować do swojej konkretnej platformy:
using System;
using System.Net.Http;
using System.Text;
using System.Text.Json;
using System.Threading.Tasks;
// Generic integration layer for pushing invoice data to accounting systems
// Adapt the API calls based on your specific platform
public class AccountingSystemIntegration
{
private readonly HttpClient _httpClient;
private readonly string _apiKey;
private readonly string _baseUrl;
public AccountingSystemIntegration(string apiKey, string baseUrl)
{
_apiKey = apiKey;
_baseUrl = baseUrl;
_httpClient = new HttpClient();
_httpClient.DefaultRequestHeaders.Add("Authorization", $"Bearer {_apiKey}");
}
// Creates a Bill in QuickBooks (vendor invoices are called "Bills")
public async Task SendToQuickBooks(InvoiceData invoice)
{
// QuickBooks Bill structure - see their API docs for full schema
var bill = new
{
VendorRef = new { name = invoice.VendorName },
TxnDate = invoice.InvoiceDate,
DocNumber = invoice.InvoiceNumber,
TotalAmt = invoice.TotalAmount,
Line = new[]
{
new
{
Amount = invoice.TotalAmount,
DetailType = "AccountBasedExpenseLineDetail",
AccountBasedExpenseLineDetail = new
{
AccountRef = new { name = "Accounts Payable" }
}
}
}
};
await PostToApi("/v3/company/{companyId}/bill", bill);
}
// Creates an accounts payable invoice in Xero
public async Task SendToXero(InvoiceData invoice)
{
// ACCPAY type indicates this is a bill to pay (not a sales invoice)
var bill = new
{
Type = "ACCPAY",
Contact = new { Name = invoice.VendorName },
Date = invoice.InvoiceDate,
InvoiceNumber = invoice.InvoiceNumber,
Total = invoice.TotalAmount
};
await PostToApi("/api.xro/2.0/Invoices", bill);
}
// Generic POST helper with error handling
private async Task PostToApi(string endpoint, object payload)
{
string json = JsonSerializer.Serialize(payload);
var content = new StringContent(json, Encoding.UTF8, "application/json");
var response = await _httpClient.PostAsync($"{_baseUrl}{endpoint}", content);
if (!response.IsSuccessStatusCode)
{
string error = await response.Content.ReadAsStringAsync();
throw new Exception($"API Error: {response.StatusCode} - {error}");
}
Console.WriteLine($"Successfully posted to {endpoint}");
}
}
using System;
using System.Net.Http;
using System.Text;
using System.Text.Json;
using System.Threading.Tasks;
// Generic integration layer for pushing invoice data to accounting systems
// Adapt the API calls based on your specific platform
public class AccountingSystemIntegration
{
private readonly HttpClient _httpClient;
private readonly string _apiKey;
private readonly string _baseUrl;
public AccountingSystemIntegration(string apiKey, string baseUrl)
{
_apiKey = apiKey;
_baseUrl = baseUrl;
_httpClient = new HttpClient();
_httpClient.DefaultRequestHeaders.Add("Authorization", $"Bearer {_apiKey}");
}
// Creates a Bill in QuickBooks (vendor invoices are called "Bills")
public async Task SendToQuickBooks(InvoiceData invoice)
{
// QuickBooks Bill structure - see their API docs for full schema
var bill = new
{
VendorRef = new { name = invoice.VendorName },
TxnDate = invoice.InvoiceDate,
DocNumber = invoice.InvoiceNumber,
TotalAmt = invoice.TotalAmount,
Line = new[]
{
new
{
Amount = invoice.TotalAmount,
DetailType = "AccountBasedExpenseLineDetail",
AccountBasedExpenseLineDetail = new
{
AccountRef = new { name = "Accounts Payable" }
}
}
}
};
await PostToApi("/v3/company/{companyId}/bill", bill);
}
// Creates an accounts payable invoice in Xero
public async Task SendToXero(InvoiceData invoice)
{
// ACCPAY type indicates this is a bill to pay (not a sales invoice)
var bill = new
{
Type = "ACCPAY",
Contact = new { Name = invoice.VendorName },
Date = invoice.InvoiceDate,
InvoiceNumber = invoice.InvoiceNumber,
Total = invoice.TotalAmount
};
await PostToApi("/api.xro/2.0/Invoices", bill);
}
// Generic POST helper with error handling
private async Task PostToApi(string endpoint, object payload)
{
string json = JsonSerializer.Serialize(payload);
var content = new StringContent(json, Encoding.UTF8, "application/json");
var response = await _httpClient.PostAsync($"{_baseUrl}{endpoint}", content);
if (!response.IsSuccessStatusCode)
{
string error = await response.Content.ReadAsStringAsync();
throw new Exception($"API Error: {response.StatusCode} - {error}");
}
Console.WriteLine($"Successfully posted to {endpoint}");
}
}
Imports System
Imports System.Net.Http
Imports System.Text
Imports System.Text.Json
Imports System.Threading.Tasks
' Generic integration layer for pushing invoice data to accounting systems
' Adapt the API calls based on your specific platform
Public Class AccountingSystemIntegration
Private ReadOnly _httpClient As HttpClient
Private ReadOnly _apiKey As String
Private ReadOnly _baseUrl As String
Public Sub New(apiKey As String, baseUrl As String)
_apiKey = apiKey
_baseUrl = baseUrl
_httpClient = New HttpClient()
_httpClient.DefaultRequestHeaders.Add("Authorization", $"Bearer {_apiKey}")
End Sub
' Creates a Bill in QuickBooks (vendor invoices are called "Bills")
Public Async Function SendToQuickBooks(invoice As InvoiceData) As Task
' QuickBooks Bill structure - see their API docs for full schema
Dim bill = New With {
.VendorRef = New With {.name = invoice.VendorName},
.TxnDate = invoice.InvoiceDate,
.DocNumber = invoice.InvoiceNumber,
.TotalAmt = invoice.TotalAmount,
.Line = New Object() {
New With {
.Amount = invoice.TotalAmount,
.DetailType = "AccountBasedExpenseLineDetail",
.AccountBasedExpenseLineDetail = New With {
.AccountRef = New With {.name = "Accounts Payable"}
}
}
}
}
Await PostToApi("/v3/company/{companyId}/bill", bill)
End Function
' Creates an accounts payable invoice in Xero
Public Async Function SendToXero(invoice As InvoiceData) As Task
' ACCPAY type indicates this is a bill to pay (not a sales invoice)
Dim bill = New With {
.Type = "ACCPAY",
.Contact = New With {.Name = invoice.VendorName},
.Date = invoice.InvoiceDate,
.InvoiceNumber = invoice.InvoiceNumber,
.Total = invoice.TotalAmount
}
Await PostToApi("/api.xro/2.0/Invoices", bill)
End Function
' Generic POST helper with error handling
Private Async Function PostToApi(endpoint As String, payload As Object) As Task
Dim json As String = JsonSerializer.Serialize(payload)
Dim content = New StringContent(json, Encoding.UTF8, "application/json")
Dim response = Await _httpClient.PostAsync($"{_baseUrl}{endpoint}", content)
If Not response.IsSuccessStatusCode Then
Dim error As String = Await response.Content.ReadAsStringAsync()
Throw New Exception($"API Error: {response.StatusCode} - {error}")
End If
Console.WriteLine($"Successfully posted to {endpoint}")
End Function
End Class
Każda platforma ma swój mechanizm uwierzytelniania (OAuth dla QuickBooks i Xero, rozne metody dla SAP), wymagane pola i konwencje API. Skonsultuj się z dokumentacją docelowej platformy po szczególy, ale wzorzec przekształcania wyodrębnionych danych fakturowych w ładunki API pozostaje stały.
Jak przetwarzać wsadowo setki faktur
Fakturowanie w dużej skali wymaga starannej uwagi na współbieżność i zarządzanie zasobami. Oto wzorzec wykorzystujący przetwarzanie równoległe z kontrolowaną współbieżnością:
using System;
using System.Collections.Concurrent;
using System.Collections.Generic;
using System.Linq;
using System.Threading;
using System.Threading.Tasks;
// Tracks the result of processing a single invoice in a batch
public class BatchResult
{
public string FilePath { get; set; }
public bool Success { get; set; }
public string InvoiceNumber { get; set; }
public string Error { get; set; }
}
// High-volume invoice processor with controlled parallelism
// Prevents overwhelming APIs while maximizing throughput
public class BatchInvoiceProcessor
{
private readonly SmartInvoiceProcessor _invoiceProcessor;
private readonly AccountingSystemIntegration _accountingIntegration;
private readonly int _maxConcurrency;
public BatchInvoiceProcessor(string aiApiKey, string accountingApiKey,
string accountingUrl, int maxConcurrency = 5)
{
_invoiceProcessor = new SmartInvoiceProcessor(aiApiKey);
_accountingIntegration = new AccountingSystemIntegration(accountingApiKey, accountingUrl);
_maxConcurrency = maxConcurrency; // Adjust based on API rate limits
}
// Processes multiple invoices in parallel with controlled concurrency
public async Task<List<BatchResult>> ProcessInvoiceBatch(List<string> invoicePaths)
{
// Thread-safe collection for gathering results from parallel tasks
var results = new ConcurrentBag<BatchResult>();
// Semaphore limits how many invoices process simultaneously
var semaphore = new SemaphoreSlim(_maxConcurrency);
// Create a task for each invoice
var tasks = invoicePaths.Select(async path =>
{
// Wait for a slot to become available
await semaphore.WaitAsync();
try
{
var result = await ProcessSingleInvoice(path);
results.Add(result);
}
finally
{
// Release slot for next invoice
semaphore.Release();
}
});
// Wait for all invoices to complete
await Task.WhenAll(tasks);
// Output summary statistics
var resultList = results.ToList();
int successful = resultList.Count(r => r.Success);
int failed = resultList.Count(r => !r.Success);
Console.WriteLine($"\nBatch Processing Complete:");
Console.WriteLine($" Total: {resultList.Count}");
Console.WriteLine($" Successful: {successful}");
Console.WriteLine($" Failed: {failed}");
return resultList;
}
// Processes one invoice: extract data and send to accounting system
private async Task<BatchResult> ProcessSingleInvoice(string pdfPath)
{
try
{
Console.WriteLine($"Processing: {pdfPath}");
var invoiceData = await _invoiceProcessor.ProcessAnyInvoice(pdfPath);
await _accountingIntegration.SendToQuickBooks(invoiceData);
Console.WriteLine($"✓ Completed: {invoiceData.InvoiceNumber}");
return new BatchResult
{
FilePath = pdfPath,
Success = true,
InvoiceNumber = invoiceData.InvoiceNumber
};
}
catch (Exception ex)
{
Console.WriteLine($"✗ Failed: {pdfPath}");
return new BatchResult
{
FilePath = pdfPath,
Success = false,
Error = ex.Message
};
}
}
}
using System;
using System.Collections.Concurrent;
using System.Collections.Generic;
using System.Linq;
using System.Threading;
using System.Threading.Tasks;
// Tracks the result of processing a single invoice in a batch
public class BatchResult
{
public string FilePath { get; set; }
public bool Success { get; set; }
public string InvoiceNumber { get; set; }
public string Error { get; set; }
}
// High-volume invoice processor with controlled parallelism
// Prevents overwhelming APIs while maximizing throughput
public class BatchInvoiceProcessor
{
private readonly SmartInvoiceProcessor _invoiceProcessor;
private readonly AccountingSystemIntegration _accountingIntegration;
private readonly int _maxConcurrency;
public BatchInvoiceProcessor(string aiApiKey, string accountingApiKey,
string accountingUrl, int maxConcurrency = 5)
{
_invoiceProcessor = new SmartInvoiceProcessor(aiApiKey);
_accountingIntegration = new AccountingSystemIntegration(accountingApiKey, accountingUrl);
_maxConcurrency = maxConcurrency; // Adjust based on API rate limits
}
// Processes multiple invoices in parallel with controlled concurrency
public async Task<List<BatchResult>> ProcessInvoiceBatch(List<string> invoicePaths)
{
// Thread-safe collection for gathering results from parallel tasks
var results = new ConcurrentBag<BatchResult>();
// Semaphore limits how many invoices process simultaneously
var semaphore = new SemaphoreSlim(_maxConcurrency);
// Create a task for each invoice
var tasks = invoicePaths.Select(async path =>
{
// Wait for a slot to become available
await semaphore.WaitAsync();
try
{
var result = await ProcessSingleInvoice(path);
results.Add(result);
}
finally
{
// Release slot for next invoice
semaphore.Release();
}
});
// Wait for all invoices to complete
await Task.WhenAll(tasks);
// Output summary statistics
var resultList = results.ToList();
int successful = resultList.Count(r => r.Success);
int failed = resultList.Count(r => !r.Success);
Console.WriteLine($"\nBatch Processing Complete:");
Console.WriteLine($" Total: {resultList.Count}");
Console.WriteLine($" Successful: {successful}");
Console.WriteLine($" Failed: {failed}");
return resultList;
}
// Processes one invoice: extract data and send to accounting system
private async Task<BatchResult> ProcessSingleInvoice(string pdfPath)
{
try
{
Console.WriteLine($"Processing: {pdfPath}");
var invoiceData = await _invoiceProcessor.ProcessAnyInvoice(pdfPath);
await _accountingIntegration.SendToQuickBooks(invoiceData);
Console.WriteLine($"✓ Completed: {invoiceData.InvoiceNumber}");
return new BatchResult
{
FilePath = pdfPath,
Success = true,
InvoiceNumber = invoiceData.InvoiceNumber
};
}
catch (Exception ex)
{
Console.WriteLine($"✗ Failed: {pdfPath}");
return new BatchResult
{
FilePath = pdfPath,
Success = false,
Error = ex.Message
};
}
}
}
Imports System
Imports System.Collections.Concurrent
Imports System.Collections.Generic
Imports System.Linq
Imports System.Threading
Imports System.Threading.Tasks
' Tracks the result of processing a single invoice in a batch
Public Class BatchResult
Public Property FilePath As String
Public Property Success As Boolean
Public Property InvoiceNumber As String
Public Property Error As String
End Class
' High-volume invoice processor with controlled parallelism
' Prevents overwhelming APIs while maximizing throughput
Public Class BatchInvoiceProcessor
Private ReadOnly _invoiceProcessor As SmartInvoiceProcessor
Private ReadOnly _accountingIntegration As AccountingSystemIntegration
Private ReadOnly _maxConcurrency As Integer
Public Sub New(aiApiKey As String, accountingApiKey As String, accountingUrl As String, Optional maxConcurrency As Integer = 5)
_invoiceProcessor = New SmartInvoiceProcessor(aiApiKey)
_accountingIntegration = New AccountingSystemIntegration(accountingApiKey, accountingUrl)
_maxConcurrency = maxConcurrency ' Adjust based on API rate limits
End Sub
' Processes multiple invoices in parallel with controlled concurrency
Public Async Function ProcessInvoiceBatch(invoicePaths As List(Of String)) As Task(Of List(Of BatchResult))
' Thread-safe collection for gathering results from parallel tasks
Dim results = New ConcurrentBag(Of BatchResult)()
' Semaphore limits how many invoices process simultaneously
Dim semaphore = New SemaphoreSlim(_maxConcurrency)
' Create a task for each invoice
Dim tasks = invoicePaths.Select(Async Function(path)
' Wait for a slot to become available
Await semaphore.WaitAsync()
Try
Dim result = Await ProcessSingleInvoice(path)
results.Add(result)
Finally
' Release slot for next invoice
semaphore.Release()
End Try
End Function)
' Wait for all invoices to complete
Await Task.WhenAll(tasks)
' Output summary statistics
Dim resultList = results.ToList()
Dim successful = resultList.Count(Function(r) r.Success)
Dim failed = resultList.Count(Function(r) Not r.Success)
Console.WriteLine(vbCrLf & "Batch Processing Complete:")
Console.WriteLine($" Total: {resultList.Count}")
Console.WriteLine($" Successful: {successful}")
Console.WriteLine($" Failed: {failed}")
Return resultList
End Function
' Processes one invoice: extract data and send to accounting system
Private Async Function ProcessSingleInvoice(pdfPath As String) As Task(Of BatchResult)
Try
Console.WriteLine($"Processing: {pdfPath}")
Dim invoiceData = Await _invoiceProcessor.ProcessAnyInvoice(pdfPath)
Await _accountingIntegration.SendToQuickBooks(invoiceData)
Console.WriteLine($"✓ Completed: {invoiceData.InvoiceNumber}")
Return New BatchResult With {
.FilePath = pdfPath,
.Success = True,
.InvoiceNumber = invoiceData.InvoiceNumber
}
Catch ex As Exception
Console.WriteLine($"✗ Failed: {pdfPath}")
Return New BatchResult With {
.FilePath = pdfPath,
.Success = False,
.Error = ex.Message
}
End Try
End Function
End Class
SemaphoreSlim zapewnia, że nie przeciążysz zewnętrznych API ani nie wyczerpisz zasobów systemowych. Dopasuj _maxConcurrency na podstawie limitów wywołań API oraz pojemności serwera. ConcurrentBag bezpiecznie zbiera wyniki z operacji równoległych.
Kolejne kroki
Automatyzacja fakturowania reprezentuje znaczna okazje na redukcje recznej pracy, minimalizowanie bledow i przyspieszanie procesow biznesowych. Ten przewodnik przprowadził cie przez cały cykl zycia: generowanie profesjonalnych faktur z szablonow HTML, spełnianie wymogow ZUGFeRD i Factur-X fakturowania elektronicznego, wyodrębniając dane z otrzymanych faktur z uzyciem dopasowania wzorca i przetwarzanie zasilane przez AI, oraz budowanie skalowalnych potokow automatyzacji.
IronPDF służy jako podstawa dla tych możliwości, dostarczając solidne renderowanie HTML do PDF, niezawodną ekstrakcję tekstu i funkcje załączników potrzebne do zgodności z pdf/A-3. Jego silnik renderowania oparty na Chrome zapewnia, że faktury wyglądają dokładnie tak, jak zostały zaprojektowane, podczas gdy metody ekstrakcji automatycznie radzą sobie z złożonością kodowania tekstu w PDF.
Pokazane tutaj wzorce są punktami startowymi. Rzeczywiste wdrożenia będą wymagały dostosowania do twoich konkretnych formatów faktur, systemów księgowych i zasad biznesowych. Dla scenariuszy o dużej skali, samouczek przetwarzania wsadowego obejmuje wykonanie równoległe z kontrolowaną współbieżnością i odzyskiwaniem błędów.
Czy jesteś gotowy do rozpoczęcia budowy? Pobierz IronPDF i wypróbuj z bezpłatną wersją próbną. Biblioteka zawiera darmową licencję rozwojową, więc możesz w pełni ocenić możliwości generowania faktur, ekstrakcji danych i tworzenia raportów PDF przed zobowiązaniem się do licencji produkcyjnej. Jeśli masz pytania dotyczące automatyzacji faktur lub integracji systemu księgowego, skontaktuj się z naszym zespołem wsparcia technicznego.
Często Zadawane Pytania
What is IronPDF used for in C# invoice processing?
IronPDF is used in C# invoice processing to generate professional PDF invoices, extract structured data, and automate invoice workflows while ensuring compliance with standards like ZUGFeRD and Factur-X.
How can I generate a PDF invoice using IronPDF in C#?
You can generate a PDF invoice using IronPDF in C# by leveraging its API to create and customize PDF documents programmatically. This includes adding elements such as text, tables, and images that constitute an invoice.
What are ZUGFeRD and Factur-X, and how does IronPDF support them?
ZUGFeRD and Factur-X are e-invoicing standards that ensure invoices are both human-readable and machine-readable. IronPDF supports these standards by allowing you to generate PDF invoices that comply with these specifications.
How can IronPDF help in automating accounts payable processes?
IronPDF can automate accounts payable processes by extracting structured data from invoices and integrating with automation pipelines, reducing manual data entry and improving efficiency.
Can IronPDF extract data from existing PDF invoices?
Yes, IronPDF can extract structured data from existing PDF invoices, making it easier to process and analyze invoice information automatically.
What are the benefits of using IronPDF for invoice processing in C#?
The benefits of using IronPDF for invoice processing in C# include streamlined invoice generation, compliance with international invoicing standards, efficient data extraction, and enhanced automation capabilities.
Is it possible to customize the appearance of a PDF invoice with IronPDF?
Yes, IronPDF allows you to customize the appearance of a PDF invoice by adding various design elements such as logos, text formatting, and layout adjustments to meet branding requirements.
What are the typical steps to automate invoice processing using IronPDF?
To automate invoice processing using IronPDF, you typically generate the invoice, extract necessary data, and integrate with other systems or automation tools to streamline workflows.
How does IronPDF handle different invoice formats?
IronPDF can handle various invoice formats by providing tools to generate, manipulate, and read PDF documents, ensuring compatibility with common e-invoicing standards.

