Merge PDF to HTML via C#

Merge PDF documents into single formats HTML. Use Aspose.PDF for .NET to modify files programmatically

Merge PDF to HTML Using C#

How to merge PDF to HTML? With Aspose.PDF for .NET library you can easily merge PDF to HTML programmatically. PDF software from Aspose is ideal for individuals, and small or large businesses. Since it is able to process a large amount of information, perform the concatenation quickly and efficiently and protect your data. A peculiar feature from Aspose.PDF is an API for merging PDF to HTML.

The trait of this approach is that you only need to open the NuGet package manager, search for ‘Aspose.PDF for .NET’, and install it without any special complex settings. (Use the command from the Package Manager Console for installing). In order to convert PDF to HTML, we’ll use Aspose.PDF for .NET API which is a feature-rich, powerful, and easy-to-use conversion API for .NET platform. Check the details of Installing the Library on the Documentation pages. To verify the benefits of the library, try using the conversion PDF to HTML code snippet.

Package Manager Console

PM > Install-Package Aspose.PDF

How to merge PDF to HTML via C#


.NET developers can easily load & merge PDF files to HTML in just a few lines of code.

  1. Initialize a new Document, and run a loop for merging files
  2. In loop: add a new page to HTML document
  3. In loop: add PDF file to new page
  4. After the loop save the result

Here is an example that demonstrates how to merge PDF to HTML in C#. Combine multiple documents into a single file with ease. If you are developing code in C#, this task can be simpler than it sounds. You can use fully qualified filenames for both PDF reading and HTML writing. Check out this C# example that show how to merge multiple documents of either the same or different file types into one file using C#

Merge PDF files using .NET and save as HTML

Example C#: this sample code shows PDF to HTML concatenation

Input file:

File not added

File not added

Output format:

HTML

Output file:


	// create empty pdf document
	outputDoc = new Aspose.Pdf.Document();

	// read pdf file to Aspose Document
	firstDoc = new Aspose.Pdf.Document("1.pdf");
	secondDoc = new Aspose.Pdf.Document("2.pdf");

	// add page from one document to another directly
	foreach (var page in firstDoc.Pages)
		outputDoc.Pages.Add(page);
	foreach (var page in secondDoc.Pages)
		outputDoc.Pages.Add(page);


	// save pdf to html page
	// embed css into a page
	// embed images into a page
	// enhance conversion of documents with backgrounds
	// use fixed layout render
	var opt1 = new Aspose.Pdf.HtmlSaveOptions
	{
		PartsEmbeddingMode = Aspose.Pdf.HtmlSaveOptions.PartsEmbeddingModes.EmbedAllIntoHtml,
		RasterImagesSavingMode = Aspose.Pdf.HtmlSaveOptions.RasterImagesSavingModes.AsEmbeddedPartsOfPngPageBackground,
		AntialiasingProcessing = Aspose.Pdf.HtmlSaveOptions.AntialiasingProcessingType.TryCorrectResultHtml,
		FixedLayout = true
	};
	outputDoc.Save("Merger_pdf_html.html",opt1);

.NET library to combine PDF to HTML

Aspose.PDF for C# API provides a wide range of features for working with PDF files. Some of the features include:

  • Create PDF documents from scratch or from HTML, XML, or images.
  • Edit existing PDF documents by adding or removing pages, text, images, and other content.
  • Convert PDF documents to other formats such as HTML, XML, and images.
  • Render PDF documents to images or XPS format.
  • Print PDF documents directly from your application.
  • Digitally sign PDF documents.

You can find more information on Aspose.PDF for C# API in this Aspose documentation