Merge PDF to HTML via Java

Merge PDF documents into single formats HTML. Use Aspose.PDF for Java to modify files programmatically

Merge PDF to HTML Using Java

How to merge PDF to HTML? With Aspose.PDF for Java library you can easily merge PDF to HTML programmatically. PDF software from Aspose is ideal for individuals, and small or large businesses. Since it is able to process a large amount of information, perform the concatenation quickly and efficiently and protect your data. A peculiar feature from Aspose.PDF is an API for merging PDF to HTML.

You can download its latest version directly from Maven and install it within your Maven-based project by adding the following configurations to the pom.xml.

Check the details of Installing the Library on the Documentation pages. To verify the benefits of the library, try using the conversion PDF to HTML code snippet.

Repository

<repository>
    <id>AsposeJavaAPI</id>
    <name>Aspose Java AP</name>
    <url>https://releases.aspose.com/java/repo/</url>
</repository>

Dependency

<dependency>
<groupId>com.aspose</groupId>
<artifactId>aspose-pdf</artifactId>
<version>version of aspose-pdf API</version>
</dependency>

How to merge PDF to HTML via Java


Java developers can easily load & merge PDF files to HTML in just a few lines of code.

  1. Initialize a new Document, and run a loop for merging files
  2. In loop: add a new page to HTML document
  3. In loop: add PDF file to new page
  4. After the loop save the result

Here is an example that demonstrates how to merge PDF to HTML in Java. Combine multiple documents into a single file with ease. If you are developing code in Java, this task can be simpler than it sounds. You can use fully qualified filenames for both PDF reading and HTML writing. Check out this Java example that show how to merge multiple documents of either the same or different file types into one file using Java

Merge PDF files using Java and save as HTML

Example Java: this sample code shows PDF to HTML concatenation

Input file:

File not added

File not added

Output format:

HTML

Output file:


	// create empty pdf document
	outputDoc = new com.aspose.pdf.Document();

	// read pdf file to Aspose Document
	firstDoc = new com.aspose.pdf.Document("1.pdf");
	secondDoc = new com.aspose.pdf.Document("2.pdf");

	// add page from one document to another directly
	for (com.aspose.pdf.Page page : firstDoc.getPages())
		outputDoc.getPages().add(page);
	for (com.aspose.pdf.Page page : secondDoc.getPages())
		outputDoc.getPages().add(page);

	// save result html to file
	// embed css into a page
	// embed images into a page
	// enhance conversion of documents with backgrounds
	// use fixed layout render
	opt1 = new com.aspose.pdf.HtmlSaveOptions();
	opt1.setPartsEmbeddingMode(com.aspose.pdf.HtmlSaveOptions.PartsEmbeddingModes.EmbedAllIntoHtml);
	opt1.setRasterImagesSavingMode(com.aspose.pdf.HtmlSaveOptions.RasterImagesSavingModes.AsEmbeddedPartsOfPngPageBackground);
	opt1.setAntialiasingProcessing(com.aspose.pdf.HtmlSaveOptions.AntialiasingProcessingType.TryCorrectResultHtml);
	opt1.setFixedLayout(true);
	outputDoc.save("Merger_pdf_html.html", opt1);

Java library to combine PDF to HTML

Aspose.PDF for Java API is a library that enables developers to add PDF processing capabilities to their applications. It can be used to build any type of 32-bit and 64-bit applications to generate or read, convert and manipulate PDF files without the use of Adobe Acrobat. Aspose.PDF for Java allows developers to insert tables, graphs, images, hyperlinks, custom fonts - and more - into PDF documents. Moreover, it is also possible to compress PDF. Aspose.PDF for Java provides excellent security features to develop secure PDF files.

You can find more information about Aspose.PDF for Java API on documentation and examples on how to use API. Some of the key features of Aspose.PDF for Java API include support for various file formats including HTML, XFA, TXT, PCL, XML, XPS and image file formats, support for various PDF versions, and extensive hyperlink functionality.