Transform PCL to MHTML via Java

On Premise Java API to Render PCL to MHTML without using any third party application

 

You can convert PCL to MHTML by using two simple steps. First you need to render PCL file to DOC using Aspose.PDF for Java . After that, by using powerful Document Processing API Aspose.Words for Java , you can convert DOC to MHTML. Both APIs come under Aspose.Total for Java package.

Java API to Convert PCL to MHTML

  1. Open PCL file using Document class
  2. Convert PCL to DOC by using save method
  3. Load DOC file by using Document class of Aspose.Words
  4. Save the document to MHTML format using save method and set MHTML as SaveFormat

Get Started with Java File Manipulation APIs

You can easily use Aspose.Total for Java directly from a Maven based project and include Aspose.PDF for Java and Aspose.Words for Java in your pom.xml.

Alternatively, you can get a ZIP file from downloads .

// load PCL file with an instance of Document class
Document document = new Document("template.pcl");
// save PCL as a DOC 
document.save("DocOutput.doc", SaveFormat.DOC); 
// load DOC with an instance of Document
Document outputDocument = new com.aspose.words.Document("DocOutput.doc");
// call save method while passing SaveFormat.MHTML
outputDocument.save("output.mhtml", SaveFormat.MHTML);   

Open Password Protected PCL Document via Java

While converting PCL to MHTML, even if your document is password protected, you can still open it using PDF Manipulation API Aspose.PDF for Java . In order to open the encrypted file, you need to create a Document object and open the PCL using the owner’s password.

Document document = new Document("input.pcl", "password");
// save PCL as a DOC 
document.save("DocOutput.doc", SaveFormat.DOC);

Save MHTML Document to a Database via Java

While saving your input document to MHTML file format, you can also save your document to database instead of a file system. You may need to implement storing and retrieving Document objects to and from a database. This would be necessary if you were implementing any type of content management system. In order to save your MHTML to database it is often necessary to serialize the document to obtain a byte array. This can be done using Aspose.Words for Java API. After getting your byte array, you can store it in the database using SQL statement.

public static void StoreToDatabase(Document doc, Connection mConnection) throws Exception {
    // create an output stream which uses byte array to save data
    ByteArrayOutputStream aout = new ByteArrayOutputStream();
    // save the document to byte array
    doc.save(aout, SaveFormat.MHTML);
    // get the byte array from output steam
    // the byte array now contains the document
    byte[] buffer = aout.toByteArray();
    // get the filename from the document.
    String fileName = doc.getOriginalFileName();
    String filePath = fileName.replace("\\", "\\\\");
    // create the SQL command.
    String commandString = "INSERT INTO Documents (FileName, FileContent) VALUES('" + filePath + "', '" + buffer + "')";
    Statement statement = mConnection.createStatement();
    statement.executeUpdate(commandString);
}  

Other Conversion Options

PCL TO DOCM (Microsoft Word 2007 Marco File)
PCL TO OTT (OpenDocument Standard Format)
PCL TO RTF (Rich Text Format)
PCL TO PS (PostScript File)
PCL TO WORDML (Microsoft Word 2003 WordprocessingML)
PCL TO DOT (Microsoft Word Template Files)
PCL TO ODT (OpenDocument Text File Format)
PCL TO DOTM (Microsoft Word 2007+ Template File)
PCL TO DOTX (Microsoft Word Template File )

PCL What is PCL File Format?

PCL stands for Printer Command Language which is a Page Description Language introduced by Hewlett Packard (HP). HP created PCL to provide an efficient way for controlling printer features across many different printing devices. The format was originally developed for HP’s dot matrix and Inkjet printers, but has been part of various thermal, matrix and page printers with the passage of time. The format underwent several different revisions, resulting in different versions where each version was enhanced to meet the demands of time with respect to the printer control features. Today, PCL is the most widely spread printer language in the laster printer market.

Read More

MHTML What is MHTML File Format?

Files with MHTML extension represent a web page archive format that can be created by a number of different applications. The format is known as archive format because it saves the web HTML code and associated resources in a single file. These resources include anything linked to the webpage such as images, applets, animations, audio files and so on. MHTML files can be opened in a variety of applications such as Internet Explorer and Microsoft Word. Microsoft Windows uses MHTML file format for recording scenarios of problems observed during the usage of any application on Windows that raises issues. The MHTML file format encodes the page contents similar to specifications defined in message/rfc822 which is plain text email related specifications. The actual specifications of the format are as detailed by RFC 2557.

Read More